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PREFACE TO THE THIRD EDITION 


I N the preparation of this new edition of Volume I the text of the 
corresponding volume published in 1921 has been subjected to careful 
revision. Various sections have been rewritten; among these may be 
mentioned the sections on the Riemann-Stieltjes integral. A considerable 
amount of new matter has been added; and thus the volume has been 
lengthened by upwards of sixty pages. The additions have been made 
without changing the numeration of the sections; so that the references 
in Volume 11 (1926) to sections in Volume I are applicable to this new 
edition. 

The parts of the subject which were dealt with in the first five chapters 
of the first edition, published in 1907, are represented in the eight chapters 
of the first volume of this new edition. With a view to greater unity of 
treatment of the Theory of Integration, some theorems which appeared 
in Chapter vi, of the first edition, have however been included in the 
present volume. A considerable part of the Theory of Integration, in 
relation to series and sequences, is dealt with in Volume II, published 
in 1926. 

On controversial matters connected with the fundamentals of the 
Theory of Aggregates, account has been taken of the considerable diversity 
of opinion amongst Mathematicians, which still exists, but in general no 
attempt has been made to give dogmatic decisions between opposed views. 
In view of the delicate questions which arise as to the legitimacy and 
meaning of the axiom known as the Multiplicative Axiom, or as the 
Principle of Zermelo, the policy has been adopted of so framing the proofs 
of theorems as to avoid an appeal to the axiom, whenever that course 
appeared to be possible; in other cases, the necessity for the employment 
of the axiom has been pointed out. 

Ample references to sources of information are given throughout, but 
such references do not provide the means for compiling a complete list of 
writings on the subject. No attempt has been made to settle questions 
of priority of discovery, but in this new edition the dates of publication 
of writings referred to have been inserted. 

The corrections and additions given at the end of Volume ii, pub¬ 
lished in 1926, have been embodied in this new edition of Volume I. The 
opportunity has been taken of giving at the end of the volume some 
corrections and additions to Volume n. 

E. W. HOBSON 

Christ’s College, Cambridge. 

May 11, 1927. 



PREFACE TO THE FIRST EDITION 


T HE theory of functions of a real variable, as developed during the 
last few decades, is a body of doctrine resting, first upon a definite 
conception of the arithmetic continuum which forms the field of the 
variable, and which includes a precise arithmetic theory of the nature of 
a limit, and secondly, upon a definite conception of the nature of the 
functional relation. The procedure of the theory consists largely in the 
development, based upon precise definitions, of a classification of functions, 
according as they possess, or do not possess, certain peculiarities, such as 
continuity, differentiability, &e., throughout the domain of the variable, 
or at points forming a selected set contained in that domain. The detailed 
consequences of the presence, or of the absence, of such peculiarities are 
then traced out, and are applied for the purpose of obtaining conditions 
for the validity of the processes of Mathematical Analysis. These pro¬ 
cesses, which have been long employed in the so-called Infinitesimal 
Calculus, consist essentially in the ascertainment of the existence, and in 
the evaluation, of limits, and are subject, in every case, to restrictive 
assumptions which are necessary conditions of their validity. The object 
to be attained by the theory of functions of a real variable consists then 
largely in the precise formulation of necessary and sufficient conditions 
for the validity of the limiting processes of Analysis. A necessary requisite 
in such formulation is a language descriptive of particular aggregates of 
values of the variable, in relation to which functions possess definite pecu¬ 
liarities. This language is provided by the Theory of Sets of Points, also 
known, in its more general aspects, as the Theory of Aggregates, which 
contains an analysis of the peculiarities of structure and of distribution 
in the field of the variable which such sets of points may possess. This 
theory, which had its origin in the exigencies of a critical theory of func¬ 
tions, and has since received wide applications, not only in Pure Analysis, 
but also in Geometry, must be regarded as an integral part of the subject. 
A most important part of the theory of functions is the theory of the 
representation of functions in a prescribed manner, especially by means 
of series or sequences of functions of prescribed types. Much progress has 
recently been made in this part of the subject, results having been obtained 
which have led to a classification of functions in accordance with the 
modes of representation of which they are capable. The special case of 
the conditions of representability of functions by means of trigonometrical 
series was historically the starting-point in which a great part of the 
modem development of the theory of functions of a real variable had its 
origin. 
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The course of study, of which the present treatise is the outcome, 
followed an order very similar to the historical order in which the subject 
was developed. Commencing with the study of Fourier’s series, in their 
application to the problems of Mathematical Physics, and provided with 
a knowledge of the Differential and Integral Calculus, of the traditional 
kind in which notions of the nature of continuity and of limits, founded 
on an uncritical use of intuitions of space and time, arc the stock in trade, 
I was kid, by the difficulties connected with the theory of t hese series, and 
through an attempt to understand the literature which deals with them, 
to a study of the theories of real number, due to Cantor and Dedekind, 
and to that of the theory of sets of points. A study of the foundations of 
the Integral Calculus, and of the general theory of functions of a real 
variable, formed the natural continuation of the course. The present work 
has been written with the object of presenting in a connected form, and 
of thus rendering more easily accessible than hitherto, the chief results 
which are to be found scattered through a very large number of memoirs, 
periodicals, and treatises. I have endeavoured, as far as possible, to fill 
up gaps in the various theories which occur in different parts of the subject. 
The proofs of theorems have in many cases been simplified, often in 
accordance with developments of the theory later in date than the original 
proofs; other theorems have been given in a form more general than that 
in which they were first discovered. In the literature of the subject, errors 
are not infrequent, largely owing to the fact that spatial intuition affords 
an inadequate corrective of the theories involved, and is indeed in some 
cases almost misleading. Although 1 have made every endeavour to attain 
to accuracy both in form and in substance, it is practically certain that 
the present work will form no exception to the rule of fallibility. Where 
I have called attention to what I regard as inadequate statements or errors 
on the part of other writers, I have done so solely for the purpose of 
directing the attention of students to the points in question, and with 
full consciousness that, at least in some cases, close examination might 
shew that what appeared to me to be erroneous was rather due to some 
misapprehension on my part of the meaning of the writers to whom 
reference is made. On some points connected with the theory of aggre¬ 
gates, which are at present matters of controversy, I have expressed 
definite opinions, although I fully recognize that, on such matters, a 
dogmatic attitude of mind is at the present time wholly out of place, and 
not unlikely to be avenged when the points concerned are finally settled 
to the general satisfaction of mathematicians. 

Chapter i contains a discussion of Number, and includes a full account 
of the theories of Real Number, due to Cantor and Dedekind. Whilst an 
indication has been given of the fundamental notions upon which the con¬ 
ceptions of cardinal and ordinal numbers rest, I have not attempted to 
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reduce these fundamental notions to a minimum of indefinables from which 
the whole theory might be deduced by means of formal logic. A slight 
perusal of the extremely extensive literature of the Philosophy of Arith¬ 
metic will shew that any such attempt could only have been made by 
entering upon a prolonged discussion of a philosophical character, wholly 
unsuited to a treatise of professedly mathematical complexion, and that 
any views expressed would have but little prospect of giving general satis¬ 
faction to logicians and philosophers. The modern theory of Real Numbers 
has been the subject of much criticism by philosophers and others. It has 
been represented that the modern extension of the notion of number to 
the case of irrational numbers is a sophistical attempt to obliterate the 
fundamental distinction between the discrete and the continuous. I ven¬ 
ture to think that such objections consist, in large part at least, of criticisms 
of the current terminology of the mathematical theories, especially in 
respect of the extensions of the use of the word “number,” and I think it 
probable that many of these criticisms would not survive a fair examina¬ 
tion of the theories themselves apart from the language in which they are 
expressed. An appropriate terminology, although a matter of convention, 
is no doubt a very important matter in relation to such fundamental 
matters, as it is conducive to clearness of thought; but the substance of 
the theories is of incomparably greater importance than the forms in which 
they are expressed, and those theories may be found on examination to 
be essentially sound, even if their terminology be regarded as in some 
recpects defective. 

Chapter 11 contains an exposition of the theory of sets of points, and 
includes an account of transfinite cardinal and ordinal Arithmetic, of a 
somewhat simpler and less general character than will be met with in the 
treatment of the general theory of aggregates, in Chapter nr. Students 
who do not care to embark upon the discussions in Chapter in will find a 
study of Chapter n amply sufficient to enable them to apply the ideas 
there developed in the general theory of functions. A slight account only 
has been given of the properties of plane sets of points. An account of the 
important recent investigations which had their origin in Jordan’s theorem, 
that a closed curve divides plane space into two regions, would have 
occupied more space than was at my disposal. This omission will be less 
felt than might have been the case, were not an excellent account of this 
subject to be found in Dr W. H. Young’s treatise on the theory of sets of 
points, which has appeared since this portion of the present work was 
printed. 

In Chapter iv, there will be found a discussion of the main properties 
of functions, in relation to continuity, discontinuity, &c., and investiga¬ 
tions of the properties of important classes of functions. Although the 
treatise is mainly one of functions of a single variable, a considerable 
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amount of space has been devoted to the consideration of functions of two 
variables, not only on account of the intrinsic importance of that subject, 
but because no adequate consideration of the properties of functions of a 
single variable is possible without the use of functions of two variables, 
as is seen, for example, from the consideration that a function defined by 
means of a sequence of functions of a single variable is virtually defined 
as a limit of a function of two variables. 

The foundations of the Integral Calculus, as based upon Riemann’s 
definition of a definite integral, and its extensions, are discussed in 
Chapter v, where an account of the development of the subject from the 
point of view of Lebesgue’s new definition of the definite Integral is also 
given. In later parts of the book I have introduced extensions of Lebesgue’s 
definition, to the cases of improper integrals, taken over finite or infinite 
domains, regarded as the limits of sequences of Lebesgue integrals. 

Chapter vi is concerned with functions defined as the limits of se¬ 
quences of functions, and contains an account of the principal properties 
of functions represented by series, and a discussion of important matters 
connected with the modes of convergence of series through whole intervals, 
or in the neighbourhood of particular points. Various matters relating to 
the processes of the Integral Calculus, which had not been considered in 
Chapter v, are here dealt with, because their adequate treatment pre¬ 
supposes a knowledge of the theorems relating to the convergence of 
sequences of functions. An account of the very general results recently 
obtained by Baire, relating to the represent ability of functions by means 
of series, will be found in this Chapter. 

Chapter vn is devoted to the theory of Fourier’s series. No apology 
is needed for the selection of this particular mode of representation of 
functions for full discussion in a treatise on the theory of functions of a real 
variable, in view of the historical relation of Fourier’s series to the de¬ 
velopment of the general theory. The history of the theory of Fourier’s 
series is exceedingly instructive, not merely from the point of view of the 
mathematician, but also from that of the epistemologist. I have therefore 
endeavoured, in my treatment of the subject, to preserve as much of the 
historical element as was possible in an account which should contain, in 
a moderate compass, not only indications of the various stages of develop¬ 
ment of the subject, but also the most recent results that have been 
obtained. I have made full use of the greater generality which can be 
introduced into many of the known results by means of the employment 
of the theory of integration developed by Lebesgue. 

In the preparation of the work, the treatises from which I have most 
largely drawn information are the German edition of Dini’s treatise on 
the subject, Stolz’s Grundzilge der Differential - und Integral-Rechnung, 
Schoenflies’ Bericht entitled Die Entwickelung der Lehre von den Punkt - 
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mannigfaltigkeiten, and the various treatises on different parts of the 
subject by Borel and Lebesgue. I have consulted a very large number of 
memoirs, articles, notes, and books, far too numerous to be here parti¬ 
cularized. In respect to the references given throughout the book, I wish 
it to be understood that I have made no attempt to settle questions of 
priority of discovery. The references given are to be regarded solely as 
indicating sources of information from which 1 have drawn, or where 
more detailed information on the various topics is to be found. 

I owe a debt of gratitude to my friend Mr J. W. Sharpe, formerly 
Fellow of Gonville and Caius College, who has read with the greatest care 
the proofs of about two-thirds of the book. Many points of difficulty I 
have fully discussed with him; many obscurities of expression have been 
removed, and many improvements in substance have been made, owing 
to the care he has bestowed in reading the proofs. I felt it as a great loss 
when, owing to a temporary failure of health, he was unable to continue his 
laborious work. To Dr H. F. Baker, F.R.S., Fellow of St John’s College, 
and Cayley Lecturer in Mathematics, who has kindly read some of the 
earlier proofs, I owe several valuable suggestions. On several points con¬ 
nected with the treatment of Number in Chapter i, I have had the ad¬ 
vantage of consulting J)r James Ward, F.B.A., Fellow of Trinity College, 
and Professor of Mental Philosophy and Logic in the University. 

My thanks are due to the officials of the University Press for the 
readiness with which they have met my views, and for the care which 
they have bestowed upon the work connected with the printing. I desire 
especially to express my sense of the value of the excellent work done by 
the readers of the Press; to their care is due the elimination of many 
typographical and other blemishes which would otherwise have remained 
undetected. 


Christ’s College, Cambridge. 
May J5, 1907. 


E. W. HOBSON 
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CHAPTER I 

NUMBER 

1. The operation of counting, in which the integral numbers are em¬ 
ployed, can be carried out by a mind to which discrete objects, which may 
be either physical or ideal*, are presented, and which possesses certain 
fundamental notions which we proceed to specify. 

(1) The notion of unity , a form under which an object is conceived 
when it is regarded as a single one. An object so regarded may be either 
of a material or of a purely abstract or ideal nature, and may be recognized, 
for all other purposes than that of counting, as possessing any degree 
of complexity. It is sufficient, in order that the object may be regarded 
under the form of unity, that it be so far distinct from other objects, as 
to be recognized at the time when it is counted, as discrete and identifiable. 
What external marks are necessary that an object may be so recognized 
as discrete, is a matter for the judgment of the mind at the time when the 
object is counted. The unity under which the object is apprehended is a 
formal or logical, rather than a natural unity; it is more or less arbitrarily 
attributed to the object by the mind. 

(2) The notion of a collection or aggregate of objects, which is conceived 
of as containing more or fewer objects, or as possessing a greater or less 
degree of plurality. A group of objects regarded as an aggregate is con¬ 
ceived of, not merely as a plurality of objects to each of which unity is 
ascribed as in (1), but also as itself an object to which unity is ascribed 
when it is regarded as a single whole. The single objects of which the 
aggregate is composed may be spoken of as the elements of the aggregate; 
such elements need not possess any parity as regards size or any other 
special quality, but may be of the most diverse characters: a certain 
logical parity is however ascribed to them in the process of counting, in 
virtue of the fact that each of them is regarded as a single object. A 
sensibly continuous presentation cannot be regarded as an aggregate 
containing a plurality of elements, until the mind has recognized in it 
sufficiently distinct lines of division to serve the purpose of marking off 
distinct objects within it, the totality of which makes up the whole 
presentation ; for instance, the history of a country could be regarded as 

* It is held by some authors that the operation of counting is primarily applicable to physical 
objects only. Thus, J. S; Mill writes:—“The fact asserted in the definition of a number is a 
physical fact. Each of the numbers, two, three, four, etc., denotes physical phenomena, and con¬ 
notes a physical property of these phenomena.” See Logic , 9th edition, vol. n, p. 150. That 
objects which are not physical, can be counted, was maintained by Leibnitz and by Locke. See 
also Frege’s QrundXagm der Arithmctik , Breslau, 1884, where an account is given of various views 
as to the nature and origin of the idea of Number. 
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an aggregate of distinct periods, only when sufficient salient features had 
been recognized in that history to warrant a judgment that periods were 
to be found in it, each of which had a sufficient degree of discreteness to 
be subsumed under the form of unity. In actual counting, the aggregate 
is not necessarily determinate before the counting is commenced, but 
becomes so when the process is completed; the notion of an aggregate is 
thus still necessary to the process of counting, if the process is ever to 
come to an end, or to be conceived of as having come to an end. 

It has been held* that, when an aggregate is counted, the elements 
must remain distinct from one another, not disappearing or combining 
with each other during the process. That this condition is unnecessary 
may be seen, for example, by considering the case of counting breakers 
on the sea-shore, or that of counting the vibrations of a pendulum; thus 
no physical permanence, but only an ideal one, is necessary. 

A discussion of the characteristics which an aggregate (not necessarily 
finite) must possess, in order that it may be an object of mathematical 
thought, will be given in Chapter tv. 

(3) The notion of order , in virtue of which relative rank is given to 
each object in a collection, so that the collection becomes an ordered 
aggregate. In actual counting, the order is assigned to the objects during 
the process itself, as an order in time, and this may be done in an arbitrary 
manner; the order of the elements in an aggregate may, however, be 
assigned in a manner dependent upon their sizes ; weights, or other qualities, 
or in accordance with their positions in space. Order may, however, be 
regarded as an abstract conception, independent of a particular mode of 
ordering; for an aggregate to be an ordered one, it is necessary that in 
some manner or other, each element be recognized as possessing a certain 
rank, in virtue of which it is known as regards any two elements which 
may be chosen, which of them has the lower, and which the higher rank. 
An element is said to precede any other element of higher rank than itself. 

(4) The notion of correspondence , which underlies the process of tallying. 
The elements of one aggregate may be made to stand in some logical 
relation with those of another one, so that a definite element of one 
aggregate is regarded as correspondent to a definite element of another 
aggregate. 

The correspondence may be complete, in the sense that, to every 
element of either aggregate there corresponds one element, and one only, 
of the other aggregate; or the correspondence may be incomplete, in which 
case one of the aggregates has one or more elements to which no elements 
in the other aggregate correspond. In the latter case we say that the 
aggregate with the superfluous element or elements contains more elements 

* See Helmholtz’s Zdhlen unti Meaaen, Leipzig, 1887; Wusens. Abhandl. vol. lit, p. 372. 
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than the other aggregate, and that the latter contains fewer elements than 
the former. 

A correspondence between two aggregates is defined when specifications 
or rules are laid down which suffice to decide which element of one aggregate 
corresponds to each element of the other; so that, in the case of complete 
correspondence, no element of either aggregate is without a corresponding 
one in the other. 

Whether, or how far, these fundamental notions of unity , aggregate , 
order , and correspondence should be regarded as derived empirically from 
experience, by a process of abstraction, or whether it must be held that 
they are original forms which the mind possesses prior to, and as the 
necessary conditions of the possibility of such experience, are questions 
into which it is beyond our province to enter. It is certain that civilized 
man possesses these fundamental notions, and it is highly probable that 
primitive man possessed them long before the notion of abstract number 
had appeared in an explicit and developed form. The investigation of the 
origin of these notions, and their further analysis, are matters for the 
Psychologist and for the Philosopher. Mathematical Science, as any other 
special science, must take its fundamental notions as data; it is concerned 
with the analysis of them, only so far as suffices to establish that they 
possess the degree of definiteness which such data must have, if they are 
to lie at the base of a logically ordered system. 


ORDINAL NUMBERS 

2. If some of the elements are removed from an ordered aggregate, 
the aggregate which remains is said to be a part of the original aggregate. 
It will be observed that the relative order of any two elements in the part 
is the same as the relative order of those elements in the original aggregate. 

An ordered aggregate is said to be finite when it satisfies the following 
conditions: 

(1) There is one element which has lower rank than any of the others. 

(2) There is one element which has higher rank than any of the others. 

(3) Every part of the aggregate which contains more than one element 
has an element which has higher rank than every other element in the 
part, and also it has an element which has lower rank than any other 
element in the part. 

These conditions are equivalent to the statement that a finite aggregate, 
and also each part of it, has a first and a last element. 

Every part of a finite ordered aggregate is also a finite ordered aggregate. 

If M be the aggregate, and M x a part of it containing more than one 
element, then M x has a highest and a lowest element; also every part of 
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M 1 , being also a part of M , has a lowest and a highest element in case it 
contains more than one element; therefore M x is itself finite. 

3. Two finite ordered aggregates are said to be similar when they can 
be made completely to correspond, so that to each element of either of 
them there corresponds a single element of the other, and so that to any 
two elements P, Q of the one there correspond two elements P\ Q' of the 
other, which have the same relation as regards rank; viz. that, if P is of 
lower rank than Q, then P' is of lower rank than Q', and if P is of higher 
rank than Q , then P’ is of higher rank than Q'. 

Two finite ordered aggregates which are similar are said to have the 
same ordinal number. 

If each of two ordered aggregates is similar to a third, they are similar 
to one another. For if an element P of the first corresponds to an element 
R of the third, and the element Q of the second corresponds to P, it is 
clear that, if we make P correspond to Q , the first two aggregates are made 
to correspond in such a way that the relative order is preserved. 

It thus appears that an ordinal number is characteristic of a class of 
similar ordered aggregates. 

An aggregate which consists of a single element A is said to have the 
ordinal number one, denoted by the symbol 1. The ordinal number 1 is 
characteristic of every aggregate which consists of a single element. 

If, to the aggregate which consists of an element A , we adjoin a new 
element B , and assign to B a higher rank than A , we obtain an aggregate 
(A, B) which has an ordinal number 2, characteristic of all aggregates which 
are formed in this manner; A is said to be the first element, B the second. 
If, to an ordered aggregate ( A , P), of which the ordinal number is 2, we 
adjoin another element C , and regard this as having higher rank than A 
and B, we obtain an ordered aggregate ( A , B , C ), of which the ordinal 
number is called 3, and is characteristic of all ordered aggregates formed 
in this manner. Proceeding in this way, if we have formed an ordered 
aggregate (A, B , C , ... H ), of which the ordinal number is n 9 and adjoin 
to this aggregate a new element K , we obtain a new aggregate 

{A, B, C, ... H, K), 

of which the ordinal number n' is different from n. 

Any ordered aggregate which is formed in the manner described is finite. 

This can be proved by induction. Let us assume that M is a finite 
ordered aggregate: it will then be proved that (M , e), the ordered aggregate 
obtained by adjoining an element e of higher rank than the elements of M , 
is also finite. Since M has a lowest element, (M, e) has the same lowest 
element, also (M, e) has a highest element e. Again, if M x is a part of 
(M , e) which does not contain e, then M x is a part of M, and therefore has 
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a highest and a lowest element. If M 1 is a part of ( M , e) which contains e , 
let it be (Jf 2 , e), where M 2 is a part of M, and therefore contains a lowest 
element which is also the lowest element of (Jf 2 , e)\ also (M 2 , e) contains 
a highest element e. It has thus been shewn that (M, e) satisfies the re¬ 
quisite conditions that it should be finite, provided M does so. The 
aggregates A,(A,B) are clearly finite: hence the method of induction 
proves that every ordered aggregate which can be formed by continually 
adjoining new elements to an aggregate which originally contained one 
element is a finite one. 

Conversely, it can be shewn that every finite ordered aggregate can be 
formed in the manner above described . 

Let M be a finite ordered aggregate, and let e' be its highest element, 
thus M m (M 1 , e f ). Now M l , being a part of M, has a highest element e", 
thus M y = ( M 2 , e"), or M = (M 2 , e", e'). Proceeding in this manner, if we 
do not reach an aggregate M r which contains a single element only, there 

must exist a part (.e'", e", e', e) of M which has no element of lowest 

rank. But this is impossible, since M is by hypothesis finite, and therefore 
contains no part without a lowest element. It has thus been shewn that 
M can be reduced, in the manner indicated, to an aggregate with a single 
element: and conversely, starting with this latter aggregate, M is obtained 
by adjoining to it successively new elements. 

A finite ordered aggregate is not similar to any part of itself. 

This theorem may also be proved by induction. For if we assume that 
the finite ordered aggregate M is not similar to any part of itself, it can 
be shewn that the same holds for (ilf, e). If possible let M 1 be a part of 
(if, e) which is similar to (M, e) ; then if M x contains e, it must be of the 
form (Jf 2 , e ), and if (M 2 , e) is similar to (if, e), M 2 must be similar to if, 
which is contrary to the hypothesis that M contains no part similar to 
itself. If M t does not contain e, it must be of the form (Jf 2 ,/), where 
/ is the element which corresponds to e in (if, e); in this case again Jf 2 is 
similar to if, and is a part of it; thus we have again a contradiction. The 
theorem holds for (^4, B ), and therefore generally. 

It follows from this theorem that the ordinal numbers 1 . 2, 3, ... which 
have been defined as the ordinal numbers of aggregates 
{A),(A,B), (A, B, C y ), ... 

are all different from one another , for each of these aggregates being a part 
of each of those which follow it, cannot be similar to any of the aggregates 
which follow it. 

Each of the ordinal numbers is to be regarded as a unique ideal object 
in that it is a permanent object for thought. The relation of an ordinal 
number to an ordered aggregate of objects which is characterized by that 
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number, may be illustrated by the analogy of the relation between the 
colour red and a particular red object. 

4. A simply infinite ascending aggregate , or simple sequence , is an 
ordered aggregate which has no element of higher rank than all the others, 
and is such that every part which has an element of higher rank than all 
the other elements in that part is a finite ordered aggregate. 

The fundamental assumption must be made that such an aggregate 
may be regarded as a definite object which possesses certain properties 
that can be formulated. The justification for the assumption is to be found 
in the fact that no contradiction arises in the theory based on it. 

It follows from this definition that, in a simple sequence there is one 
element of lower rank than all the others; and further, that every part of 
the simple sequence has an element of lower rank than all the other 
elements in that part. 

A simply infinite ascending aggregate differs from a finite ordered 
aggregate in having no element which is of higher rank than all the other 
elements. 

The totality of ordinal numbers forms a simply infinite ascending 
aggregate; those objects may be represented by a set of signs 

«, P, y, 8, ••• 

or 1, 2, 3, 4, ... 

where it is assumed that some adequate scheme of such signs has been 
devised. 

The order of the elements is assigned by the successive formation, as 
above, of aggregates having the various elements for their ordinal numbers, 
and it has been shewn that, if an aggregate has the ordinal number n , 
another aggregate having a different ordinal number n', taken to be of 
next higher rank than n } can be formed. There exists therefore no highest 
ordinal number. 

Instead of using the expressions “of higher rank” and “of lower rank,” 
it is usual to say that a number m is less than a number n , when m is of 
lower rank than n in the ordered aggregate of ordinal numbers, and that 
n is greater than m. The terms “greater” and “less” are borrowed from 
the language primarily applicable to the description of magnitudes: but 
in pure arithmetic and pure analysis generally, they are used only in the 
sense in which they indicate higher or lower rank, and this rank has no 
necessary reference to relations of magnitude or of measurable quantity. 

The operation of counting a finite aggregate of objects of any kind may 
be conceived of as the process of putting the objects into correspondence 
with the elements of the aggregate of ordinal numbers, in such a way that, 
when any ordinal number has an element of the aggregate which corre- 
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sponds to it, each of the preceding ordinal numbers also has an element 
which corresponds to it. The finite aggregate is usually ordered by the 
process itself, the ranks of the various elements being successively assigned 
to them as the counting proceeds. Those ordinal numbers which are em¬ 
ployed in counting such an aggregate may be regarded as forming an 
aggregate which is similar to the given aggregate, as ordered by the process 
of counting. The last of the ordinal numbers employed in counting a finite 
aggregate is the ordinal number , or simply the number (Anzahl) of the ordered 
aggregate. 

The theorem that an ordered aggregate is not similar to any of its parts 
holds only as regards finite aggregates. It will appear in the course of the 
discussion in Chapter iv that every aggregate which is not finite has parts 
which are similar to the whole; and this property is sometimes taken as the 
basis of the definition of an infinite, or transfinite, aggregate. For example, 
the aggregate of ordinal numbers 1, 2, 3, ... is similar to the part 2, 4, 6, ... 
which contains the even numbers only. 

MATHEMATICAL INDUCTION 

5. The proofs of theorems in § 3 have been referred to as proofs by 
induction. In its general form the principle of Mathematical Induction 
may be stated as follows: 

u, in respect of a given simply infinite ordered aggregate , it be known 
(1), that, in case any element of the aggregate possesses a certain property P , 
the element of next higher rank also possesses the property P, (2), that the 
element of lowest rank possesses the property P; it then follows that every 
element of the aggregate possesses the property P. 

The truth of the principle follows as a consequence of the properties 
assigned to the aggregate by means of the definition in § 4. 

For, let it be assumed, if possible, that there exists a part of the given 
aggregate A, such that the elements belonging to that part do not possess 
the property P. This part contains one or more elements; let M be such an 
element. Let us consider the finite aggregate which consists of all those 
elements of A which are of rank not higher than that of M. Of this finite 
aggregate there exists a part B , such that no element of B possesses the 
property P. The aggregate B contains the element M , and possibly other 
elements. Since B , being part of a finite aggregate, is itself a finite aggre¬ 
gate, it contains an element m of lower rank than all the others. By 
hypothesis, m cannot be the element of lowest rank in A , since the latter 
possesses the property P. Therefore there is in A an element m' of next 
lower rank than m; and m ' possesses the property P. But by hypothesis, 
if m ' possesses that property, so also does m. The assumption that A con¬ 
tains a part such that the elements of the part do not possess the property 
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P has thus been shewn to lead to a contradiction. The truth of the principle 
has therefore been established. 

It has been maintained by Poincare* that the principle of Mathematical 
Induction is a special characteristic of mathematical reasoning as distinct 
from that of general Logic, not being reducible to the principle of con¬ 
tradiction, as it essentially involves the employment of an unending chain 
of syllogisms. That this is the case is suggested by the form in which the 
principle is frequently applied in elementary mathematical treatises, viz. 
that since the property P holds good for the first element, it therefore 
holds good for the second, and therefore for the third, etc.; and conse¬ 
quently, by the employment of an unending set of such syllogisms, it is 
inferred that the property P holds good for all the elements of the infinite 
aggregate. It has however been shewn above that it is not necessary to 
state the principle in this form, but that the truth of the principle follows 
by applying the ordinary Logic to deduce the consequences of the posses¬ 
sion by the infinite ordered aggregate of certain properties, in accordance 
with the principle of contradiction. 

CARDINAL NUMBERS 

6. If any finite ordered aggregate be re-ordered in any manner , the new 
ordered aggregate is finite , and has the same ordinal number as the original 
one . 

In order to prove this theorem, the following particular case will be 
first established:—If Q is a finite ordered aggregate, the aggregate ( Q , e), 
obtained by adjoining to Q a new element e of higher rank than all the 
elements of Q, is similar to (e, Q ), in which e has a lower rank than all 
the elements of Q . For let Q = (Q 1 , / ), and let us assume that the theorem 
holds for Q y , i.e. that (Q x , e) is similar to (e, Q x ); it follows, since a complete 
correspondence can be established between the elements of (Q lf e) and 
(e, Q x ), that the same is true of the two aggregates (Qi, e,f ) and (e, Q x ,f). 
Now ($ 1 , e,f) is similar to (Q x ,f, e), since Q A can be made to correspond 
to itself; e to /, and f to e, therefore (# 2 , /, e) is similar to (e, Qi,f) or, 

( Q , e) to (e, Q), and thus the theorem holds for Q = (Quf), provided it 
holds for Q x . Now it clearly holds if Q x consists of a single element; hence 
by induction it holds for any finite ordered aggregate Q. To prove the 
theorem in the general case, let us assume that it is true for an aggregate 
M ; it will then be shewn to be true for ( M , e). For let an aggregate obtained 
by re-ordering (M , e) be (R, e, $), where either R or S may be absent; 
(R y e , S) is similar to (R , *S\ e), for R corresponds with itself, and it has 

* See his work La science et Vhypothese , Paris (1902), p. 19; also an article in the Revue de 
Meta physique, vol. n (1894), p. 371. For a criticism of Poincare’s view, see an article by A. Padoa, 
Proc. of the fifth annual Congress of Mathematicians, vol. n, p. 471. 
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been shewn above that (e, 8) is similar to ( S , e). Since (JR, $) is by hypothesis 
similar to M , it follows that ( R , $, e) is similar to (if, e), and therefore 
(jR, e, $) is similar to (if, e). The theorem clearly holds for an aggregate 
(A, B) which contains two elements, hence by induction it holds for every 
finite ordered aggregate. 

It follows from the theorem which has been established above, that. 
for any aggregate which can be ordered as a finite ordered aggregate , the 
ordinal number is independent of the mode in which the aggregate is ordered. 

It will be found, when the generalization of ordinal numbers for non- 
finite aggregates is considered in Chapter iv, that this property, that the 
ordinal number of an aggregate is independent of the mode of ordering, 
is peculiar to finite aggregates. 

7. Two aggregates are said to be equivalent , when their elements can 
be placed into correspondence so that to each element of either aggregate 
there corresponds one and only one element of the other aggregate. 

It will be observed that the relation of equivalence differs from that of 
similarity, in that it contains no reference to order. It is clear that two 
aggregates which are each equivalent to a third are equivalent to one 
another. 

An unordered aggregate is said to be finite when it can be so ordered 
that the ordered aggregate is finite in accordance with the definition given 
in § 2. 

Two (finite) aggregates which are equivalent are said to have the same 
cardinal number. 

It thus appears that a cardinal number is characteristic of a class of 
equivalent aggregates. 

Each of the cardinal numbers is to be regarded as a unique ideal 
object; the relation of a cardinal number to a member of the class of 
equivalent aggregates of objects, of which it is characteristic, may be 
illustrated in the same manner as in § 3, in the case of the ordinal numbers. 

Since all similar aggregates are also equivalent, and since, in the case 
of a finite aggregate, the ordinal number is independent of the mode m 
which the aggregate is ordered, it follows that for every finite ordinal 
number there is a corresponding cardinal number. 

The cardinal numbers of finite aggregates are denoted by the same 
symbols 1, 2, 3, ... as the corresponding ordinal numbers. The two kinds 
of numbers are not symbolically distinguished from each other, although 
logically they are not identical. 

It will be seen in Chapter iv that this practical identity of ordinal and 
cardinal numbers is confined to the case of the numbers corresponding to 
finite aggregates, and therefore called finite numbers. The finite cardinal 
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numbers form a simple sequence 1, 2, 3, ... similar to the sequence of finite 
ordinal numbers; the expressions “ greater ” and “ less ” are used in relation 
to two cardinal numbers in the same purely ordinal sense, denoting higher 
and lower rank, as in the case of ordinal numbers. 

It is impossible, in a purely mathematical work, to enter into a full 
discussion of the nature and proper definition of number from a philoso¬ 
phical point of view. One view of number, which is widely held, is embodied 
in the definition by abstraction, in which the cardinal number* is regarded 
as the concept of an aggregate which remains when we make abstraction 
of the nature of the objects forming the aggregate, and of the order in 
which they are given ; the ordinal number is then regarded as the concept 
obtained by making abstraction of the nature of the objects only, retaining 
the orderf in which they are given in the aggregate. The view has also 
been maintained J that a cardinal number is simply the class of all equiva¬ 
lent aggregates. A tendency has been exhibited amongst mathematicians§ 
to regard numbers, at least for the purposes of analysis, as identical with 
the symbols which represent them. In accordance with this view, abstract 
arithmetic is cut entirely adrift from the fundamental notions related to 
experience in which it had its origin, and it is thus reduced to a species 
of mechanical game played in accordance with a set of rules which, when 

* This view is that of G. Cantor; see Math. Annalen , vol. xlvi (1895), p. 481, where the 
following definition is given:—“‘Maehtigkeit,’ oder ‘Cardinalzahl’ von M nennen wir den 
Allgerneinbegriff welcher mit Hiilfe unseres activen Denkvermogens aus der Menge M hervorgeht, 
dass von der Beschaffenheit ihrer verschiedenen Element© m, und von der Ordnung ihres Gege- 
benseins abstrahirt wird.” See also Peano, FormuUiire.8 de Mathematiques, 1901, § 32, -0 Note. 

f Ordinal numbers are frequently regarded as logically prior to cardinal numbers, but this 
order of procedure is not a necessary one. In Dedekind’s tract 14 Was sind und was sollen die 
Zahlen,” Brunswick, 1887 and 1893, which has been translated into English by Prof. W. W. 
Bernan, under the title “Essays on the Theory of Numbers,” 1901, a detailed treatment of the 
subject is given) in which the notion of order is regarded as fundamental. 

X See B. Russell, The Principles of Mathematics , vol. i (1903), chap. xi. 

§ For example see Heine, Crelle's Journal , vol. lxxiv (1872), p. 173, where the matter is stated 
in the following plain form: “Ich nenne gewisse greifbare Zeichen Zahlen, sodass die Existenz 
dieser Zahlen also nicht in Frage steht.” Again, Helmholtz appears to hold a view closely 
approaching the notion that Arithmetic is the art of manipulating certain signs according to 
certain rules of operation; he writes in Oes. Ahh. vol. in, p. 359, “Ich betrachte die Arithmetik 
oder die Lehre von den reinen Zahlen als eine auf rein psychologische Thatsachen aufgebaute 
Methode, durch die die folgerichtige Anwendung eines Zeichensystems (namlich der Zahlen) von 
unbegrenzter Ausdehnung und unbegrenzter Mbglichkeit der Verfeinerung gelehrt wird.” Refer¬ 
ence may be made to an essay by A. Pringsheim in the Jahresberichte der d. math. Vereinigung t 
vol. vi (1899), p. 73, “Ueber den Zahl- und GrenzbegrifI im Unterricht.” In an article entitled 
“Die Du Bois-Reymond’sche Convorgenz-Grenze,” Sitzungsberichte d. bayer. Akad. vol. xxvii 
(1897), Pringsheim speaks of numbers as “Zeichen, denen lediglich eine bestimrate Succession 
zukommt.” See p. 326. This article contains various remarks on arithmetization, and especially 
a criticism of the views of P. Du Bois-Reymond. A searching criticism of the tendency to reduce 
Arithmetic to the formal manipulation of symbols is given in L. Couturat’s work De Vinfini 
mathdmatique , Paris (1896), which contains a valuable account and discussion of theories of the 
philosophy of arithmetic. 
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divorced from their origin, have the appearance of being perfectly arbitrary; 
though it may, of course, be said that it is possible at the end of any 
arithmetical process to reconnect the symbols employed with the ideas 
which originally suggested them, and thus to interpret the results of the 
purely symbolical processes. Whatever view* be adopted as to the real 
nature of number and its place in a general scheme of thought, the as¬ 
sumption of the right to hypostatize numbers would appear to be an 
essential condition of the possibility of developing an abstract arithmetic, 
and consequently of the establishment of mathematical analysis in general. 


THE OPERATIONS ON INTEGRAL NUMBERS 

8. If two finite ordered aggregates A and B , of which the ordinal 
numbers are a and b respectively, are combined into a single ordered 
aggregate in which the elements of A have all lower rank than those of 
B , and in which any two elements of A , and any two elements of 1B, have 
the same relative orders as in the original aggregates, then the ordinal 
number of the combined, aggregate is said to be the sum of the ordinal numbers 
a and b , and is denoted by a -j- b. 

It can be shewn that the new aggregate is a finite one, and that its 
ordinal number is unaltered if for A and B there be substituted aggregates 
which are similar to them; it thus appears that the sum a -f b is a finite 
number which depends only upon a and b. 

The aggregate (A, B) has as lowest element the lowest element of A , 
and as highest element the highest element of B; moreover any part of 
(A, B) is of the form ( A ', B'), where A’ is a part of A, and B' is a part of 
B\ or else it has one of the forms A', B' . Since A', B' have each a lowest 
and a highest element, any such part of (A, B) has a lowest and a highest 
element. Thus {A, B) is finite. 

Again, if Aj, B x are aggregates which are similar to A and B respectively, 
the elements of A may be placed in correspondence with those of A t , and 
the elements of B with those of B x ; we have then a (1, 1) correspondence 

* References to the literature relating to the Philosophy of Number will be found in the 
article 1. A. 1, “Grundlagen der Arithmetik,” by H. Schubert, in the Encyclopddie der mathe- 
matischen Wissenschaften, vol. I; also in E. G. Husserl’s Philosophic der Arithmetik , vol. i, chaps. 
5 and 6 , Halle (1891). Erege’s treatises, the Grundlagen der Arithmetik , Breslau (1884), and the 
Grundgesetze der Arithmetik, Jena (1893 and 1903), also Whitehead and Russell’s work, Principia 
Mathematica, Cambridge (1910 and later), may be here specially referred to, in connection with 
the relation of the foundations of Arithmetic to general Logic. The view that Number is 
fundamentally dependent on the notion of Time was developed by Sir W. R. Hamilton; see the 
Dublin Transactions, vol. xvri (II) (1835), “Theory of Conjugate Functions or Algebraic Couples 
with a Preliminary and Elementary Essay on Algebra as the Science of Pure Time”; see also 
Helmholtz’s essay Zahlen und Messen (1887), where the view is adopted that the axioms of 
Arithmetic have a relation to the intuitional form of time, similar to that which the axioms of 
Geometry have to the intuitional form of space. 
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between the elements of (A, B) and those of (A t , BJ; thus the ordinal 
number of ( A , B) is the same as that of (A l9 B 2 ). 

Since (A, B) has the same ordinal number as (B, A) it follows that 
a 4- b — b + a, which is known as the commutative law of addition. 

If a, b are the cardinal numbers of two finite aggregates A , B , then the 
cardinal number of the aggregate formed by combining the two aggregates 
into one is said to be the sum of a and b, and is denoted by a -f b. That 
a + b is a definite finite number, dependent only on a and 6, follows at 
once from the corresponding theorem which has been proved for ordinal 
numbers. 

The operation of finding the sum of two numbers a and b is known as 
the operation of addition, and it has been shewn that this operation is 
commutative. It should be observed that the sum of two numbers a and 
b cannot be determined merely by contemplating those numbers them¬ 
selves as abstract concepts, but can only be defined as above, by referring 
to aggregates of which a and b are the numbers, and then combining those 
aggregates. The number of the combined aggregate is then conceived of 
as the result of a symbolical operation upon the numbers a and b. For 
example, the equation 5 4 3 = 8 docs not imply that the concept 8 is 
obtainable by placing the concepts 5 , 3 as it were in juxtaposition, but 
can only be regarded as a symbolical expression of the fact that an 
aggregate of 5 objects together with one of 3 objects makes up an aggregate 
of 8 objects. Bearing this observation in mind, the numbers 1, 2, 3, ... 
are represented symbolically as the results of successive operations of 
addition, 1 + 1 = 2, 2+1—3, 3+1 — 4, etc.; but these equations do 
not express definitions of the numbers 2, 3, 4, ..., since, from the concept 
unity taken by itself, no other concept is directly derivable. 

The operation of addition can be extended by continued repetition. 
Thus the sum of a, b, c, ... k is a finite number represented by 

a + b + c + ... + k, 

and, in particular, any number w is represented by n = 1 + 1 + 1 + ... + 1. 
An immediate induction shews that the result of the operation of addition 
repeated any definite number of times is a finite number dependent only 
on the constituents of the summation. 

The associative law of addition, a + (b + c) -= (a + b) + c, follows from 
the irrelevancy of the order in which the operations are performed. This 
is seen from the contemplation of aggregates of which a, 6, c are either the 
ordinal or the cardinal numbers. 


9 . If in a finite aggregate of which the number is 6, each element be 
replaced by a finite aggregate of which the number is a, the number of the 
new aggregate so formed is said to be the product of b by a, and is denoted 
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by ab. This operation is said to be that of multiplying b by a. By taking 
the aggregates to be ordered, it is seen at once that the new aggregate 
satisfies the conditions that it is finite, and that its number is unaltered 
by the substitution of similar aggregates of other objects for those originally 
employed. Thus ab is a definite number dependent only on a and b. 

It is clear that ab may be regarded as the sum a -f a -f a 4 ..., where 
a occurs b times in the operation. 

If the ordered aggregate of which the number is ab be re-ordered in 
the following manner:—take the first element of each of the aggregates 
of which a is the number, then the second elements of these aggregates, 
and so on, with lastly the ath elements of these aggregates, then we have 
as the result of the process an aggregate of which the number is a, and 
each element of which consists of an aggregate of which the number is b ; 
the re-ordered aggregate has the number ha. It has thus been shewn that 
ah — ba, which is expressed by saying that the operation of multiplication 
of finite integers is commutative. 

The distributive law for multiplication, a (b c) ~ ab -j- ac, follows 
from the definition of the operation, by considering the aggregates of 
which a, 6, c are the numbers. 

An immediate induction shews that the repetition of the operation of 
multiplication any definite number of times gives a finite number dependent 
only on the numbers multiplied, and independent of the order in which the 
operations are perform ed. 

The result of the operation of multiplying the number a by itself is 
denoted by a n , where n is the number of times a occurs in the product 
a,a.a .. .a. From this definition the law a™ .a n — a m{n is directly deducible. 

10. If the sum of two numbers a, b be denoted by c, the number a is 
uniquely determined when b , c are fixed; and it is then regarded as the 
result of the operation of subtracting 6 from c. The operation of sub¬ 
traction is thus defined as inverse to that of addition. If c — a + 6, a is 
obtained as the result of the operation denoted by c — 6, which is such 
that (c — b) -f- b = c. It is obvious that the operation of subtraction of 
b from c is only possible in case c > b. 

If the product of two numbers a, 6 be the number c, then the number 
a is uniquely determined when b and c are given; and a is regarded as the 
result of the operation of division of c by b. The operation of division so 
defined is inverse to that of multiplication; it is clear that the operation is 
only possible in case c is one of the class of numbers 6, 2b , 36,. 
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FRACTIONAL NUMBERS 

11 . The operation of multiplying two integers a, b together is one 
which is always a possible operation, in accordance with the definition of 
the operation of multiplication which has been given above; the inverse 
operation of division is however, as we have seen, not always a possible 
one. This restriction upon the possibility of the operation of division 
suggests the introduction into Arithmetic of a new class of numbers, the 
rational fractions, which, when defined, shall be such that the operation 
of division, within the whole aggregate of integers and fractions, may be 
a possible one without restriction. Stated in algebraical form, the demand 
arises for a scheme of numbers such that the equation ax =- b shall always 
have a solution in x, where a, b are any two numbers which belong to the 
contemplated aggregate of numbers. 

The actual use of fractional numbers arose historically from the 
necessities of the process of measurement of extensive magnitude, and the 
conception of a fraction which arises in this connection is the one which 
is used in ordinary life, and is made the basis of the treatment of the theory 
of fractions, even in recent scientific text-books. In accordance with this 
view, a unit of magnitude of some kind is divided into b equal parts, and 
a of these parts are taken ; the resulting magnitude is then denoted by 
the fraction ajb. 

This notion of the essential nature of a fraction, dependent as it is upon 
the notions of a unit , and of the divisibility of such unit into equal parts, is 
incompatible with the modern view that Mathematical Analysis should be 
developed upon the basis of a Pure Arithmetic, quite independently of all 
notions connected with the measurement of extensive magnitude. The 
modern tendency known as Arithmetization manifests itself in the con¬ 
struction of theories of Number and of the operations involving numbers, 
which depend entirely upon the conceptions connected with the process 
of counting; measurement being regarded as a process foreign to Pure 
Arithmetic. The process of counting is an exact one: whereas measurement 
can in practice only be carried out with a greater or less degree of approxi¬ 
mation, and can only ideally be made an exact process. Pure Arithmetic* 
is made the basis of Analysis, not only in accordance with the general 
principle that the fundamental conceptions of a branch of science should 
be irreducible to simpler conceptions, but also because the theory of ideally 
exact measurement has peculiar difficulties of its own. Our essentially 
inexact intuitions of spatial, temporal, or other magnitudes, necessitate 
a process of idealization in which the objects of perception are replaced 
by ideal objects subject to an exact scheme of definitions and postulates, 
in order that an exact science of measurement may be possible. The view 
is at present held by the majority of mathematicians that the nature of 
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the abstract continuum, and that of a limit, are capable of exact formula¬ 
tion only in the language of a Pure Arithmetic; and that this science must 
therefore be developed upon an independent basis before it can be applied 
to the elucidation of the conceptions requisite for an abstract theory of 
continuous magnitude. The theory of measurement is, in accordance with 
this view, regarded as an application, and not as part of the basis, of 
Mathematical Analysis. 

12 . By those writers who are under the influence of the modern 
arithmetizing tendency, the traditional non-arithmetical definition of a 
fraction has been abandoned, and in its place a formal definition has been 
substituted, in which the fraction is regarded as an association of a pair 
of integers. The associated integers are regarded as making a single object, 
and laws of combination of these objects are then postulated. 

If a, b are two integers, a new number (a, 6), or in ordinary notation 
is formed by the association of a and 6, the new number being defined 
to be such as to satisfy the following conditions: 

(1) (a, b) is regarded as ordinally greater, equal to, or less than (c, d ), 
according as ad is greater, equal to, or less than be. The expressions greater, 
equal to, or less than, are here used, not in their primitive sense as referring 
to magnitude, but in the sense in which we have used them in the case of 
integers, as assigning relative order to the numbers. 

(2) (a, 1) is defined as equal to a; thus if b --- 1, the association is 
regarded as equivalent to the integer a. Taking (1) in conjunction with 
this postulate, the new numbers have their orders assigned, not only 
relatively to one another, but relatively also to the integral numbers; so 
that the whole aggregate of integers and fractions is ordered, in the sense 
that, of two given numbers, it can always be said which has the higher rank. 

(3) The addition of two fractional numbers is defined by 

(a, 6) -f (c, d) = {ad -f be , bd). 

(4) The multiplication of fractional numbers is defined by 

(a, 6) x (c, d) — (ae, bd). 

(5) The use of a fraction as an index is defined by the postulate 

#(<*»&) x x ( c,d ^ — #(«»&) + ( c , d ), 

where x is any number, either integral or fractional. The symbol a^ a » b) is 
to be interpreted subject to this postulate, in case such interpretation is 
possible. 

It will be observed that, in the case b = 1, d = 1, the above definitions 
are consistent with those which have been adopted in the case of integral 
numbers; and thus the new numbers, together with the integers, form an 
aggregate with uniform laws of operations. It is easily seen that the opera- 
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tions with the new numbers satisfy the commutative, associative, and 
distributive laws. The inverse operation of division is now one which is 
always possible within the domain of the numbers; thus 

(a, b) (c, d) — ( ad , be). 

The inverse operation of subtraction, (a, b) — (c, d) = (ad — be , 6d), is only 
possible if (a, 6) > (r, rf). 

The association of a pair of integers is a “number” in quite a different 
sense from that in which the cardinal and ordinal numbers, hitherto dis¬ 
cussed, are numbers. The justification of the extension of the term 
“number” to the fractions lies in the fact that a consistent scheme of 
operations can be imposed upon them, of which the laws are in agreement 
with those which hold for operations which involve integers only. 

13 . The scheme which has been above indicated suffices for a formal 
definition and logical development of the properties of fractions, but it is 
subject to the objection that it is of an arbitrary character; indeed it is 
not easy to see why the particular laws of operations have been postulated, 
except as suggested by the traditional non-arithmetical conception of a 
fraction. 

To remedy this defect, a view of the nature of a fraction will be here 
given which relates the fraction with the process of counting, in such a 
manner that fractional and integral numbers have similar relations to that 
process. It will appear that the laws of combination given above naturally 
follow from this mode of regarding the fraction, with the exception of (5), 
which is however immediately suggested by the rule for integral indices. 

Consider an aggregate of b objects, and out of these b objects pick out 
any a b) of them. If we regard these a objects not only as single objects 
of number a, but also as belonging to an aggregate whose number is b, we 
may denote the a objects by (a, b), where their number a is associated with 
the cardinal number b of the aggregate to which they belong. This process 
being independent of the particular aggregate used, the abstract fraction 
(a, b) is related to this process in an analogous manner to that in which 
the number b is related to the process of counting an aggregate whose 
cardinal number is b. Thus the fraction (a, b), or a/6, is characteristic of 
an aggregate of a objects each of which belongs to an aggregate of b objects. 
The extension of the definition to the case a > b is clear when we observe 
that it is unessential that the a objects taken should all belong to one and 
the same aggregate of b objects ; it is sufficient that each of them be re¬ 
garded as essentially belonging to some aggregate of cardinal number b. 
In accordance with this view, a fraction, say 3/5, is characteristic of any 
three things each of which belongs to an aggregate of five things, i.e. 3/5 
means 3 out of 5. That the three things taken out of five should necessarily 
all be equal in respect of size, or some other kind of magnitude, is as 
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irrelevant to the true nature of a fraction as the assumption of five things 
necessarily meaning five equal things is to the true nature of the number 
five. 

Since (a, 1) is characteristic of an aggregate of a things each of which 
is also regarded as a single object, it is clear that (a, 1) is identical with a. 

If we suppose each of the b (dements in an aggregate, of which the 
cardinal number is b, to be replaced by an aggregate of n elements, we 
have now an aggregate with nb for its cardinal number; and instead of 
a elements chosen out of this aggregate we now have na of the new ele¬ 
ments, each of which is to be regarded as associated with the cardinal 
number nb. We represent these na elements by (na, nb), which is equivalent 
to (a, b), since the two forms represent two different aspects of the same 
process. Therefore we have (a, b) — (na, nb), or in the ordinary notation 
ajb = na/nb. This relation is in complete accordance with the law of logical 
(not arithmetical) addition, that a mere repetition of a term yields only 
the term itself. 

Since (a, b) (ad, bd), and (c, d) — (be, bd), we regard (a, b) as greater, 
equal to, or less than (c, d), in the purely ordinal sense of the terms, ac¬ 
cording as ad is g be. For the two numbers (ad, bd), (be, bd) are characteris¬ 
tic of the process of taking ad, be elements respectively from an aggregate 
of the same cardinal number bd ; and thus the relative order of the two 
numbers (a, b), (c, d) will naturally be fixed in accordance with the relative 
order of the two numbers ad, be. 

The addition of the two numbers (a, b) and (c, d) is equivalent to that 
of (ad, bd) and (be, bd), and is consequently naturally defined as given by 
(ad -f be, bd), which characterizes the amalgamation of two aggregates of 
which the numbers are ad, be, the elements of each of which all belong to 
an aggregate of number bd, or to one of several such aggregates. 

To interpret the operation of multiplication, let us consider an object 
represented by (c, d); this consists of c things each belonging to an aggre¬ 
gate of d things. To multiply it by (a, b), is to take a such objects each of 
which belongs to an aggregate of b such objects; we have on the whole 
one or more aggregates of bd elements, and out of these, ac elements are 
to be taken. Thus the multiplication of the number (c, d) by the number 
(a, b) may be understood to characterize the result of taking a objects, 
each of which is characterized by (c, d), out of one or more collections of 
b objects, each of which objects is characterized by (c, d). This is the same 
thing as the process of taking ac objects out of one or more aggregates of 
bd objects, and is characterized by the number (ac, bd ); we are thus led 
to the law of multiplication 

(a, b) x (c, d) = (ac, bd), or ~ x | 
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NEGATIVE NUMBERS, AND THE NUMBER ZERO 

14. Although the operation of addition is always possible within the 
aggregate of integral and fractional numbers, yet the inverse operation of 
subtraction is not always possible; thus a number x cannot be found such 
that x -|- (c, d) = (a, b), unless (a, b) > (c, d). As the limitation of the 
possibility of division suggested the introduction of fractional numbers, 
so this limitation of the possibility of subtraction suggests the introduction 
of a further set of new numbers, which shall be such that, within the so 
completed aggregate, subtraction may always be a possible operation. 

If a , P, y, 8 denote integral or fractional numbers such that a > p, y > 8; 
we may put a = P + x, y = 8 1- y ; then x — a — p, y =-•= y — 8. We have 

« + y- j3 + 84 x + y, 

hence x + y ------ (a 4- y) — (P 4- 8), 

or (a -j8) + (y ~ 8) =■-(a 1 y) - (p I- 8) .(1). 

Again, if a — p = y — 8, Le. a: = y, we have a 4- 8 = p 4 8 4- a; - - /3 I y; 

or a + 8 = p 4- y, if a — P = y — 8 .(2). 

Lastly, we have 

«y = (0 + x) (8 -f- y) = j8 (8 4- y) 4- z (8 4- y) 

= j38 4- £y 4- ^8 4- xy; 
hence ay 4- /?8 = /? (y 4- 8) 4- 8 (# 4- j3) 4- xy 

= fly 4- a8 4- a;y; 

hence (a — /3) (y — 8) = (ay 4- ^8) — (a8 4 j3y) .(3). 

The rules (1), (2), (3), with regard to the numbers a — p, y — 8, which 
so far exist only when a > P, y > 8, suggest the mode of the extension 
referred to above. 

15. Let a, ft be any two numbers integral or fractional, and conceive a 
new number 1) (a, ft), formed by the association of a and ft, to be defined 
as subject to the laws 

(4) , D (a, p) = I) (y, 8), if a 4- 8 - p 4- y, 

(5) , 2) (a, /?) 4- D (y, 8) = D (a 4- y, p 4- 8), 

(6) , D (a, p) x D (y, 8) = D (ay 4- j38, a8 -f- py ): 

it will be observed that when a > p, and y > 8, Z> (a, /?) may denote a ~ 
the three laws becoming (2), (1), (3). It will now be shewn that the symbol 
D (a, P) defines a number of an aggregate within which the operation of* 
subtraction is always possible. For, to find a number x , such that 

x + D (a, P) =* D (y, 8), we see that x ^ D (P 4- y, a + 8), 

D (a 4- P 4- y, a 4- P 4- 8) = D (y, 8), in virtue of (4). 


since 
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Since D (a, a) ~ D (y, y), we see that I) (a, a) is independent of a ; and 
thus D (a, a) defines a new number which is called the number zero , and is 
denoted by the symbol 0. 

The number zero is regarded as characteristic of the absence of all 
elements from an aggregate of which the existence has been contemplated; 
it is the number of such a hypothetical aggregate, in a sense similar to 
that in which an integer is the number of an actual aggregate. 

The number 1) (a + k , k) depends only on «, and we shall postulate that 
it is identical in meaning with a itself. 

The numbers D (a, ft), or a — ft, for which a > ft, are called positive 
numbers, and form the aggregate of integral and fractional numbers we 
have previously considered. 

Those numbers, for which a < /3, are called negative numbers. 

Since by (5), 1) (a, ft) 1 D (/?, a) I) (a -f ft, « + ft) — 0, the number 
1) (ft, a) may be denoted by — I) (a, ft), or in ordinary notation — (a — ft). 
Thus to every positive number x there corresponds a single negative number 
— x, which is such that x -f ( — x) -^ 0. 

We may now use the notation a - ft in every case for D (a, /?), and thus 
a -(P a). 

From (6), it is seen that the operation of division is always possible for 
two members of the complete aggregate of positive and negative numbers 
and zero, except when the divisor is the number zero, in which case the 
operation is meaningless. 

From (6), we see by putting y = 8, N =& I) (a, ft), that N .0 = 0. From 
(5), we have N -f 0 = N. 

Any number D (a , ft) is said to be greater in the ordinal sense than 
D (y, 8), when D (a, ft) — D (y, 8) is positive; thus the complete aggregate 
of positive and negative integral and fractional numbers together with the 
number zero is one in which all the numbers are arranged in a definite 
order. This aggregate is known as the aggregate of rational numbers. 

In the aggregate of rational numbers so ordered, the number zero has 
lower rank than any of the positive numbers, and higher rank than any 
of the negative numbers. Further, if x, y are two positive numbers of 
which x has higher rank than y , the negative number — x has lower rank 
than — y. 

If x, y are any two rational numbers , such that x < y, there exist an 
unlimited number of rational numbers each of which is > x, and < y. 

Such numbers are said to be between x and y. For it can be seen at 
once, from the definition of order given above, that £ (x -f y) is one such 
number; between £ (x + y) and either x or y another rational number can. 
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in a similar manner, be found. This process can be carried on without end; 
and it is clear that, in accordance with the mode of ordering of the aggre¬ 
gate, defined above, all the numbers thus determined are between x and y . 

If x , y are any two positive rational numbers such that x < y, an integer 
n can be found which is such that nx > y. 

For is a rational number such that ~^.x = y \ again if ^ be any positive 

rational number, there exist integers which are > ^; for p -f 1 is itself such 

an integer. If n is an integer which is > ^, we have nx > ~^.x > y ; thus 
the theorem is established. 

IRRATIONAL NUMBERS 

16. The only numbers of which the existence was recognized by the 
Greek geometers were the rational numbers, although the fact that the 
ratio of two geometrical magnitudes is not necessarily exactly represent¬ 
able by such a number appears to have been discovered at a very early 
period. Euclid gave, in the fifth book of his treatise, a discussion of the 
theory of ratios, and in the tenth book, a theory of those incommensurable 
magnitudes which are ideally constructible by means of straight lines and 
circles. In later times*, the idea was current that, to the ratio of any 
two magnitudes of the same kind, there corresponds a definite number; 
and in fact Newton in his Arithmetica Universalis expressly defines a 
number as the ratio of any two quantities. Before the recent development 
of the arithmetical theories of irrational number, and to a considerable 
extent even later, a number has been regarded as the ratio of a segment 
of a straight line to a unit segment, and the conception of irrational 
number as the ratio of incommensurable segments has been accepted as 
a sufficient basis for the use of such numbers in Analysis. 

In accordance with the doctrine that Mathematical Analysis must rest 
upon a purely arithmetical basis, the introduction of irrational numbers 
into Analysis must be made without an appeal to our intuition of extensive 
magnitude, but rather by an extension of the conception of Number, 
resting on a further development of the ideas which have been here dis¬ 
cussed in connection with the theory of rational numbers. The necessity 
for this extension of the domain of Number arises not only on account of 
the inadequacy of rational numbers for application to ideally exact 
measurement, but also, as will be explained later in detail, because the 
theory of limits, which is an essential element in Analysis, is incapable of 

* A good short account of the history of this subject will be found in the article I. A. 3, “Irra- 
tionalzahlen und Konvergenz unendlicher Prozesse,” by A. Pringsheim, in the Kncyclopadie der 
math. Wiseertschaften, vol. i. See also M. Cantor, Oeschichte der Math., vol. I. 
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any rigorous formulation apart from a complete arithmetical theory of 
irrational numbers. 

Before the recent establishment of the theory of irrational numbers, 
no completely adequate theory of Magnitude was in existence. This is not 
surprising, if we recognize the fact that the language requisite for a com¬ 
plete description of relations of magnitudes must be provided by a developed 
Arithmetic. 


17 . The successive extensions of the domain of Number, by the intro¬ 
duction of fractional and of negative numbers, were suggested by the 
desirability of so completing the domain that the operations of division 
and subtraction, which are not always possible in the more limited domain, 
might always be so in the more extended one. In the aggregate of rational 
numbers, the operations of addition, subtraction, multiplication, and 
division (except when the divisor is zero) are always possible operations; 
but it can be readily shewn that the inverse operation involved in deter¬ 
mining a fractional power of a rational number is not, in general, a possible 
one. 

As the simplest case of this impossibility of such operation, we may 
take the problem of finding the square root of a positive integer m which is 
not a square number. It can be shewn that such a number has no square 
root within the aggregate of rational numbers. 

If possible* let m be the square of a rational fraction pjq in its lowest 
terms; thus p 2 — mq 2 = 0. There always exists a positive integer A such 
that A 2 < m < (A -f l) 2 ; we then have Xq < p < (A -f 1 )q. 

Now let us consider the identity 

(mq — Xp) 2 — m (p — Xq) 2 — (A 2 — m) (p 2 — mq 2 ) = 0. 

From this identity it follows that m is the square of the rational number 
{mq — Xp)/(p — Xq), of which the denominator is less than q , and this is 
contrary to the hypothesis that m is the square of the fraction pjq which 
is in its lowest terms. It thus appears that there exists no rational number 
of which the square is m. 

On the formal side of Arithmetic, a demand for the extension of the 
domain of number arises from the impossibility of carrying out, with the 
requisite generality, certain operations, as in the example given above. 
Such extensions of the domain of number as are made when fractional, 
negative, irrational, and complex numbers are successively adjoined to the 
original integral numbers, are made in accordance with a principle known 
as that of the permanence of forms, which was first indicated by Peacockf, 

* This proof is given by Dedekind in his tract Stetiglceit und irrationale Zahlen. An extension 
of Dedekind's method to the case of nth roots has been given by S. M. Jacob; see Proc. Lond. 
Math. Soc. ( 2 ), vol. I (1903), p. 166. 

t British Association Report for 1834; also Symbolical Algebra , Cambridge (1845). 
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and further developed by Hankel*. This principle may be stated in the 
form that, in order to generalize the conception of number, the following 
four requisites must be satisfied: 

(1) Every operation which is represented by a formal expression in¬ 
volving the unextended class of numbers, and which does not result in the 
representation of a number of the unextended class, must have a meaning 
assigned to it of such a character that the formal expression may be dealt 
with according to the same rules as would be applicable if the expression 
represented one of the unextended class of numbers. 

(2) An extended definition of number must be given, such that a 
formal expression, as in (1), may represent a number in the extended sense 
of the term. 

(3) A proof must be given that for numbers of the extended class the 
same formal laws of operation hold as for the unextended class. 

(4) Definitions must be given of the meaning of greater, equal, and less, 
in the extended domain of number, these terms being taken in the ordinal 
sense. 

The arithmetical theory of irrational numbers has been developed in 
three main forms, of which the first| was given by Weierstrass in his 
lectures on Analytical Functions; the secondJ is that of G. Cantor, which 
was developed in further detail by Heine§, and was also developed in¬ 
dependently by Ch. Meray||; the third, that of R. Dedekind ^f, appeared 
about the same time as that of Cantor. We shall give an account of the 
theories of Dedekind and of Cantor, and shall shew that they are funda¬ 
mentally identical. 

kronecker’s scheme of arithmetization 

18. As it is now generally understood, the term “arithmetization” is 
used to denote the movement which has resulted in placing analysis on a 
basis free from all notions derived from the idea of measurable quantity, 
the fractional, negative, and irrational numbers being so defined that they 
depend ultimately upon the conception of integral number. An extreme 
theory of arithmetization has however been advocated by Kronecker**, 
who proposed the abolition of all modifications and extensions of the 

* See his Theorie der komplexen Zahlsysteme , Leipzig (1867). 

t For an account of this Theory see S. Pincherle, Giorn. di mat., vol. xvm (1880), p. 185; also 
O. Biermann, Theorie der analytisehen Funktionen , Leipzig (1887), p. 19. 

t Math. Annalen , vol. v (1872), p. 123; see also Math. Annalen, vol. xxi (1883), p. 545, where 
Cantor discusses all the three theories. 

§ Crelle's Journal , vol. lxxiv (1872), p. 172. 

|| Nouveau Precis d'Analyse infinitesimals, Paris (1872). 

Stetigkeit und irrationale Zahlen , Brunswick (1872). 

♦* See CrellFs Journal , vol. ci (1887), p. 337, “Ueber den Zahlbegriff.’’ 
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conception of number, the integral numbers being alone retained. His 
ideal* is that every theorem in analysis shall be stated as a relation be¬ 
tween integral numbers only, the terminology involved in the use of 
negative, fractional, and irrational numbers, being entirely removed. This 
ideal, if it were possible to attain it, would amount to a reversal of the 
actual historical course which the science has pursued; for all actual 
progress has depended upon successive generalizations of the notion of 
number, although these generalizations are now regarded as ultimately 
dependent on the whole number for their foundation. The abandonment 
of the inestimable advantages of the formal use in Analysis of the exten¬ 
sions of the notion of number could only be characterized as a species of 
Mathematical Nihilism. 

THE DEDEKIND THEORY OF IRRATIONAL NUMBERS 

19 . Let us consider the aggregate of all the rational numbers ordered 
in the manner which has been previously discussed, and let us take any 
one such number N. We may conceive all the rational numbers to be 
divided into two classes R x , and R 2 , such that every number of R x is, in 
the ordinal sense of the term, less than every number belonging to the 
second class R 2 , the two classes being separated by the number N, which 
may itself be assigned at choice either to the first or to the second class. 
If N belongs to the first class, it is the greatest number in that class, and 
the numbers of the second class have no number which is less than all 
the others of that class ; if N be taken to belong to the second class, it is 
the least number in that class, and there exists no number in the first 
class which is greater than all the others; for if any rational number less 
than N be taken, it is always possible to find another greater one which is 
less than N. Such a division of the rational numbers into two classes is called 
a section (Schnitt), and we therefore say that, corresponding to any given 
rational number there exists a section which divides the aggregate of rational 
numbers into two classes , such that all the numbers of the first class are less 
than all those of the second class; and such that either in the first class there 
is no greatest number, or else in the second class there is no least number . 

It can be shewn, by means of examples, that sections of the aggregate 
of rational numbers exist which are different in character from those just 

* He writes ( loc . cit. p. 338): “Und ich glauhe aueh, dass es dereinst gelingen wird, den 
gesammten Inhalt aller dieser mathematischen Disciplinen zvi ‘ arithmetisiren,’ d. h. einzig und 
allein auf den im engsten Sinne genommenen Zahlbegriff zu griinden, also die Modificationen und 
Erweiterungen dieses Begriffs (ich meine hier namentlich die Hinzunahme der irrationalen sowie 
der continuirlichen Grossen) wieder abzustreifen, welche zumeist duroh die Anwendungen auf die 
Geometric und Mechanik veranlasst worden sind.” He proceeds to shew in detail, how the notions 
of negative, fractional, and algebraical numbers can be avoided by substituting for equalities in 
which these numbers occur, congruences relative to certain moduli or systems of moduli. A similar 
suggestion has been made by Cauchy with reference to imaginary numbers. 
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described. If m is a positive integer which is not a square number, we may 
conceive the rational numbers to be divided into two classes, the first of 
which contains all the negative numbers and also those positive numbers 
of which the square is less than m, including zero; the second class contains 
all the positive numbers of which the square is greater than m. The first 
class contains no greatest number, and the second class contains no least 
number; this section is said to be related to an irrational number -y/ m , in 
the same way as a section such as has been considered above is related to 
a rational number. This example shews that sections of the rational 
numbers R exist, such that R is divided into two classes 1R lf R 2> where 
every number of R x is less than every number of R 2) and such that R t 
contains no number greater than all the others, and also R 2 contains no 
number less than all the others. 

A new aggregate of objects, the real numbers , may now be defined as 
follows: 

To every section (R x , R 2 ) of the aggregate R of rational numbers , such that 
every number of R belongs to one or other of the two classes R x , R 2 , and every 
number in R x is ordinally less than every number in R 2 , there corresponds a 
real number. 

In case neither R x contains a number which is ordinally greater than all 
the others in R x , nor R 2 contains a number which is ordinally less than all the 
others in R 2 , the real number corresponding to the section is said to be an 
irrational number. 

In case either R x has a greatest number x , or R 2 has a least number x y the 
section is said to define a real number corresponding to the rational number x. 

The real number which corresponds to a rational number x , though 
conceptually distinct from x , has no properties distinct from those of x, 
and is usually denoted by the same symbol. 

The definition of a real number can be put into a different and some¬ 
what less abstract form, by employing the notion of a lower segment of 
the aggregate of rational numbers. A lower segment of the aggregate R of 
rational numbers is any class of rational numbers which contains no 
number greater than all the others, and such that if any number whatever 
of the class be taken, the class contains all those numbers of R which are 
less than that number. A lower segment of R is identical with one of 
fWedekind’s classes R x , in case R r contains no greatest number. 

A real number may be defined* to be a lower segment of the aggregate R of 
rational numbers; and thus every real number , whether irrational or not , is a 
definite class of rational numbers. 

* This form of the definition is that given by B. Russell, see The Principles of Mathematics, 
vol. i (1903), chaps, xxui and xxiv; it was suggested by Peano, see Bivista di Matematica , vol. vi 
(1896-99), pp. 126-140. 
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In accordance with this definition, the real number 3, for example, is 
defined to be the aggregate of all rational numbers which are less than the 
rational number 3; the irrational number \/3 is defined as the aggregate of 
all rational numbers which are either negative, or if positive have their 
squares less than 3, the number zero being also included in the aggregate. 

The use, here adopted, of the term real number, is sanctioned by general 
usage. The employment of the term real has originated from the contrasting 
of these numbers, not with rational numbers, but with complex numbers. 
The extension of the term Number to the real numbers is justified by the 
fact that it is possible to define the operations of addition, multiplication, 
<fcc., for real numbers, so that the formal laws of these operations are in 
agreement with those which hold for operations within the domain of the 
rational numbers. 

20. It will now be shewn that the aggregate of real numbers, defined 
in Dedekind’s maimer, can be so ordered that every real number has a 
definite rank in the aggregate, i.e. of any two real numbers it is determinate 
which has the higher and which the lower rank. 

The basis of the scheme of order being taken to be the ordered aggregate 
of rational numbers, let us denote by n, n' any two real numbers, and let 
the sections by which they are defined be denoted by (R l9 R 2 ), (R x , R 2 ) 
respectively. 

The following eases may arise: 

(1) If (R x , R 2 ) and (R x , R 2 ) are identical, that is, if every number in 

is also in I?/, and every number in R 2 is also in R 2 , the two numbers 

n, n' are identical; thus n =n r . 

(2) Let us next suppose that there is one rational number r x = r 2 , 
which is contained in R x , but not in R x ; it is consequently contained in R 2 . 
All the numbers in R x are less than r 2 , and hence all the numbers in R x 
are in R 1 . Since r x is the only number in R x which is contained in R 2 , it 
follows that r x is greater than all the other numbers in R x ; and thus the 
number n defined by (R x , R 2 ) is a number corresponding to the rational 
number r x or r 2 . All the elements in R x are contained in R x , and are less 
than r 2 \ all the numbers in R 2 except r 2 ', are greater than r 2 ', for if not 
they would be contained in R x : hence the section (R x , R 2 ) defines the real 
number n' = n , corresponding to the rational number r 2 = r x . The two 
sections are essentially identical, the only difference being that the rational 
number r x ee r 2 , is regarded as belonging to the first class in one section and 
to the second class in the other section. 

(3) If there are two different numbers belonging to R x which also 
belong to R 2 , there are an indefinite number of other numbers which have 
the same property, since an unlimited number of rational numbers can be 



26 Number [ch. i 

found which lie between two given rational numbers. In this case we define 
the real number n or (R 1 , R 2 ) to be greater, in the ordinal sense of the term, 
than n' or (R x , R 2 ), agreeably with the definition already given for the 
rational numbers. 

The cases in which one, or more than one, number which belongs to 
R x also belongs to R 2 , may be treated in a similar manner; thus we define 
the meaning of the relation n < n '. It is easily seen that if n > n f , and 
n r > n ", then the relation n > n" is also satisfied. Thus the system of real 
numbers is arranged in a regular order, such that those of them which 
correspond to rational numbers have the same relative rank as the corre¬ 
sponding rational numbers have in the aggregate of rational numbers. 

21. The aggregate of real numbers has the following properties: 

(1) If a > /?, and p > y, then a > y. 

(2) Between any two real numbers «, y there are an unlimited number 
of real numbers. This is easily proved from the corresponding property 
of rational numbers, by considering the sections which define the numbers. 

(3) If a is a fixed real number, then all real numbers may be divided 
into two classes R } , R 2 , such that R x contains all the real numbers which 
are less than a, and R 2 contains all the real numbers which are greater 
than a. The number a may be regarded either as belonging to R x , in which 
case it is the greatest number in R x , or else as belonging to R 2 , in which 
case it is the least number in R 2 . This also follows from the definition 
above. 

(4) If the aggregate of real numbers falls into two classes R x , R 2 , such 
that every number of R x is less than every number of R 2 , then there exists 
one, and only one, number by which this section is produced. 

To prove this, we observe that the section (R lf R 2 ) of the aggregate of 
real numbers also defines a section (R x , R 2 ) of the aggregate of rational 
numbers, such that all rational numbers belonging to R x correspond to 
real numbers which belong to R x , and all numbers belonging to R 2 corre¬ 
spond to real numbers which belong to R 2 . 

Let N be the real number defined by the section (R y , R 2 ), and let N' 
be any real number different from N, defined by the section (R x , R 2 ). 
There are an indefinite number of rational numbers n which belong to 
only one of the aggregates R Xi R x ; let n be the real number corresponding 
to n. If N' < N, then n belongs to R x , and therefore n belongs to R x ; and 
since N' < n, it follows that N' belongs to R x . Similarly, if N' > N, we 
can shew that N' belongs to R 2 . It has thus been shewn that every number 
different from N belongs to R x or to R 2 , according as it is less or greater 
than N. Thus N is either the greatest number in R x or the least in R 29 
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and therefore N is the only number by which the section (R x , R 2 ) can be 
made. 

22. The operations between two real numbers may, in accordance 
with the above definition of real numbers by means of sections, be so 
defined that the result of each operation corresponds to a section of the 
rational numbers; thus the arithmetical operations are reduced to opera¬ 
tions with rational numbers. 

A complete theory of the operations involving real numbers can be 
established; and the formal laws of the operations can be shewn to be the 
same as in the case of the rational numbers, the range of possibility of 
operations being greater in the case of real than in that of rational numbers. 
This theory has been worked out to some extent by Dedekind: but as the 
Cantor theory of real numbers lends itself to a simpler detailed treatment 
of the operations than that of Dedekind, and as it will appear that the two 
theories are fundamentally equivalent to one another, it will be sufficient, 
as an example of the general method of treating operations in accordance 
with Dedekind’s theory, to take only the case of the addition of two real 
numbers. 

Let a, b be two real numbers defined by means of the sections (R x , R 2 ). 
(R x , R 2 ) respectively; then the sum a -f b , of a and 6, is defined by means 
of a section (R x ' y R 2 ") which satisfies the following conditions:—If c x is any 
rational number, it is put into the class R x ", provided there are two rational 
numbers a x in S l9 and b x in R x , such that a x ~f b x ^ c x ; all rational numbers 
c 2 for which this is not the case fall into the class R 2 ". Tt is clear that 
every number c x is less than every number c 2 , hence the section (R x ', R 2 ") 
is defined by means of this condition. 

It can be shewn that, when a , b both correspond to rational numbers, 
this definition is in agreement with the ordinary definition of the sum of 
two rational numbers, so that the sum of the numbers corresponds to the 
sum of the corresponding rational numbers. Every number c x in R x " , is 
a + b, because a x < a, b x ^ b, and therefore a x f b x £ a \ b. Further, if 
there were contained in R 2 " a number c 2 < a -f b, so that a 4- b — c 2 -f p, 
where p is a positive rational number, we should have 

c 2 = (<a - ip) + (6 - ip), 

and this is contrary to the definition of c 2 , because a — \p belongs to R x , 
and b — \p to R x ; thus every number c 2 in R 2 ", is £ a f b , and it has 
consequently been shewn that (R x ", R 2 ') defines the number a + b. As is 
usual, we have denoted the rational numbers a, b and the conceptually 
distinct real numbers a, b by the same symbols. 
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THE CANTOR THEORY OF IRRATIONAL NUMBERS 

23. The Cantor theory of irrational numbers essentially depends upon 
the use of convergent simply infinite ascending aggregates, or convergent 
sequences (Fundamentalreihen) in which the elements are rational 
numbers; we therefore proceed to define and discuss these aggregates. 

A simply infinite ascending aggregate (o 1 ,a 2 ,a 3 , ...a w , ...), in wjiich 
each element is a rational number , is said to be convergent , if it is such that , 
corresponding to any fixed arbitrarily chosen positive rational number e, as 
small , in the ordinal sense , as we please , an integer n can be found such that 
\an-an+Tn\<c>for m= 1,2,3,.... 

The symbol | x | is here used to denote that one of the two numbers 
x, — x which is positive; | x | is said to be the absolute value of x. 

This definition is equivalent to the statement that, in a simply infinite 
convergent aggregate, an element can always be found whose absolute 
difference from any element whatever which comes after it is as small as 
we please. 

It should be observed that the terms “as small as we please,” or 
“arbitrarily small,” as applied to a positive number which is at choice, 
have reference to the conception of order only, and not to the non-arith- 
metical notion of magnitude. These expressions denote only that the number 
can be so chosen as to be of lower rank than any other arbitrarily chosen 
positive number. 

To each value of € there corresponds a value of n, which will in general 
have to be increased when e is made smaller. 

We may denote the aggregate by the symbol {a n }, and shall speak of 
it shortly as a convergent sequence; that it is simply infinite will in future 
be understood. 

For a convergent sequence , corresponding to any arbitrarily chosen positive 
number c, an integer n can be found, such that , from and after that value of n, 
the absolute difference of any two elements is less than e. 

For choose n so that | a n — a n+m | < for all positive integral values 
of m; then | a n+m - a n+m . | s | a„ - a n+n | + | a n - a n+tn , | < e. 

For the convergent sequence {a n }, if we choose n such that 

| - An+m | < 

then for m = 1, 2, 3, , the value of a n+m for all values of m, lies between 

a n -f- e, and a n — e; that is to say, from and after some value of n, all the 
elements lie between two rational numbers whose difference is arbitrarily 
small. There exist therefore two positive numbers a , a\ of which the smaller 
a f may be zero, such that, from and after some fixed value of n , say 74 , all 
the elements lie in absolute value between a and a\ 
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It follows that: 

If {a w } be a convergent sequence, there exists a positive number N such 
that | a n | < N, for every value of N. 

For N need only be taken to be greater than all the numbers 
| a, |, | a 2 |, ... | o Bl |, a. 

24. If the aggregate (a x , a 2 , ... a n , ...) is such that, from and after some 
fixed element, each element is not greater than the following one, and if all the 
elements are less than some fixed number N, then the aggregate is a convergent 
sequence. 

For if the aggregate is not convergent, there must exist some positive 
number 8, such that an indefinite mimber of increasing values 7i 0 , n 1 , n 2 , ... 
of n can be found, for which | a ni — a„ 0 |, | a„ 2 — a ni |, | a n> — a n% | ... are 
all ^ 8. Since a ni — a Vo ,a n% — a ni , ... are all positive, we have a nr a no -f rS, 
where r can always be taken so large that a no -f rS > N, or a nr > N, which 
is contrary to the hypothesis. Hence the aggregate is convergent. 

It may in a similar manner be shewn that the aggregate is convergent 
if, from and after some fixed element, each element is not less than the 
following one, and if all the elements are greater than some fixed number. 

An aggregate (a 1 ,a 2 , ... a n , ...)*such that a 1 ^ a 2 s a 3 ^ ... is said to be 
a monotone non-diminishing sequence. If all the elements are less than 
some fixed number N, the sequence is convergent. Similarly if 

oq it* a 2 Og ... 

the aggregate is said to be a monotone non-increasing sequence. It is 
convergent if all the numbers a n are greater than some fixed number. 

If {a n }, {6 n } are two convergent sequences of rational numbers , a value of n 
can be found corresponding to any arbitrarily assigned number e, such that 
both | a w+tn — a n+m , | and | b n+m — b„ fm . | are less than e, m and ?n' having 
all positive values. 

For we have only to choose for n the greater of the two values corre¬ 
sponding to c, for each aggregate separately. 

25. It will now be shewn that the aggregates 

{a„ + b n },{a n -b n },{aM, {J”}. 

in which the elements are the sum , difference , product, and quotient, 
respectively of the corresponding elements of the two convergent sequences 
{a n }, {b n }, are also convergent sequences; with a certain restriction in the last 
case. 

We have | (a n ± b n ) - ( a n+m ± b n+m ) | * | a n - a n+m | + | b n - 6 n+tn |; 
now n can be so chosen for a given €, that for all values of m, 

| ®n+m 1 ^ 
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and | b n - b n+m | < hence so that | (a n ± b n ) - (a nAm ± 6, H J | < e; 
therefore the aggregates {a n -f- b n } , {a n — b n } are convergent. 

Again, 

J ®tbn | *“ j (^n ^nfm) ~f~ &n+m) | 

^ ® | 4 »n | "t ^ | | > 

where «, p are the two positive numbers which are such that | a n | < a , 

| I < P, for all values of n and m. 

We can tajce n so large that | b n — b nim | < 8, | a n ■— a n | < 8, where 


8 is at our choice, and may be taken to be 


P' 


Hence, for this value of 


n, 


a n b n ~ cin+mbn+m I < € > for every value of m \ and thus {a fl b n } has been 


shewn to be a convergent sequence. 

(a 


Lastly, in the case of we shall suppose that all the elements of 

{6,,} are numerically greater than some fixed positive number /?'. 

We have then 


(f>n 4 


]> n ) + - O 

bj>n+ m 


hence 


a n 

b n 


ct \b n - b n j. w | \ P j d n 

P ' 2 


If now n be chosen so that | b n — b n{ m |, | a n - ■ a n + m | are both less 

Q'2 

than - n e, for every value of rn, then for such a value of /?, 

a + P ’ J 


* T 

^11 f T 


< e; 


therefore j^ n j is a convergent sequence, provided | b n | is, for all values of 

n, greater than some fixed positive number P', which may be as small as 
we please, but must not be zero. 


26. The essence of Cantor’s theory consists in the postulation of the 
existence of an aggregate of objects for thought, the real numbers, ordered 
in a definite manner, which manner is assigned by means of certain pre¬ 
scribed rules. Any element of the aggregate of real numbers is regarded 
as capable of symbolical representation by means of a convergent sequence 
of which the elements are rational numbers; and the mode in which the 
aggregate of real numbers is ordered is specified by means of formal rules 
relating to these convergent sequences. The aggregate of real numbers 
contains within itself an aggregate of objects which is similar to the 
ordered aggregate of rational numbers which has already been considered, 
in the sense that to each rational number there corresponds a certain real 
number; and the relative order of any two rational numbers, in the ordered 
aggregate of rational numbers, is the same as the relative order of the two 
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corresponding real numbers in the new aggregate of real numbers. The 
rational numbers are frequently regarded as identical with the real numbers 
to which they correspond, and are denoted by the same symbols. In the 
development of Analysis, this identity leads to no difficulties; but, in the 
fundamental theory of the aggregate of real numbers, a conceptual dis¬ 
tinction between rational numbers and the real numbers to which they 
correspond must be made, in order to obviate logical difficulties, and 
especially with a view to coordinating Cantor’s theory with that of 
Dedekind. Those real numbers which do not correspond to rational numbers 
are called irrational numbers; and those real numbers which correspond to 
rational numbers are usually spoken of as themselves rational numbers. 

The rules by which the order of the real numbers in their aggregate is 
assigned are the following: 

(1) Any convergent sequence {«„}, of which the elements are rational 
numbers, is taken to represent a real number, which we may denote by a. 
Two such aggregates {a n }, {/>„} are taken to represent the same real number 
provided they satisfy the condition that, for any arbitrarily chosen positive 
rational number e, a value of n can be found such that | a n+m — b n+m | < e, 
for this value of rt, and for all values 0, 1, 2, 3, ..., of m. Symbolically*, 
we have {a n } ^ under the condition stated. 

(2) The real number represented by {a 7t } is regarded as of higher rank, 
or in the ordinal sense greater, than the real number represented by {b, n }, 
if a value of n can be found such that a n+rn — b nAm is, for this value of n , 
and for all values 0, 1, 2, 3, ..., of m , greater than some fixed positive 
rational number 8. If n can be so determined that a n+m — b n+m is negative 
and numerically greater than some fixed positive rational number 8, for 
every value of m, the number represented by {aj is taken to be less than 
that represented by {b n }. 

The aggregate (x, x, x, ...), or {a;}, in which all the elements are identical 
with one rational number x, represents, since it is a convergent sequence, 
a real number which corresponds to the rational number x. It is clear, 
from the definition of order in (2), that the relative order of any two 
rational numbers, in the aggregate of rational numbers, is the same as 
that of the real numbers which correspond to them, in the aggregate of 
real numbers. The aggregate of rational numbers, and that of the real 
numbers which correspond to them, are similar aggregates. 

Cantor’s theory of irrational numbers, in the form in which it was 
presented by himself and by Heine, has been criticized! on the ground 

* Those who hold the view, advocated by Heine and others (see § 7, note), that a real number 
is identical with the set of symbols by whieh it is represented, can attach no direct meaning to this 
equality. It can only be taken to indicate that the two expressions may be used indifferently in 
any operation which involves the number. 

f See B. Russell, The Principles of Mathematics, vol. i (1903), p. 285. 
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that an assumption is made that the sequence {x }, in which all the elements 
are the same rational number x , represents the rational number x itself, 
and that this amounts to an assumption that x is the limit of the sequence 
{#}; whereas the theory of arithmetical limits is represented by Cantor* 
as deducible from his theory of irrational numbers, and as not assumed 
in the construction of the theory itself. The theory in the form presented 
above is not open to this objection. 

It can be shewn that any two convergent sequences {o r ,}, {6 r ,} satisfy 
one or other of the conditions laid down in the above definitions of equality 
and inequality, i.e. symbolically {a n } g {&„}. 

For, as has been shewn in § 23, corresponding to any arbitrarily chosen 
positive rational number 8, a value of n can be found such that a nim lies 
between a n + 8, and a n — 8, and such that, for the same value of n. b n + m 
lies between b n -f 8, and b n — 8; from this it follows that, for such value of 
n, a n +m -- Kim lies between a n - b n 4 28 and a n - b n - 28; or a rl+m - b„ +m 
differs from a n — b n by not more than 28. If corresponding values of 8 and 
n can be found, for which a n — b n + 28, a n — b n — 28 have the same sign, 
then a n+m — b n+m has the same sign as a n — b n , and is numerically greater 
than a fixed number; the condition of inequality of {a n }, {b n } is then satisfied. 
If no such values of 8 and n can be found, then a nim — b n + m is numerically 
less than 48; and since 8 is arbitrarily small, the condition of equality of 
{a n }, {b n } is then satisfied. 

Although Cantor’s form of the theory of irrational numbers, or rather 
of real numbers, is more convenient for detailed development than is 
Dedekind’s form, yet it lies under the disadvantage that the nature of any 
single real number is veiled by the fact that, although it is a unique object, 
it is capable of representation by an unlimited number of convergent 
sequences, and therefore that the formal character of the theory does not 
make it clear what such a number really is. The comparison between the 
two theories whichf will be given later on will throw light upon this point: 
for it will be shewn that a convergent sequence of the rational numbers is 
sufficient to define a section, of the kind fundamental in Dedekind's 
theory; and this, as we have seen, is equivalent to the definition of a lower 
segment, which is itself a certain definite class of rational numbers. 

27. The sum a -f 6, of two real numbers represented by the sequences {a n \, 
{6 n }, is defined to be the real number represented by the sequence (a n -f b n ); 
and the difference a — b is defined as a number represented by {a n — b n ). 

It has been shewn in § 25 that the two sequences {a n + b n }, {a n — 6 n } 
are convergent. 

* See Math. Annalen, vol. xxl (1883), p. 568. 

t In Tannery’s work Introduction d la thdorie des fonctions d'une variable , Paris (1886 and 1904), 
chap, i, the theory of irrationals is treated by a combination of the two methods of Cantor and 
Dedekind. 
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If {a n }, {b n } represent the same number, the sequence {a n — b n } defines 
the real number zero; for the condition that | a n+m — b n+m | < e, where e 
is arbitrarily small, for a sufficiently great value of n, and for m ----- 0 , 1 , 2 , 
3, , is in this case satisfied. 

The product ab, of two real numbers , is defined to be the number repre¬ 
sented by the sequence {< a n b n }, which has been shewn in § 25 to be convergent. 

The quotient ajb is defined to be the number represented by the convergent 


sequence 


The only restriction on this last definition is that b is not to be zero; 
for, when this condition is satisfied, the elements of the sequence { 6 ,,}, 
which represents b , can be so chosen as to satisfy the restrictive condition 

given in § 25, that j~- n j may be convergent. 

It is necessary to shew that the sum a + b, the difference a — 6 , the 

product ab , and the quotient of two numbers a, 5, as they have been 

defined above, are definite numbers independent of the particular con¬ 
vergent sequences used to represent the numbers a and b. Thus it must 
be shewn that if {a n } = {«„'}, {5 n } — { b n f }, then 

{a n + b n } - {af + bf J, {a n - b n } - {af - b n {a n b n } - {a„V}> 

We have 

! (®n+m i b n+m ) (a n +m i b n+m ) j s* j a n+m a n + m | H* | b n+m b n + m j. 
Now n can be so chosen, corresponding to a fixed number e, that 

| a„+m ~ I < R | b n+m - b' n+m | < for m - 0 , 1 , 2 , 3, ...; 
with this value of n , we now have the condition 

i i ^n+m) (® ntm i b n+m) | ^ 

satisfied; and this is the condition that {a n ± b n } represents the same number 
as {a n ' ± b n '}. 

Again, 

| ^n+m^n+m ® n+ra^ n +m I — | @n+m {bn+m b n+m) | "h | b n + m (&n+m ® «+m) | 


< A I b n 


B \ a n 


where A , J5 are fixed positive numbers. It is now clear that n may be so 
chosen that | a n+m b n+m — a' n+m b' n + m | < 77 , where 77 is an arbitrarily chosen 
positive number; thus {a n b n }, {a n 'b n '} represent the same number. 

Again, 

fl’n+m a n+m j < tt w+m /» _ 1/ \ „/ \ 

I. jLf I =* i. jl f l y n+ni u n+m) ' i. l/ \ a n+m a n+nj > 

y n+m w n+m ‘ u n+m w n+m ^n+m u n+m 
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whence it can easily be seen that the condition is satisfied that 



represent the same number. 

It is readily seen that the same commutative, associative, and dis¬ 
tributive laws hold for the operations between real numbers as for those 
involving rational numbers. 

28. If, from and after some fixed element a n , all the elements of {a n } are 
greater than some fixed positive rational number 8, then the real number repre¬ 
sented by {a 7t } is positive , i.e. it is ordinally greater than zero. 

For, if we take any convergent sequence {8„} which defines the number 
zero, we have f a n } > {b r) \ ; because, for some fixed value of n, a n lm — 6 n4m 
is certainly positive for all values of m, and is greater than a fixed positive 
number; since n can be taken so large that a n+m > 8, and b n , m < S', where 
8' is a positive rational number chosen less than 8. 

Similarly, it may be shewn that the number defined by {a is negative , 
if, from and after some fixed value of n, all the a n are negative and numerically 
greater than some fixed positive rational number 8. 

The term “numerically greater” denotes that | a n | > 8 and thus refers 
to the absolute values of the numbers concerned. 

It is easily seen that, unless {a n } is such that, from and after some fixed 
value of n , all the dements have the same sign , then {a n } must represent the 
number zero. 

If {a n }, {b n \ define two different real numbers a, 6, then there lie between 
a, b an unlimited number of those real numbers which correspond to rational 
numbers. 

Suppose a > b, then there exist a definite rational positive number 8, 
and a least integer n , such that, for all positive integral values of m in- 
eluding zero, a n+m - 6 n+m a 8, | a n - a n+m | < e, | b n - b n+m | < e, where 
€ is a rational number chosen to be < |8. If we take any rational number 
x , which is < 8, and > e, the number {a n — x }, in which all the elements 
are identical, lies between {a n } and {6 n }, since, for every value of m, we 
have a n+m — (a n — x) > x — c ; therefore a is greater than the real number 
which corresponds toa n — x. Again, 

(a n - x) - b n+m = (a n - b n ) + (b n - b n+m ) - x > $ - € - x; 

therefore, provided x is chosen to be < 8 — e, the real number which 
corresponds to a n — x is greater than b, and thus lies between a and b. 
The rational number a n — x may be chosen in an unlimited number of 
ways, since x is any rational number whatever which lies between 8 — e 
and e. To obtain one such number we may take e = J8, x = |8. 
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CONVERGENT SEQUENCES OF REAE NUMBERS 

29. Convergent sequences will now be considered, of which the 
elements are real numbers, it might at first sight be imagined that we 
should be led, by the employment of such sequences, to a further extension 
of the domain of number; it will however be seen that this is not the case. 

The definition of a convergent sequence of real numbers is precisely 
similar to the definition which has been given in the case of sequences of 
rational numbers ; thus (cq, a 2 , ... a n , ...) is a convergent sequence of real 
numbers, provided that, corresponding to each arbitrarily chosen positive 
real number 7 ], a value of n can be found such that | ct n — a n]7n | < 77, for 
m — 1 , 2, 3, .... If we conceive that each such convergent sequence of 
real numbers represents a single ideal object, and if we give definitions 
of equality and inequality, and of the fundamental operations, precisely 
analogous to those given in § 2f> and § 27, and assume as before that a 
convergent sequence in which all the elements are identical with the real 
number a is taken to represent that one of the new aggregate of objects 
which corresponds to a, it will be shewn that the new aggregate of objects 
is similar to the aggregate of real numbers, i.e. to each of the new objects 
there corresponds one of the real numbers, and also that the relation of order 
between corresponding pairs of elements in the two aggregates is the same. 
It thus appears that the aggregate of new objects is practically identical 
with the aggregate of real numbers, since the two are ordinally similar. 
No such relation has been shewn to exist between the aggregate of real 
numbers and that of rational numbers; and it will be shewn later, in con¬ 
nection with the general theory of order-types, that no such relation of 
similarity can exist. Therefore the passage from rational numbers to real 
numbers involves a real extension of the domain of number; but the 
passage from real numbers to an aggregate of objects represented, in 
accordance with the rules referred to above, by convergent sequences 
of real numbers, does not lead to any essential extension of the domain of 
number. 

Let {a n } be any convergent sequence of rational numbers, and let { d n } 
denote the sequence of those real numbers which correspond to the rational 
numbers which form the elements of {»„}. It can easily be shewn that 
\a n ] is a convergent sequence: for, if € is an arbitrarily chosen positive 
rational number, and e the corresponding real number, the condition of 
convergence of {a w } is that, for every c, a value of n can be found such 

that a n+TO lies between a n + e, a n — e, for m = 1, 2 , 3, _ It follows from 

this that a n+m lies between d n + e, d n — e; and this ensures the convergence 
of the sequence {a w }. Conversely, we see that if {a n } is convergent, so also 
is {a n }. 
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Next, let {«„} be a convergent sequence of real numbers; then, between 
a n and a n+1 , a real number a n can be found which corresponds to a rational 
number a n , determined as at the end of § 28. Conceive this to be done for 
every pair of consecutive elements in {«„}, and let us consider the sequences 
• 

Since (l n d n+m ~ ffin ®n) (^n "b ^n+m)> 

we have | a n - « w+m \ £ \a n - a n \ + \a n - a n + m | 4- | a n+m - d n+m | . 

Now, corresponding to any real positive number 8 , n may be so chosen 
that for every value of ra, | d n - a n |, \ a n - a n+m |, | a n+m - d n , m | are 
each less than $8 ; hence, for such a value of n, 

| a n - ci n+m | < 8, for m = 1, 2, 3, ..., 
thus the sequence {<i n } is convergent. 

Again, {a n } - {d n \ = {a n - a n }, and | a n - U n \<\a n ~ a n +i |, 
and, since {a n } is convergent, n may be chosen so great that, for that and 
all higher values of n, all the differences \ a n — a n+l \ are less than an 
arbitrarily fixed number, hence | a n — d n | satisfies the same condition; 
and therefore the two convergent sequences {a n }, {d n } satisfy the condition 
of equality ; or they represent the same one of the new objects. It has 
been shewn above that, since {a n } is convergent, so also is {«„}. Now {a n } 
corresponds to a single real number a; therefore, to any convergent 
sequence {a n }., of which the elements are real numbers, there corresponds 
a real number a. 

We have further to shew that, if {a n }, {/S n } are two convergent sequences 
of real numbers, and a, b the corresponding real numbers, as just deter¬ 
mined, then a % 6, according as {a n } g {/3 n }. 

We know that a g 6, according as {a n } g {&„}, where {b n } denotes the 
sequence of rational numbers which defines 6, in the same way as {a n } 
defines a. Now {a n } — {fi n } = {a n — fi n }, and 

(X n — (d n a„) + (b n fi n ) -j- {ci n l) n ) , 

and we can choose n so large that | a n — a n \ and | b n — fi n \ are each less 
than Irj, where rj is an arbitrarily chosen real positive number: therefore 
we see that a n — fi n lies between a n — b n + rj and d n — b n — rj. It follows 
easily that {a n } g {/? n }, according as {a n } g {b n }, or according as {a n } g {6 n }, 
and hence, as shewn above, according as a g b. It has now been shewn that 
the objects which are represented by convergent sequences of real numbers 
have the same ordinal relation to one another as the real numbers to which 
those sequences have been shewn to correspond. 

It appears, from what has now been proved, that, to every convergent 
sequence of real numbers there corresponds a real number which may be taken 
to be defined by means of that sequence. 
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There does not necessarily exist any rational number which corre¬ 
sponds in the same sense to a convergent sequence of rational numbers. 
The property of the aggregate of real numbers here stated embodies the 
characteristic difference between that aggregate and the aggregate of 
rational numbers; for the latter does not possess the corresponding 
property. It is this property of the aggregate of real numbers which makes 
it suitable to be the field of the real variable in the Theory of Functions. 


THE ARITHMETICAL THEORY OF LIMITS 

30. If x 2 , x 3 , ... x n , ... is a sequence of real numbers such that a 
number x exists which has the property that , corresponding to any arbitrarily 
chosen positive number e, a value of n can be found such that | x — x n |, 
I x — x n+i | > | x — x n +2 | > ••• are l ess than e, then the number x is said to 

be the limit of the sequence x x , x 2 , ... x n ,_ This fact may be denoted by 

the equation x = lim x n . 

n~oo 

This definition is known as the arithmetical definition of a limit, and 
was first given*, in a form substantially identical with the above, by John 
Wallis. 

It will be observed that the above definition contains no assertion as to 
the necessary existence of a limit of a sequence of numbers, but contains 
only a statement as to the relation of the limit to the numbers of the 
sequence, in case that limit exists. 

There cannot be two numbers which both satisfy the condition of being 
a limit of the same sequence. For, if possible, let x, x' be two such numbers 
and let | x — x' | = 8. Choose a value of e, less than then numbers 
n, n r can be found such that | x — x n + m |, | x/ — x n , +m |, for all values 
0 , 1, 2, 3, ... of m, are less than e. Suppose n > n', then | x — x n | and 
| x f — x n | are both less than e; hence | x — x’ | < 2e < 8, which is con¬ 
trary to the condition | x — x' | = 8. 

It will now be shewn that, if the numbers of the sequence {x n } are real 
numbers, and if the sequence is a convergent one, then the real number x 
defined in the manner explained in § 29, by the sequence {x n }, is the limit 
of the sequence. 

For the two sequences {x n }, {x} both define the same number x, and 
therefore satisfy the condition of equality, which is that | x — x n+m | < €, 
for any arbitrarily chosen e, provided n be sufficiently great; and this is 
the condition that x should be the limit of the sequence {x n }. A sequence 

* Arithmetica Infinitorum (1655), Prop. 43, Lemma. See M. Cantor’s Oeschichie der Mathe- 
matik , Leipzig (1892), vol. n, p. 823. 
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of real numbers which has a limit must be convergent. For, if x is the 
limit of {, x n }, then for a sufficiently large value of n , 

J X — X n J , J X X n (i | ) ••• | X Xn+m | > ••• 
are all less than Jc, where e is arbitrarily chosen; now 
| Xn - x n+m | * | a? - *„ | -f I X - :r n+w I; 
hence | x n — x n+m | < e, which is the condition of convergence of {.£„}. 

As the complete result we have now the theorem known as the General 
Principle of Convergence* : 

The necessary and sufficient condition that a sequence x lf x 2f ... x n , ... of 
real numbers may have a limit is that , corresponding to every arbitrarily 
chosen positive number £, a value of n can be found such that x n — x n¥1 , 
x n — #n+ 2 i x n — # nf3 , ... shall be all numerically less than e. 

This theorem, which contains the criterion for the existence of a limit, 
as defined in accordance with the arithmetical definition of a limit, is a 
deduction from Cantor’s theory of real numbers. 

31. If the numbers of a sequence {:r n } are rational numbers, instead 
of real numbers, the definition of the limit is applicable, and it is a neces¬ 
sary, but not a sufficient, condition for the existence of the limit, that the 
sequence should be convergent. Strictly speaking, if a convergent sequence 
of rational numbers has a limit, that limit is also a rational number; but 
from the existence of convergent sequences of rational numbers which 
have no limit there arises the necessity for the extension of the domain 
of number, so that in the extended domain every convergent sequence 
may have a limit; this extension has been carried out by substituting Real 
Number for Rational Number. However, although a convergent sequence 
of rational numbers which has no rational limit, has in this strict sense 
no limit at all, by reason of the convergent sequence of those real numbers 
which correspond to the rational numbers having an irrational number as 
limit, and since, as has been seen above, these real numbers are for practical 
purposes not distinguished from the rational numbers to which they 
correspond, it is usual to consider this irrational number to be the limit 
of the sequence of rational numbers. We may thus assert that any con¬ 
vergent sequence of rational numbers which has not a rational number as 
limit has an irrational number as its limit. This assertion is a correct one 
for the practical purposes of Mathematical Analysis. 

32. The method of limits, which is essential both to pure Analysis and 
to the applications of Analysis in Geometry and in Kinetics, had a geo¬ 
metrical origin in the Method of Exhaustions, which was applied by the 
Greek geometers to determine lengths, areas, and volumes, in simple cases. 

* This term “das allgemeine Convergenzprinzip ” is due to P. Du Bois-Reymond; see his 
Allgemeine Functionenlheorie, Tubingen (1882). 
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This method, supplemented by the notion of the numerically infinite , was 
developed in later times, in various forms, into a general method which 
formed the basis of the Infinitesimal Calculus. The traditional geometrical 
conception of a limit may be exemplified by the case of the determination 
of the length of a curve as the limit of a sequence of properly chosen 
inscribed polygons. The lengths of the perimeters of the polygons are 
regarded as continually approaching the required length of the curve, 
whilst the number of sides of the polygons is continually increased, and 
the maximum length of the sides of a polygon is diminished indefinitely. 
The limit, the length of the curve, is then regarded as actually reached 
at the end of a process described as making the number of sides of the 
polygon infinite; this mode of attainment of the limit being however in¬ 
accessible to the sensuous imagination, and disguising an actual qualitative 
change of a geometrical figure, which possesses corners and is bounded by 
segments of straight lines, into one which has no corners and has a curvi¬ 
linear boundary. No doubt was felt as to the existence of the limit, which 
was regarded as obvious from geometrical intuition. That a curve possesses 
a length, or an area, was considered to require no proof. The first mathe¬ 
matician who recognized the necessity for a proof of the existence of a 
limit was Cauchy, who gave a proof of the existence of the integral of a 
continuous function. That the logical basis of the traditional method of 
limits is defective has in recent times received a posteriori confirmation by 
the exhibition of continuous functions which possess no differential coeffi¬ 
cient, and by many other cases of exception to what were regarded as 
ordinary results of analysis resting on the method of limits, which have 
been brought to light by those mathematicians who have been engaged 
in examining the foundations of analysis. 

The arithmetical theory of limits, which is summed up in the general 
principle of Convergence, provides a definite criterion for the existence of 
the limit of a sequence of numbers; and a considerable part of modern 
analysis is concerned with obtaining special forms of the general criterion 
adapted for use in special classes of cases. The theory is essentially de¬ 
pendent upon the theory of irrational numbers; for, in default of an arith¬ 
metical theory of irrational numbers, all attempts to prove* the existence 
of a limit of a convergent sequence are doomed to inevitable failure; and 
this for the simple reason that a convergent sequence of rational numbers 
does not necessarily possess a limit which is within the domain of such 
numbers. The definition of real numbers by means of convergent sequences 
of rational numbers is not a mere postulation of the existence of limits to 
such sequences; it involves rather the introduction of an enlarged con¬ 
ception of number, of such a character that the scheme of ordered real 

* An interesting discussion of various methods which have been suggested of proving the 
existence of a limit will be found in Du Bois-Reymond’s Allgemeine Functionentheorie 
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numbers should form a consistent whole, and such that every convergent 
sequence of numbers in the domain of real number necessarily has a limit 
within that domain. The postulation of the existence of the aggregate of 
real numbers is justified by shewing that a complete scheme of definitions 
and postulates can be set up for the elements of this aggregate, and that 
such a scheme does not lead to contradiction*. As regards the existence 
of limits in the case of lengths, areas, volumes, &c., referred to above, the 
order of procedure is a reversal of the traditional one, the existence of the 
limit being no longer inferred from geometrical intuition. For example, 
in the case of the determination of the length of a curve, that length is 
not assumed to be independently known to exist, but is defined as the 
arithmetical limit of the sequence of numbers which represent the peri¬ 
meters of a suitable sequence of inscribed polygons. When this sequence 
is convergent, and its limit is independent of the particular choice of the 
polygons, subject to a suitable restriction, then the limit so obtained 
determines the length of the curve. In case no such limit exists, the curve 
is regarded as not having a length. 


EQUIVALENCE OF THE DEFINITIONS OF DEDEKIND AND CANTOR 

33. In order to establish the equivalence of the definitions of irrational 
numbers, as given by Dedekind and by Cantor, it must be shewn that 
every convergent sequence of rational numbers defines uniquely a section 
of all the rational numbers, and that this section is the same for all con¬ 
vergent sequences which represent the same real number in accordance 
with rule (1) in §26. Conversely, it must be shewn that any number 
defined by a section can also be represented by a convergent sequence of 
rational numbers. 

To shew that , corresponding to the convergent sequence {x n } which, in 
accordance with the Cantor theory, defines the real number x , a section 
can be found:—Let r be any rational number, and let r be the correspond¬ 
ing real number represented by {r} . The real number x — r is represented 
by {x n — r}\ and if this number is not zero, then (see § 28), from and after 
some fixed value of n, x n — r has a fixed sign, positive or negative according 
to the value of r. A section of the rational numbers may now be defined 
as follows:—Let every number r such that x n — r is negative, from and 
after some fixed value of n , be placed in the class R 2 ; and let every number 
for which x n — r is positive, from and after some fixed value of n , be 
placed in the class . If there exists a rational number r, such that neither 
of these cases arises, then x = f, and r may be put into either of the classes 

* On this mode of regarding the aggregate of real numbers as dependent upon a complete 
consistent scheme of definitions and axioms, see Hilbert, “Ueber den Zahlbegriff,” Jahr&tber. d. 
dtuUch. math. Vereinigung , vol. vm (1900), p. 180. 
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R x , R 2 . It has thus been shewn that a section of the rational numbers can 
be determined, corresponding to the convergent sequence {# n }. 

Next, let {x n '} be any other convergent sequence which represents the 
same real number x, as {x w } does. We have to shew that the section of the 
rational numbers which corresponds to {x n r } is identical with that which 
corresponds to {x n }. If, as before, r denote any rational number, we have 
{x n — r} = {x n ' — r }. Now a value of n can be found, from and after which 
x n — r and x n ' — r both have fixed signs independent of n , and they must 
have the same sign. It follows that a number r which belongs to the class 
R l , must also belong to the class R x ', by which the section corresponding 
to {x n '} is defined; and also a number r which belongs to the class R 2 , 
necessarily belongs to R 2 , except in the case {x n } = {x n ’} = r. It has thus 
been shewn that the section (R x , R 2 ) which corresponds to {a; n } is identical 
with the section ( R x , R 2 ') which corresponds to {#„'}. 

34. To shew that a convergent sequence can always be found such as 
to define the number corresponding to a given section (R x , R 2 ), we observe 
that two rational numbers can always be found, one of which is in R l9 and 
the other in R 2 , and such that their difference is numerically less than a 
given arbitrarily small rational number e. Let A be any rational number 
in R 1 , and let e' be a rational number < e. Then of the numbers A 4- e', 
A f 2c', ... A 4- re', ... there must be a last one A 4- re' which falls in R X) 
for A + ne' may be made as large as we please by taking n large enough; 
the next number A -1- (r 4- 1) e' is then in R 2 \ and these numbers A -f re', 
A (r 4- 1) e', whose difference is e' < e, are the two numbers required. 
Moreover, if B is a rational number in i? 2 , the two numbers may be so 
determined that both lie between A and B; for we need only take e' to 

be of the form - (B — A), where s is a positive integer so chosen that 
s 

Now let {e n } be any convergent aggregate of rational numbers, which 
has zero for its limit. Determine x x in R x , and x 2 in R 2 , so that x 2 — x x < e 2 ; 
next take x 2 in R x , and x A in R 2 , so that | x 3 — x 4 | < e 2 ; and that x 3 , x t 
both lie between x x and x 2 . Proceeding in this way, we can determine 
x 2 n-i i x 2 n rational numbers of different classes, so that | x 2n _ x ~ x 2n | < e n ; 
then either of the sequences , x 3 , x 5 , ...}, {x 2i x 4 , ...} defines the number 
which is represented by the section (R x , R 2 ). 

To prove this, we observe that {x 2n _ x } is a convergent sequence, since 
all the elements are < x 2 , and x x < x 3 < x h .... 

Again, suppose a is a rational number belonging to R 2 , we can shew 
that, provided a rational number b exists in R 2 which is less than a, then 
a is greater than all the numbers x Xi x 3 , ... by more than a — b. For 
a - x 2 n -1 = (a - b) -f (b - x 2n _ x ) > a - b, 
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however small b — # 2 w-i may become. Hence, unless a is the smallest 
rational number in R 2 , the real number {a} which corresponds to a is 
greater than the number x 3 , ...). 

Again, the sequences {a? 2n _ l }, {x 2n } represent the same number, since 
their difference is the aggregate {£„} which defines zero. It now appears, 
by reasoning similar to the above, that any number a in R x is such that the 
real number {a} is less than the number {x 2n }, unless a is the greatest 
rational number in R 1 . 

If either R x has a greatest rational number, or R 2 has a least one, the 
real number {a) which corresponds to this rational number a , is itself 
defined by (R x , R 2 ), and is the number represented by either of the se¬ 
quences {^ 2 n-il» {# 2 n}- In any case, either of these two sequences defines 
the number given by the section (R l , R 2 ). 

The complete equivalence of the two theories of Dedekind and of 
Cantor has now been established. The first theory operates with the whole 
aggregate of rational numbers, the second with sequences selected out of 
that aggregate. 


THE NON-EXISTENCE OF INFINITESIMALS 

35. It should be remarked that, in assuming that every section of the 
aggregate of real numbers defines a single real number, it has been im¬ 
plicitly assumed that if a, b are any two positive real numbers, such that 
a < b, then a positive integer n can be found such that na > b. 

This is the arithmetical analogue of the so-called principle of Archi¬ 
medes. 

If any real numbers existed which are ordinally greater than all the 
numbers a, 2a, 3 a, ..., then a section of the aggregate of real numbers 
would be defined by considering all numbers greater than all the numbers 
a , 2 a, 3 a, ... to be in one class, and all the remaining real numbers to be 
in the other class; and this section would define a real number N. If now 
e be an arbitrarily chosen positive number less than a, then N — e is a 
number which is less than some of the numbers a, 2a, 3 a, ...; and there 
must be a first of this set of numbers such that N — e is less than it. Let 
this be pa; thus N — e < pa, hence N < pa -f e < (p -f 1 ) a; which is 
contrary to the hypothesis that no number na is in the class of numbers 
which are > N . 

The property of the aggregate of real numbers which has been estab¬ 
lished may be denoted by the statement that the aggregate of real numbers 
forms an Archimedean system; and this property of the aggregate is 
essentially equivalent to the property that every section of the aggregate 
defines a single number of the aggregate. 
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A consequence of the fact that the aggregate of real numbers forms an 
Archimedean system is that so-called infinitesimal numbers do not exist 
within the aggregate. Every positive number e, being such that an integer 
n can be found such that ne > 1 , is a finite number, in the sense in which 
finite numbers were distinguished from infinitesimals in the older forms 
of the Infinitesimal Calculus. In Arithmetical Analysis the conception of 
the actually infinitesimal has no place. When the expression “infinitesimal” 
is used at all, it is to describe the process by which a variable to which 
the numbers of a sequence converging to zero are successively ascribed, as 
values, approaches the limit zero; thus an infinitesimal is a variable in a 
state of flux, never a number. Such a form of expression, appealing as it 
does to a mode of thinking which is essentially noil-arithmetical, is better 
avoided. 


THE THEORY OF INDTCES 


36. When m is a positive integer, and x a rational number, x m was 
defined to denote a: x x x x x ... x x(m factors); and this definition maybe 
extended to the case in which x is any number defined by a convergent 
sequence; so that if x is defined by {:r n }, x m is defined by {x n m } . It thus 
appears that for any real number x 9 we have, provided m and n are positive 
integers, x m x x n — x m+n . 


If we assume xP and x~ m to be defined as having such a meaning that 
this law of indices holds when m or n is zero, or a negative integer, we can 
at once interpret xP and x~ m ; for 

xP X X n ~ X th0 — X n , thus xP -- 1 ; 


and 


x~ n x x n — xP = 1, thus x~ n = 


x n 


p 

When pjq is a rational fraction, we shall define xP to have such a mean¬ 
ing that the above law of indices holds when either or both of m, n may be 
rational fractions. With this assumption 

p p v p 

xP x xP x xPx ... x xP (q factors) = x p ; 

v P 

hence (xP) q = x v ‘ ; or xP is, if it exists, a number whose gth power is x v . The 

p 

problem of determining, if possible, a number xP , is that of finding a 
number whose gth power is a given number; and it has been already 
shewn that this is not always a possible operation within the domain of 
rational numbers. 

It will now be shewn that, in the domain of real numbers, the operation 
p 

of finding x q is always a possible one when x is positive; and also when x is 
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negative, provided however that , in this latter case , q is an odd number , or if it 
is even , p is not odd. 

The following lemma will be required :—If a is any real positive number 
less than unity , a positive integer m can be found such that a m < e, where e is 
an arbitrarily prescribed positive number , or m other words , lim a n = 0. 

oo 

Since a n > a n+1 , the sequence (a, a 2 , ... a n , ...) is convergent. 

Suppose, if possible, that the sequence represents a positive number k 
different from zero; then m may be so chosen that a m , a wfl , ... all differ 
from k by less than the arbitrarily prescribed number 8, say a m — k + 17, 
where rj < 8. We have therefore a m+1 = (fc 4 - rj) a < (Jc 4- 8) a; now 8 can 

be chosen to be equal to ^ ^ , then a w+1 < & — 8; and this is contrary 

to the condition imposed in the choice of m. 

It follows that k cannot be different from zero; and thus the lemma is 
established. 


Suppose now that a is any positive number, rational or not, which 
lies between N 9 and (N 4- l) fl , where N is a positive integer; we shall 
first shew that a number N -f h, where h < 1, can always be found such 
that a — (N + h) Q is positive, and less than a — N q . We find by division 
(N + h) 9 - N* - {(N + h) - N] {(N + hy- 1 + (N + h) Q - 2 N + ... -f N Q ~ 1 } ; 
hence, if h is positive and less than unity, (N 4- h) 9 — N Q lies between 
qhN q ~ l and qh (N + l) 0-1 . 

Since a — (N 4- h) 9 = (a — N 9 ) — {(iV' 4- h) q — N «}, 


d — JtyQ 

we must take h not greater than —, in order that a — (N 4- h) Q 
may certainly be positive; and the difference a — (N 4- h) q is then less 
than (a - N") - qhN"~K 


a — N q 


q(N 4- l)^ 1 ’ 


Let h — 

then a — (N 4- h) q < (a — jV*) |l 

Let JV X =* N + h, then N t is such that 


N 

N -h 


,n- 


a — N t q < (a — N q ) 


{'-(An 


and 


> N. 


In a similar manner, we can shew that a number N 2 exists which is > N lt 
and such that 


a - N 2 q < (a — Nj 9 ) j 1 


nn 
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Proceeding in this manner, we obtain a series of numbers N, N x , N 2 , ... 
N r> ... such that N r > N r _ x , and that a — N r q is positive, and less than 


(a - N r _S) 



We shall now shew that (N, N x , N 2 , ... N r , ...) is a convergent sequence 
which defines a number whose qth power is a. 


The sequence {N r } is convergent, since N r > N r _ x , and every N r is less 
than N f 1. The qth power of the number defined by this convergent 
sequence is {N r Q }, and we shall shew that this defines the number a or {a }. 


We have 

a — N r q < (a 

1 - 




/ N y r 

U l) 


< (a - N") 


/ jv y-»1' 
If-i iy 


N N r _ 

N + l < iV r _ J + 1 


and hence 1 - yj" > 1 - f '■ 

Now 1 — I n | J is a proper fraction, hence from the lemma proved 

above we infer that a power r of the expression can be found which is less 
than an arbitrarily chosen positive number; which number we may take 

to be € • Hence, corresponding to every e, a number r can be found 

such that a — N q T + 9 < e, for s = 0, 1, 2, ..., and therefore the sequence 
{N r q } defines the number {a} or a. 


If a is a positive proper fraction, we have (a 2 ) q < a ; hence we may take 
N to be equal to a 2 , instead of to a positive integer. Then a < (N -f l) 7 ; 
thus this value of N will play the same part as the integral value in the 
above proof, and the reasoning is the same as before. 


37. It has now been shewn that in every case a real number can be 
found of which the qth power is a given positive number a. It thus appears 
v 

that & has an interpretation within the domain of real numbers, when x 

is any positive number, and ^ is a positive rational fraction. 

j _p 

We interpret x g to be such that 

_p p 

x 9 x ^ = = 1, 

p p 

X g = 1/x 9 . 
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V 

If x is a negative number — x', we have (— x') q , defined as a number 
whose </lh power is (- - x ) v ; and ( - x') p is x fp or — x' v , according as p is 
even or od 1. 

v v 

If p is even, (— x) q can be interpreted as the value of x' q . If p is odd, 
v v 

and q is odd, (— x') q may be interpreted as — x ,q . When p is odd, and 


v 


q is even, we have obtained no interpretation of (- x') q . 

2 r I l 

To complete the theory of indices in such a way that (— x') may 
have an interpretation, we should require a further extension of the con¬ 
ception of number. This further extension takes place by the introduction 
of complex number, which is however outside the limits imposed upon this 
work as a treatise dealing only with real number. 


38. The only case in which x ”, for a positive x , has not been defined, 
is that in which n is not a rational number. To extend the definition to 
this case, we suppose n to be defined by a convergent sequence { n T \, in 
which all the numbers n, are rational. We shall shew that the aggregate 
{x nr } is convergent, and the number which it defines we shall denote by x n . 

We have x nr — x tlri * — x Ur {1 — x nr ' s rir }\ now, since {n r } is a convergent 
aggregate, all the numbers n r are numerically less than some fixed number, 
and therefore | x Ur | < A, where A is some fixed number. 

First suppose x > 1, then 

X Hr — X nr + ' ='X nrr • (x nr ~ nr ^ — 1) = X nr (l — X nr *-•-**), 

hence | x nr — x nr * | < A | x l nr ~ Wrf *l — 1 |. 

Now let r be so chosen that, for all values of s, 


I n r -n r+s \ A, 

where q is a positive integer; then 


i 

X\"r~ n r .\ - 1 < J( fl _ 1 < 


1 


X — 1 

1 2 y l ’ 

+ 3fi + & + ... + X q ~ 


hence 


or 


r ln r -n r ,i| 


1 I < 


X l 


• x 1Xr '» I < A 


and if q is chosen so that - < € 


q 

x — 1 

q 9 

, where e is a fixed number, we see 


q A (x — 1) 

that r may be so chosen that | x nr — x tlr | < e, for all values of s ; there¬ 
fore {x**} is a convergent sequence. 
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If x < 1, then jj^j is a convergent sequence, and therefore {x nr } is 

also convergent, since it is the quotient of {1} and If x = 1, then 

{# Wr } 1. Thus in every case { x nr } is a convergent sequence if {w r } is con¬ 

vergent. 

It is easily seen that, if {n/} is any other convergent sequence which 
also converges to n, the two sequences {x nr }, {x nv '} both define the same 
number. Thus the value of x n is independent of the particular sequence 
of rational numbers employed in defining n. 

Since {# r ' r } {x mr ) — {x nr + mr } y we see that the definition of x n , when 
n is not rational, is such that the relation x m x x n — x m + n is satisfied. 


THE REPRESENTATION OF REAL NUMBERS 


39. The ordinary mode of representation of a real number is by means 
of a decimal, or more generally by a radix-fraction. When the decimal is 
non-terminating, this mode of representation is a case of the representation 
by a convergent sequence of rational numbers, in accordance with Cantor’s 
theory. For example, the number tt is represented by the sequence 


(3, 3 1, 3-141, 3-1415, 3-14159, 3-141592, ...), 
where, by known processes, any prescribed element can be found as the 
result of a definite number of arithmetical operations. 

The general theorem will be established that every positive non-rational 
real number N is uniquely representable by means of a non-terminating series 
of radix-fractions , of which r, the radix , is any integer 2. 

Of the numbers 0, r, 2r, 3r, ..., there is (see § 35), of all those which 
are less than rN, a greatest one c 0 r, which may be zero; thus 
rN > c 0 r, and < (c 0 -f 1) r; 

N 

it follows that N — c 0 -f - 1 , 

r 

where N- l is a positive number less than r. 

In a similar manner we obtain 


7VT ^2 

N i = C 1 -h , 
r 


N, 


c 2 -f 3 , ... N n — c n H-~ l , 

r r 


where N 2 , N 3) ... A 7 n . fl are all < r \ therefore 

Ci Co . c n N. 


A T — r _l ^ _i_ 1^4. XT _ n fl 

* ~ °0 + " + r2 + ••• + r „ + r „ +1 > 


where c 0 , c l9 < 
0 < JV- 4.1 < r. 


c n are each of them positive integral, or zero, and 


Since 


*-(c + S + 3 + ... + &)<i, 
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and it has been shewn that - has the limit zero, as n is indefinitely 

r n 

increased, we see that the sequence, of which the nth element is 

Ci Co c « 

c 0 -}- -f- - 2 ~f •.. 4~ “ j 

u y j'li 

is convergent, and represents the real number N. This is expressed by 

A =C 0 -t- r + ri + ... + ^+ •••, 

in which N is represented by a non-terminating radix-fraction. 

Let us now consider the case in which N is a rational number a . , in its 

ft 

lowest term. We have a — a 0 b -f ft, where ft < b; and rft = a x b -f ft, 
where ft < 6; rft - a 2 b f- ft, ..., rft_ x = a n ft + ft, where ft, ft, ... /ft are 
all less than ft. 

If one of the numbers ft say ft, is zero, we have 


O' a* ( 

.V = , « Oo + ~ l + ; 


-h 


and thus 2V 7 is expressed in terminating radix-fractions; this case can only 
arise when b contains only prime factors of r. The terminating series of 
radix-fractions can be replaced by a periodic one which does not terminate. 
For if we use a n — 1 instead of a v , as the numerator of r n , we have 

rpn -1 ^ K - 1) 6 + 6; 
thus ft becomes b instead of zero, and 

rft (r — 1) ft + ft; 

thus ft,ft +1 , ... are ail equal to ft; and a n+2 > ••• are all equal to 
r — 1. Thus N is represented by 


a a, a* a,, — lr—lr — 

= P«0 + “ + r 2 h ••• + ~ r n r «4l r n +2 + ... 


It thus appears that a rational number , which in its lowest terms has a 
denominator which contains only prime factors of r, capable of a double 
representation; (1) by a terminating series of radix-fractions; (2) by a non¬ 
terminating series of radix-fractions, of which the numerators after some fixed 
one are all r — 1. 

In case none of the numbers ft, ft, ... ft, ... vanishes, it is clear that 
since all these numbers are either 1, 2, 3, ... ft — 1, they cannot be all 
unequal. Suppose ft is the first which is repeated, and let ft — ft fm ; it 
is then clear that ft +1 « ft+m+i, ft +2 = ft+m+2> ; and therefore the 

number is represented by a recurring series of radix-fractions. 


40. When a number is defined by means of a convergent sequence of 
some special form, it is in general not immediately obvious whether the 
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number is rational or irrational. Many special investigations relating to 
particular cases, and various general criteria, have been given by well- 
known mathematicians. 

One of the most important modes of such representation of a number 
is that by an endless continued fraction. This fraction may be regarded 
as an aggregate, each element of which is a finite continued fraction. 
Legendre established the fundamental theorem that a number represented 
by an endless continued fraction 

a 2 €L n 

b l -L b 2 ± b s :f b n : 4 - 

that is, by an aggregate of which the nt\\ element is 

a, a 2 a n 

bi _b b 2 rt: ^3 rt b n 

is irrational*, provided the positive integers a n , b n are such that for every 
value of n, b„ — a n ^ 1 ; except that when b n — a„ — 1 , for every value of 
n > m, where m is some fixed number, and when at the same time the 

signs before all the fractions , for n > m, are negative, then the con¬ 
tinued fraction converges to unity, or to a rational fraction, according as 
m 1 , or m > 1 . 

This theorem contains as special cases the theorems previously estab¬ 
lished by Lambert, that tana;, log p x, tan 1 x, tt are irrational for 
rational values of x. The irrationality of e and e 2 was first proved by Euler f. 
Legendre J himself applied the general theorem to prove the irrationality 
of 7 T- 2 , although his proof was lacking in rigour. 

The following general theorem has been proved § by Cantor: 

If 6 , 6 ', b", ... form a set of positive integers such that, q being any 
arbitrarily chosen integer, all the numbers 1 , b, bb\ bb'b", ..., from and 
after some fixed number of the sequence, are divisible by q; then any 
number N can be uniquely represented by 

/ + A + Ji. . V . 

+ b + bb , + bb’b" + •••’ 

where / is an integer, and A, /z, v, ... are integers (including 0 ) such that 

A < b — 1 , n < b' — 1 , v ^ b" — 1 ,_ 

Further, in order that the number N may be rational, it is necessary that, 

* A proof of this theorem is given by Pringsheim, “Ueber die Convergenz unendlieher Ketten- 
briiche,” Sitzungsberichte d. bayer . Akad. vol. xxvn (1897), p. 318. 

t t)n the history of these theorems see Pringsheim’s article “Ueber die ersten Beweise der 
Irration&litat von e und tt,” Sitzungsberichte d. bayer, Akad. vol. xxvn (1897). 

t See his iSUments de Qiom&trie, Note 4; see also Rudio’s work, “Archimedes, Huygens, 
Lambert, Legendre” (1892), p. 166. 

§ Schldmilch s Zeitschrift , vol. xiv (1869), p. 121, “Ueber die einfachen Zahlensystenie.” 
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from and after some fixed term of the series, all the numbers A, /x, v t ... 
have their highest possible values. If this condition is not satisfied, N is 
irrational. 

As an example of this theorem, the number e represented by 


1 


1 


2 H-1- -f 

2 2.3 2.3.4 

is seen to be irrational. 

Another mode of representation is that in which the sequence of 
integers 6, 6', 6", from a particular element onwards, is periodic. In 
this case, the necessary and sufficient condition that the number repre¬ 
sented by p p 

b + bb' + bb'b" + 

should be rational, is that the sequence /S, /T, f3", ... be, from and after 
some fixed number of the sequence, periodic. This is a generalization of 
the theorem relating to a number represented by radix-fractions. 

If b ~ 2, b' = 3, 6" — 4, ..., we obtain the theorem* that the number 
represented by 

C 1 , C '2 , C 3 , , C n , 

2! + 3! + 4! 4 - r n! + -’ 

where c n & n - 1, is rational, only if, from and after some particular value 
of n, c n = n — 1. 

A mode of representation of numbers by sequence of products has been 
given f by Cantor. He shews that every number N > 1, can be uniquely 
represented in the form 


( l+ ;)( 1 + s )( 1+ ;)( l + s )"" 


where a, 6, c, ... are integers such that 

b £ a 2 , c £ 6 2 , d £ c 2 , .... 




Na 


The number a is determined as the integral part of ^—y. If y = B, 
JB Bb 

b is the integral part of g--; if j- y = C, c is the integral part of 

C A 

^ --; and so on. 

O — 1 

As an example, V2 is represented by 

(* + 3) { l + 17) i 1 + 577) (* + 665857) 

where 17 = 2.3* - 1, 577 = 2.17*-!, 665857 = 2.577*-!,.... 


* See Stephanos, Bulletin de la soc. math, de France , vol. vii (1879). For further information 
on the history of this subject see Pringsheim’s article I. A. 3, in the Encyclopadie der math. 
W iasenschaften. 

f Schlomilch’s Zeitschrift , vol. xiv (1869), p. 162, “Zwei Satze.. 



4o, 4 i] Representation of Real Numbers 51 

The criterion for determining whether N is rational or irrational is the 
following: 

The number represented by 

( ,+ a)( l + l'( 1+ c i )- 

where b £ a 2 , c ^ b 2 , ..., 

all the numbers a, b,c, ... being positive integers, is rational if, from and 
after some fixed number of the sequence a, 6, c, ..., each number is the 
square of the preceding number of the sequence; but the number is 
irrational if this condition is not satisfied. 

THE CONTINUUM OF REAL NUMBERS 

41. If a l9 b x are any two real numbers such that a x < b x , then two real 
numbers a 2 , b 2 , (a 2 < 6 2 ), can be found both lying between a lf b 1} and such 
that the difference between a 2 , b 2 is as small as we please, i.e . b 2 — a 2 < e, 
where e is an arbitrarily prescribed number. Between a 2 , b 2i two more 
numbers a 3 , b 3 , (a 3 < & 3 ), can be found whose difference is again as small 
as we please; and this process may be carried on indefinitely. This property 
of the aggregate of real numbers may be expressed, to use the term 
introduced by G. Cantor, by saying that the aggregate of real numbers is 
connex ; it arises from the fact that an indefinite series of numbers can be 
found which lie between any two given numbers. If we anticipate a term 
which will be introduced when we come to the general theory of aggregates, 
the property of connexity may be expressed by saying that the aggregate of 
real numbers is everywtiere dense. 

It will further be observed that the aggregate of rational numbers is 
also connex; so that, as far as this property is concerned, there is nothing 
to differentiate the one aggregate from the other. 

If the difference of a n and b n is denoted by e„, and the sequence 
€ 1 , e 2 > • • • *« > • • • satisfies the condition that, corresponding to any fixed 
arbitrarily small positive number rj , a value of n can be found such that 
€ n , c w+1 , ... are all less than rj, then there exists a single real number x 
which is greater than all the numbers a t , a 2 , ..., and less than all the 

numbers b lt b 2i _ This number x is the limit of either of the sequences 

(Oj, Oj, ... a n , ...) and (b l9 b 2 , ... 6 n , ...), and is defined by a section of all 
the real numbers. 

If we confine ourselves to the domain of rational numbers, there 
subsists in that domain no such property; that is, the above numbers a, b 
being all rational, no such rational number as x necessarily exists. 

In the domain of Real Number, (a), every convergent sequence has a 
limit which is a number belonging to the domain, and (6), every number is 
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the limit of properly chosen sequences of numbers belonging to the 
domain. The possession by the domain of real numbers of these properties 
(a) and (6) is expressed by saying that the aggregate of real numbers is 
•perfect. 

The domain of Rational Number possesses the property ( b) but not the 
property (a); consequently the aggregate of rational numbers is not perfect. 

From the point of view of Dedekiml’s theory, the property that the 
aggregate of real numbers is perfect expresses the fact that every section 
of the real numbers corresponds to a single real number, and the converse. 
A section of the rational numbers does not always correspond to a rational 
number; consequently the aggregate of rational numbers is not perfect. 

We here give the name continuum * to an aggregate which possesses the 
two properties of being connex, and of being perfect. This is in the first 
instance taken to be the definition of the meaning of the word continuum, 
as it is frequently used in Analysis. Thus the aggregate of real numbers 
forms a continuum; whereas the aggregate of rational numbers is essentially 
discrete, and does not form a continuum, since one of the two essential 
properties of a continuum is absent. 

The aggregate of real numbers is spoken of as the continuum of real 
numbers , or the arithmetic continuum. 

The real numbers which lie between two numbers a, b do not form a 
continuum in accordance with the above definition ; but if the two numbers 
a, b themselves are considered to be included in the total aggregate, then 
this completed aggregate does form a continuum. 

It should be remarked that, in accordance with a somewhat different 
definition of the term continuum, employed by Weierstrass, and which 
will be referred to in Chapter n, the real numbers between a and b form a 
continuum. 

All the real numbers x such that a ^ x % 6, in the ordinal sense of the 
symbols <, =, >, are said, to form an interval (a, b); and such an interval is 
frequently described as a closed interval. 

The real numbers x which are such that a < x < b, are frequently said 
to form an open interval (a, 6). 

The closed interval (a, b) is a continuum, since it satisfies the two 
necessary conditions for the applicability of the term; but the open 
interval (a, b) is not a continuum in this sense of the term, as it contains 
convergent sequences which have no limit belonging to the open interval. 
Such an open interval has been termed by Cantor a semi-continuum , but, 
in accordance with the observation made above, it may be termed a 
Weierstrassian continuum. 


* See Cantor, Math. Annalen , vol. xxi (1883), p. 576. 
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Of the two essential properties of the arithmetic continuum, that of 
connexity , and that denoted by the term perfect , the latter is absolutely 
indispensable, in order that the arithmetic continuum may be suitable to 
be the field of operations in analysis. It will appear, when we come to the 
consideration of the theory of functions of a real variable, that many of 
the most important properties of a function may still subsist even if the 
domain of the variable lacks the property of connexity; but that such 
properties would not belong to functions of a variable which is defined 
for a domain such that convergent sequences of numbers in it possess no 
limit within that domain, and which therefore lacks the property of being 
perfect. This is the more remarkable on account of the fact that, in the 
older traditional notion of a continuum, the property of connexity was 
the one which was regarded as all important; the more essential property 
of being perfect has only been explicitly formulated in the course of the 
construction of the modern arithmetical theory. 

42. The term arithmetic continuum is used to denote the aggregate of 
real numbers, because it is held that the system of numbers of this aggre¬ 
gate is adequate for the complete analytical representation of what is 
known as continuous magnitude. The theory of the arithmetic continuum 
has been criticized on the ground that it is an attempt to find the continu¬ 
ous within the domain of number, whereas number is essentially discrete. 
♦Such an objection presupposes the existence of some independent con¬ 
ception of the continuum, with which that of the aggregate of real numbers 
(am be compared. At the time when the theory of the arithmetic continuum 
was developed, the only conception of the continuum which was extant 
was that of the continuum as given by intuition; but this, as we shall 
shew, is too vague a conception to be fitted for an object of exact; mathe¬ 
matical thought, until its character as a pure intuitional datum has been 
clarified by exact definitions and axioms. The discussions connected with 
arithmetization have led to the construction of abstract theories* of 
measurable quantity; and these all involve the use of some system of 
arithmetic, as providing the necessary language for the description of the 
relations of magnitudes and quantities. It would thus appear to be highly 
probable that, whatever abstract conception of the intuitional continuum 
of quantity and magnitude may be developed, a parallel conception of 
the arithmetic continuum, though not necessarily identical with the one 
which we have discussed, will be required. To any such scheme of numbers, 
the same objection might be raised as has been referred to above; but if 
the objection were a valid one, the complete representation of continuous 
magnitudes by numbers would, under any theory of such magnitudes, be 

* See 0. Holder, Die Axiome der Quantitat und die Lehre vom Mass, Leipziger Beriohte, 
vol. liii (1901); also Veronese’s work, Fondamenli di Geomctria , Padua (1891); and Bettazzi’s 
work, Teoria delle grandezze (1890). 



54 


Number 


[ch. i 

impossible. It is clear that it is only in connection with an exact abstract 
theory of magnitude that any question as to the adequacy of the con¬ 
tinuum of real numbers for the measurement of magnitudes can arise. 
For actual measurement of physical, or of spatial, or temporal magnitudes, 
the rational numbers are sufficient; such measurement being essentially 
of an approximate character only, the degree of error depending upon the 
accuracy of the instruments employed. 

The purely ordinal nature of the conception of the arithmetic con¬ 
tinuum, including the ordinal character of an interval, has been pointed 
out in the course of the development of the theory. This will be further 
elucidated in connection with the abstract theory of order-types, to be 
discussed in Chapter iv. 

THE CONTINUUM GIVEN BY INTUITION 

43. Before the development of analysis was made to rest upon a 
purely arithmetical basis, it was usually considered that the field of 
operations was the continuum given by our intuition of extensive mag¬ 
nitude, especially of spatial or temporal magnitude, and of the motion of 
bodies through space. 

The intuitive idea of continuous motion implies that, in order that a 
body may pass from one position A to another position B , it must pass 
through every intermediate position in its path. An attempt to answer 
the question, what is meant by every intermediate position, reveals the 
essential difficulties of this conception, and gives rise to a demand for an 
exact theoretical treatment of continuous magnitude. 

The implication continued in the idea of continuous motion shews that, 
between A and B, other positions A\ B' exist, which the body must 
occupy at definite times; that between A' y B ', other such positions exist, 
and so on. The intuitive notion of the continuum, and that of continuous 
motion, negate the idea that such a process of subdivision can be conceived 
of as having a definite termination. The view is prevalent that the intuitional 
notions of continuity and of continuous motion are fundamental and sui 
generis ; and that they are incapable of being exhaustively described by a 
scheme of specification of positions. Nevertheless, the aspect of the con¬ 
tinuum as a field of possible positions is the one which is accessible to 
Arithmetic Analysis, and with which alone Mathematical Analysis is 
directly concerned. That property of the intuitional continuum, which 
may be described as unlimited divisibility, is the only one that is im¬ 
mediately available for use in Mathematical thought; and this property 
is not sufficient for the purposes in view, until it has been supplemented 
by a system of axioms and definitions which shall suffice to provide a 
complete and exact description of the possible positions of points and 
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other geometrical objects which can be determined in space. Such a 
scheme constitutes an abstract theory of spatial magnitude. 

The exact theory of magnitude was developed to a considerable extent 
by Euclid; but not until recently, under the influence of the ideas of the 
arithmetical theory, has it been perfected in a form which exhibits the 
exact system of axioms and definitions necessary for a characterization 
of continuity that is adequate for mathematical analysis. Besides the 
arithmetic theory of number, there exists at the present time a theory of 
magnitude which runs to a certain extent parallel with the former theory. 
Some mathematicians* still prefer to regard number as primarily repre¬ 
senting the ratio of two magnitudes; but they nevertheless to a large 
extent employ the methods of arithmetical analysis. 

THE STRAIGHT LINE AS A CONTINUUM 

44. Although it is no part of the plan of the present work to enter 
fully into the general theory of Magnitude, it is necessary briefly to con¬ 
sider the case of those magnitudes which are segments of a straight line; 
that straight line which is the ideal object of geometry, and which is the 
ideal counterpart of the physical straight line of perception. 

The length of the segment between two points A, B, of a straight line 
is a particular case of a magnitude; and we shall take this conception as a 
datum, subject to a set of axioms! relating to the notions of congruency, 
and to the notions greater and less as applied to magnitudes. 

We assume that any number of congruent segments OA, AB, BC, ... 
can be constructed on the straight line; and that any segment OA can be 
divided into any number of segments which are all congruent to one 
another. 

Any segment OA may be taken as the unit of length, so that its 
magnitude is represented by the number 1; its multiples OB, OC, ... are 
denoted by the numbers 2, 3, .... If each one of the segments OA, AB , 
BC, ... be divided into the same number q of equal parts, then, if P is a 
point of division, OP is denoted by a fractional number pjq, where p is 
the number of the sub-segments in OP. Thus when p, q are any positive 
integral numbers, pjq represents a definite magnitude OP, the unit 
magnitude OA having been fixed upon beforehand. 

Further, the number pjq may also be regarded as representing the 
position of the point P itself. In order to represent points of the straight 
line on both sides of 0 , the convention is made that points on one side 

* P. Du Bois-Reymond in his Allgemeine Functionentheorie , Tubingen (1882), strongly ad¬ 
vocates the view that linear magnitude forms the basis of the conception of Number. See also 
Stolz, Vorle&ungen iiber allgemeine Arithmeiik , Leipzig (1885-86), where both views of Number are 
developed. See also G. Ascoli, Rend. 1st. Lomb. (2), xxvm (1895). 

t These axioms are discussed by 0. Holder, Leipziger Berichte , vol. liii (1901). 
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of 0 shall be represented by positive numbers, and those on the other 
side by negative numbers; thus if P is on the right of 0, and P' on the 
left of 0, and if OP = OP' y the point P' is represented by the number 
— p/q. The length of any segment of the straight line, whose ends are 
points to which rational numbers have been assigned in the manner 
explained above, is the difference of the above two numbers. In this 
manner, we have a correspondence established between the aggregate of 
rational numbers and an aggregate of points on the straight line, the 
relation of order being conserved in the correspondence, so that the two 
aggregates are similar. 

The set of points, thus represented by rational numbers, we may speak 
of as the rational points of the straight line ; but it must be remembered 
that a definite origin 0 , and a definite unit of length OA , are supposed to 
have been fixed upon beforehand; and if these be altered, the set of rational 
points will in general be altered also. 

It has been assumed as an axiom that, if P X Q X is any segment of the 
straight line, it may be divided into any number n x of equal parts: of these, 
if P 2 Q 2 be taken as one, the same axiom asserts that P 2 Q 2 may be similarly 
divided into any number, n 2 , of equal parts, P 3 Q 3 being one of the parts; 
and that this process may be repeated an unlimited number of times. The 
axiom is equivalent to an assumption that the straight line is capable of 
unlimited divisibility; and this, being a characteristic property of the 
intuitional linear continuum, must also hold for its ideal counterpart, the 
straight line which we are here considering. 

We proceed to assume as another axiom that, P X Q X , P 2 Qz> P 3 Q 3 , ••• 
being the segments constructed as above, there exists in the straight line 
one point X, and one only, which separates all the points P x , P 2 , P 3 , ... 

from all the points Q Jf Q 2 , Q 3 , _ If 7 be any point other than X , then 

points belonging to the sequence P 1 ,P 2 ,P 3 , ... and points belonging to 
the sequence Q x , Q 2 , Q 3 , ... can be found which are both on the same side 
of 7. 

The point X may be regarded as the limit of either sequence of points; 
and the property corresponds to that property of the arithmetic con¬ 
tinuum which is expressed by saying that it is perfect. 

In accordance with this axiom there is one single point on the straight 
line which corresponds to any given real number; and this point, or the 
magnitude of the corresponding segment, may be represented by the real 
number. 

This axiom has been stated by Dedekind, in a form corresponding to 
his definition of an irrational number: that a section of the rational points, 
in which they are divided into two classes, is made by a single point. 
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Another form of the axiom is that known as the Axiom of Archimedes *: 
that if AB , A'B' are any two segments of the straight line, of which AB is 
the smaller one, an integer n can always be found such that n. AB > A'B'. 
As in the case of the arithmetic continuum, this is equivalent to the 
negation of the existence of infinitesimal segments of the straight line. 

This axiom being assumed, there is a complete correspondence between 
the points of the straight line and the aggregate of real numbers. Thus the 
nature of the linear continuum, that is, so far as its possible parts, and the 
possible positions in it, are concerned, is completely represented and 
described by means of the arithmetic continuum, the axioms relating to 
the straight line having been so chosen that this may be the case. It will 
be observed that there is no real disparity between the rational points and 
the irrational points of the straight line; a point, which with one origin 
and one unit of length, is a rational point, may be an irrational point if 
another origin, or another unit of length, be chosen. 

45. The mode which has been adopted above, of establishing a com¬ 
plete correspondence between the aggregate of real numbers and the 
aggregate of points in a straight line, though the most convenient mode, 
is not the oidy possible one. All that is really necessary for the corre¬ 
spondence is that, in accordance with some systematic scheme, the points 
in the straight line shall be made to correspond with the numbers of the 
arithmetic continuum, in such a way that the relation of order is conserved 
in the correspondence. 11 is not necessary that the difference of two numbers 
should represent the length of the segment of the straight line which is 
terminated by the points that correspond to the two numbers. The mode 
of correspondence given above is however the simplest one, and will there¬ 
fore be adopted for the purpose of enabling us to use the language of 
geometry in analytical discussion. 

In the case of space of two or of three dimensions, it will be assumed as 
axiomatic that one point of the space, and one only, corresponds to each 
pair or triplet of real numbers which represent Cartesian coordinates. This 
axiom may be considered as fundamental in the Cartesian system of 
analytical geometry. 

The disputable idea that the theory here explained necessarily implies 
that a continuum is to be regarded as made up of points, which are 
elements not possessing magnitude, has frequently been a stumbling-block 
in the way of the acceptance of the view of the spatial continuum which 
has been indicated above. It has been held that, if space is to be regarded 
as made up of elements, these elements must themselves possess spatial 
character; and this view has given rise to various theories of infinitesimals 

* The importance of the Axiom of Archimedes in this connection was pointed out and discussed 
by Stob% Math. Annalen, vols. xxn (1883), p. 504 and xxxix (1891), p. 107. 
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or of indivisibles, as components of spatial magnitude. The most modern 
and complete theory of this kind has been developed by Veronese*, and 
is based upon a denial of the principle of Archimedes which has been already 
referred to. In Veronese’s system, when a unit segment of a straight line 
has been chosen, there exist segments which are too large, and others that 
are too small, to be capable of representation by finite numbers; and these 
segments are respectively infinite, and infinitesimal, relatively to the unit 
segment chosen. Under this scheme, a section of the rational points, or 
a section of the points represented by real numbers, is made, not by a 
single point, but by an infinitesimal segment. Veronese has consequently 
introduced systems of infinite and of infinitesimal numbers, each of an 
unlimited number of orders, for the measurement of segments which, 
relatively to a given scale, are infinite or infinitesimal. From his point of 
view, the points on a straight line which represent the real numbers form 
only a relative continuum, i.e. one which is relative to the particular scale 
of measurement employed; and he contemplates the conception of an 
absolute continuum, for the representation of which his series of sets of 
infinite and infinitesimal numbers are requisite. A segment, which in a 
given scale is finite, may be infinitesimal, or infinite of any order, when 
measured relatively to another scale. 

The validity of Veronese’s system has been criticized by Cantor and 
others on the ground that the definitions contained in it, relating to 
equality and inequality, lead to contradiction; it is however unnecessary 
for our purpose to enter into the controversy on this point. The straight 
line of geometry is an ideal object of which any properties whatever may 
be postulated, provided that they satisfy the conditions, (1), that they form 
a valid scheme, i.e . one which does not lead to contradiction, and (2), that 
the object defined is such that it is not in contradiction with empirical 
straightness and linearity. There is no a priori objection to the existence 
of two or more such adequate conceptual systems, each self-consistent, 
even if they be incompatible with one another; but of such rival schemes 
the simplest will naturally be chosen for actual use. Assuming then the 
possibility of setting up a valid non-Archimedean system for the straight 
line, still the simpler system, in which the principle of Archimedes is 
assumed, is to be preferred, because it gives a simpler conception of the 
nature of the straight line, and is adequate for the purposes for which it 
was devised. The case of the non-Euclidean systems of geometry is an 
instance of the existence of valid geometrical schemes divergent from one 
another, which nevertheless all afford a sufficient representation of physical 
space-percepts. 

* See his Fondamenti di Geometria, Padua (1891); a German translation by Schepp has been 
published in Leipzig (1894). 
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An answer to the difficult question, in what sense the straight line, or 
a space of two or of three dimensions, admits of being regarded as an 
aggregate of points, can only be discussed after a full treatment of the 
nature and properties of infinite aggregates has been developed. The 
discussions in Chapters n, m and iv, of infinite aggregates, and especially 
of the notion of the poiver or cardinal number of such an aggregate, will 
throw light upon this subject. 



CHAPTER II 


DESCRIPTIVE PROPERTIES OF SETS OF POINTS 

46. An aggregate of real numbers, each element of which consists of 
a single real number, is defined by any prescribed set of rules or specifica¬ 
tions which are of such a nature that, when any real number whatever is 
arbitrarily assigned, they theoretically suffice to determine whether such 
real number does or does not belong to the aggregate. The difficulty of 
regarding an aggregate, so defined, as a definite object, is bound up with 
the difficulties connected with the notion of the linear continuum, i.e. the 
aggregate of all real numbers, out of which the defined aggregate is to be 
obtained by a process of selection which, except in the case of a finite 
aggregate, can never be actually carried out in its entirety, but which is 
determined by a rule or set of rules. The precise scope of the definition 
will be rendered clearer by the consideration of various classes of actually 
defined aggregates which will be considered in the present Chapter; more¬ 
over, the theoretical difficulties of the notion of such an aggregate, in 
general, will be in some measure elucidated by the discussions in the 
present and the following Chapters, of the notion of the power, or cardinal 
number, of an aggregate. 

In accordance with the principle explained in § 44, each number of a 
given aggregate may be represented by a single point on a fixed straight 
line; thus to an aggregate of numbers there corresponds an aggregate of 
points on the straight line. An aggregate of single numbers, or of their 
equivalent points, we shall speak of as a linear set of points. 

The theory of linear sets of points, of which the present ('hapter 
contains an account, arose historically from the discussion of questions con¬ 
nected with the theory of Fourier’s series and of the functions which can 
be represented by such series. A consideration of the properties and 
peculiarities of the sets of points at which infinities or other discontinuities 
of such functions exist, led to a study of the properties of linear sets in 
general, and to the development by G. Cantor, P. Du Bois-Reymond, 
Bendixson, Harnack, and others, of a general theory which has lately 
received wide applications both in Analysis and in Geometry. 

Corresponding to the theory of linear sets of points, there exist theories 
of plane, solid, or ^-dimensional sets of points. Each element of an 
aggregate in ^-dimensions consists of an association of p real numbers 
(x a \ x (2) , ... x (v) ), and such an element is spoken of as a point in jp-dimen- 
sional space. The p-dimensional continuum consists of the aggregate 
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of all such points, when each of the numbers x (l \ x (2) , ... x (p) may have 
any value in the linear arithmetic continuum. 

A considerable part of the theory of sets of points in two or more 
dimensions is completely parallel to the theory of linear sets of points; the 
proofs of many theorems for linear sets of points being applicable without 
essential modification to the case of sets in any number of dimensions. 
This remark applies both to the descriptive properties of sets considered 
in the present Chapter and to the metric properties which will be discussed 
in Chapter m. At a certain stage of the descriptive theory, however, the 
properties of linear sets of points exhibit a simplicity which the properties 
of sets in two or more dimensions do not possess. The greater complexity 
in the case of two or more dimensions, than in one dimension, corresponds 
to the greater complexity of the Analysis Situs for space of two or more 
dimensions than in the case of linear space. 

In the present Chapter an account will be given both of those descrip¬ 
tive properties which are practically independent of the number of dimen¬ 
sions of the sets, and of those properties in which essential divergence 
exists between the case of linear sets and that of sets in two or more 
dimensions. 

The whole theory of sets of points is essentially an arithmetical theory; 
the geometrical nomenclature and representation is a matter of convenience, 
not of necessity. 


THE UTTER AND LOWER BOUNDARIES OF A LINEAR SET OF T0INTS 

47 . A simple case of a linear set of points is that in which the set 
consists of all the points of a linear interval (a, b) either closed or open, in 
accordance with the definition of such an interval given in §41. 

Thus the set of points which form a closed interval (a, b) consists of 
all points x such that a ^ x ^ b; and the set of points of an open interval 
(a, 6) consists of all points x such that a < x < b. Either of the sets of 
points x for which a ^ x <b, or a < x £ b, may be said to be the points of 
a semi-closed interval (a, 6), open at 6 or at a. 

A point x of the set forming a closed interval (a, b) is said to be in the 
interval or segment (a, b). A point x of the set forming an open interval 
(a, b) is said to be within, or interior to, the interval or segment (a, 6), or 
is said to be in the open interval (a, 6). 

Let a set of points be such that every point of the set lies upon a 
straight line, the position of each point being determined by its distance 
from a fixed origin upon the straight line, in the manner explained in § 44. 
If a point exists, such that no number of the set is greater than /?, the 
set is said to be bounded on the right. In this case it will be shewn that 
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there is a definite point b , such that no point of the set is on the right of 
by and such that either (1), b is itself a point of the set, or else (2), that 
points of the set are within the interval (b - e, b), however small the 
positive number € may be taken to be; or that both the conditions (1) and 
(2) are satisfied. 

The point b may or may not itself be a point of the given set. In either 
case it is said to be the upper boundary of the given set. If b is itself a 
point of the given set, it is said to be the upper extreme point of the set. 

When there are points of the given set interior to the interval (b — e, b) 
for every value of e (< b), the point b is said to be the upper limit of the set. 

In case 6 is both the upper limit, and the upper extreme point, of the 
set, the upper limit is said to be attained; and b may then be called the 
maximum point of the set. 

To prove the existence*, under the condition stated, of an upper 
boundary, as above defined, it may be observed that all the numbers of 
the continuum of real numbers can be divided into two classes, one of 
which contains every number which is greater than all the numbers of 
the set, and the other of which contains every number which either belongs 
to the set or is less than some or all of the numbers of the set. The section 
thus specified defines a number b which is the upper boundary of the set. 

In a similar manner, it may be shewn that, if the set is bounded on 
the left, i.e. if a point can be found such that all the points of the set are 
on the right of such point, then a point a exists, which is such that no 
points of the set are on the left of a, and such that either a is a point of 
the set, or else points of the set are within every interval (a, a f c), where 
c is an arbitrary positive number, or else that both conditions are satisfied 
simultaneously. 

In case points of the set lie within every interval (a, a -f e), then a is 
called the louder limit of the set; and the lower limit is said to be attained 
if a be itself a point of the set. In any case in which a is a point of the 
set, it is then said to be the lower extreme point of the set. The term lower 
boundary may in all cases be applied to a. 

A set of points which has both an upper and a lower boundary is said 
to be a bounded set. Thus a set is bounded if every point # in it is such 
that \x\<A, where A is some fixed positive number. 

48 . If no point /? exists, which is such that no point of the set is on 
the right of then the set is said to be unbounded on the right; or it is 
said that the upper limit of the set is + oo; the two statements being 
regarded as tautological. Similarly, if no lower boundary a exists, the set 

♦ The existence of upper and lower boundaries was proved by Weie. 3trass, in his lectures. 
See also Bolzano, Abh. d. Bohmischen OeseUsch . d. Wiss., vol. v, Prag (1817). 
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is said to be unbounded on the left; or it is said that the lower limit is 
— oo. 

The symbols + oo, — oo do not represent numbers of the arithmetic 
continuum ; they must be taken to represent what is sometimes spoken 
of as the improperly infinite, i.e . the mere absence of an upper or a lower 
boundary respectively. In order, however, to avoid circumlocution in the 
statement of theorems concerning sets, it is usually convenient to speak 
of 4 oo, — oo, used in the above sense, as if they were numbers, sometimes 
called improper numbers, which correspond to upper and lower limits 
respectively. 

The statement that + oo is the upper limit of a given set is thus taken 
to be equivalent to the statement that, if A is an arbitrarily chosen 
positive number, there exist points x of the set such that x > A . Similarly 
if — oo is the lower limit of a set, there are points x of the set such that 
x < — A. 

In the present Chapter, it will frequently be assumed that the sets 
treated of are bounded; and the interval (a, b) will be said to be the interval 
in which the set exists. This restriction is not so great a one as might at 
first sight appear; for an unbounded set can be placed into correspondence 
with a bounded one, in such a manner that the relative order of any two 
points in the one set is the same as that of the corresponding points in the 

other set. If x' — where the radical is taken to have always the 

positive sign, then to a point x , in the unlimited interval ( — oo, 4- oo), 
there corresponds a point x\ in the open interval (— 1, 4* 1); and also 
x \ 1 , according as x x = x 2 . In order to set up a complete correspondence 
between the closed interval (— 1, 1) and the points of an unbounded seg¬ 
ment, we must adjoin to the latter the (improper) points 4- oo, - oo, 
which we take to correspond respectively to the end-points 1, — 1, of the 
closed interval. 

The same object might have been attained by using the transformation 

2 

x — -- tan -1 x . 

7T 


There is no real loss of generality in considering only such sets as lie 

X — OL 

in a given interval, say (0, 1); for the relation x' = ^ ^ establishes a 

complete correspondence between sets in the interval (a, )3) and sets in 
fhe interval (0, 1), the relative order of points being preserved in the 
correspondence. 

The points of the interval (a, p) may be made to correspond in order 
with the points of the interval (0, 1), in such a manner that an arbitrarily 
chosen point y within (a, p), corresponds to an arbitrarily chosen point 
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within (0, 1); for example the point \ . This correspondence can be effected 
by the transformation x * x — a y — p 

x' — 1 x — ft ‘ y — a * 

NON-LINEAR SETS OF POINTS 

49. A plane, or two-dimensional, set of points is an aggregate of which 
each element is constituted by a pair of real numbers (.r (3) , x l2) ). Such an 
element may be spoken of as a point P in a plane; the rectangular co¬ 
ordinates of P being x n) ,x i2) . Corresponding to intervals, in the case of 
linear sets of points, we now consider rectangles, or cells, of which the 
sides are parallel to the coordinate axes. 

A closed cell (a (1) ,a <2) ; 6 m , 6 (2) ) is regarded as the set of all points 
(a; (1) , x {2) ) such that a (1) 5 :r (1) < b (1 \ a i2) x {2) £ b {2) . 

An open cell (a (1) , a {2) ; b a) ,b (2) ) is regarded as the set of all points 
( x il) , x (2) ) such that a (1) < x (I) < b (1) , a <2) < x i2) < b (2) . 

The set of all points (or (1) , x {2) ) such that a {1) £ x a) < b a \ a {2) - x {2) < b {2) > 
may be spoken of as the semi-closed cell (a m , a i2) ; b {1 \ b (2) ). 

A plane set of points ( x {1) , x (2) ) is said to be bounded, in case t wo 
positive numbers A n) } A {2) exist, such that | x (1) | ^ A (1) , | x {2) | A (2 \ for 

every point (# (1) , x {2) ) of the set. 

When all the points of a given set are in a closed cell (a (1) , a {2) ; b ll) , b (2) ), 
this cell may be spoken of as a fundamental cell for the given set of points. 

More generally, a p -dimensional set of points consists of elements each 
of which is constituted by an association (x {1 \x {2) ,... x {p) ) oip real numbers. 

The p-dimensional closed cell (a (1) , a {2) , ... a {p) ; b {1) , b {2) } ... b {p) ) is the 
set of all points (a; (1) , x {2} , ... x (p) ) such that 

a (l) < xa [2) s *< 2 > s . a (p) si 

The p-dimensional open cell consists of all points such that 
a U) < X (1) < 1) J a <2) < X (2) < ^(2),.a<*» < z< 3>) < b {p) \ 

and the semi-closed cell consists of all points such that 

a {1) ss x (1) < 6 (1) , a (2) * x (2) < 6 (2> ,.a<*» & x< p > < b ip K 

All points of the closed cell such that x {Q) = a {Q) , or x iQ) — b (q) for some 
value of q (= 1, 2, 3, ... p) are said to be on the boundary of the cell. 

Jf positive numbers A {1) t A m , ... A {p) exist, such that for all points 
(x {1) , x (2) , ... x (p) ) belonging to a given set of points the conditions 
| xw | s; A n) , | *( 2 > | s A< 2 >, ... | | ss A<*> 

are satisfied, then the p-dimensional set is said to be bounded, and any 
closed cell in which all the points of the set are contained may be spoken 
of as a fundamental cell for the given set. If no such fundamental cell 
exists, for a given set of points, that set is said to be unbounded. 
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An unbounded set of points (# (1) , # (2) , ... x (p) ) may be correlated with 
a bounded set (£ (1) , £ (2) , ... £ (p) ) for which a fundamental cell is 
(- 1, 1,... - 1; 1, 1,... 1) 
by means of the relations 

£<«> -- 4 (r‘«>) 2 }*, (q ~ 1, 2, 3, ... />); 

or by means of the relations 

- 2 tan 1 #<*>, {(j - 1,2, 3, .../?). 

7T 

Any cell will be correlated with another cell by means of either of 
these relations. It is clear that these relations an* only examples of an 
indefinite number of other relations by which a similar correlation may be 
obtained. 

In order that every point on the boundary of tin* cell ( 1, 1, ...; 

1, 1, ...) may correspond to a point in the .r-space, it is convenient to adjoin 
a set of points at infinity to that space. For example, in the cast 1 p — 2, 
points (x {l) , oo) for all values of # U) , points (x {1) , - go) and points (oo, # <2) )> 
( -- co, # (2) ) will be taken to correspond to the points (£ (1) , 1), — 1), 

(1, £ (2> ), (— 1, £ (2) ) respectively. Also the four points ( oo, -L oo) will be 
adjoined so as to correspond to the four points ( 1, i- 1) in the finite 1 
rectangle. In this manner it can be assumed as a matter of convenience 
that the boundary of the rectangle ( 1, 1; J, 1) corresponds to a bound¬ 

ary adjoined to the .r-space. The advantage of this procedure is that no 
exception need be made as regards the correspondence of a f-set with an 
a*-set, when the £-set has points on the boundary of the cell in which it is 
contained. 

In stating properties of sets of points which are independent of the 
number of dimensions it is frequently convenient to employ a notation 
in which the number p of dimensions does not appear. To effect this the 
point (# (J) , x {2 \ ... x {,p) ) of a p-dimensional set may be denoted by a single 
letter#, as in the case of a linear set. The cell (a (1) , a {2) , ... a {p) \ b (l \ b {Z \ ... b ip) ) 
may then be denoted by (a, b). Thus (a, b) denotes an interval when p — 1, 
and a cell when p > 1. 

LIMITING POINT OF A CONVERGENT SEQUENCE OF INTERVALS, OR CELLS 

50. Let (&i,&i), (a 2 , b 2 ), ... (a„, b ft )> ... be an unending sequence of 
closed intervals which are such that any one of them (a n , b n ) lies entirely 
in the preceding one (< a n b n _j), the two having at most one end-point 
common; thus a n £ a n _ 1? b n ,b n ^\ moreover, suppose that the lengths 
b x — «i, b 2 — a 2 , ... b n - a„, ... form a sequence which converges to zero, 
the condition for which is that, corresponding to any arbitrarily small e, 
n can be so chosen that b m — a m , for all values of m w hich are n, is < e. 
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It will be seen that, in accordance with the axioms explained in § 44, 
there exists one point and one only which is in every interval of the sequence . 
This point may be called the limiting point of the convergent sequence of 
closed intervals. 


Each of the aggregates (a ly a 2 , a 3 , ... a n 


(bi , b 2 


being 


convergent, defines a number; and in fact, in virtue of the definition of 
equality in § 26, they define the same number x. This number x is not less 
than a n and not greater than b n , whatever n may be; the point x therefore 
lies in all the intervals, and is the limiting point whose existence was to 
be remarked. If y be any number greater (or less) than x , we can find n so 
great that b n - x < y — x, if y > x; or that x — a n < x — y, if x > y : thus 
y does not lie in ( a n , b„). Hence there is only one point which satisfies the 
prescribed conditions. 

If, for every n , from and after some fixed value, the inequalities 
a n > a n _ ly b n < 6 n _ 1 both hold, then the limiting point x is in the interior 
of all the intervals of the sequence. If, from and after some fixed value 
of n y say n 1} we have a n — a n _ x , b n < b n A , the limiting point x coincides 
with the common end-points a ni _ x , a ni , a niAl > _ 

If the intervals (a,, />,), (a 2 , b 2 ) y ... (a n ,b n ), ... are all open intervals, 
there is not necessarily any point which is in all the open intervals. Con¬ 
sider, for example, the set of open intervals 

(»,'), K). (».*)■ -K). 

In this case the point x defined by either of the sequences 

(o,o,». ..o, (i.'.j.... 

is the point 0, which is not in any of the open intervals of the given set. 
It thus appears that the theorem does not hold for the set of open intervals 
if, from and after some fixed value of n y we have a n = a w+1 , or if, from and 
after some fixed value of n , we have b n — 6 W+1 . If however, from and after 
some fixed value of n, both the conditions a n < a n+l , b n > 6 n+1 are satisfied, 
the theorem is valid; for the point x y defined as above, is a point in every 
open interval (a n , 6 n ), from and after some fixed value of n. 

If the intervals of the given set are semi-closed; being all open at the 
right-hand end-points, the theorem holds good unless b n = b n+l , from and 
after some fixed value of n. 

In the case of a sequence of p -dimensional closed cells 
(a (1) a (2) a (J,) * b (1) b (2) b <*>1 
where n = 1, 2, 3, ..., each of which is in the preceding one, so that 
a n+ i (fl) s and b n+1 iQ) & b n {9) , for all values of n, and for the values 
1, 2, 3, ..., p, of q , and if b n {g) — a n i9) , for each value of q, forms a sequence 
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of numbers which converges to zero, for n — 1, 2, 3, , there is a unique 

point which is in all the cells of the set, and which is called the limiting 
point of the set. For the sequence of linear intervals (a n {Q) , b n (q) ) defines, 
for each value of q , a single point x (Q) in all the intervals. Thus the point 
( x {1) , x (2) , ... x ip) ), uniquely determined, is in all the cells of the sequence, 
and it is easily seen that it is the only point that satisfies this condition. 

If the cells are open, the corresponding theorem does not hold, unless 
there exists some value of n from and after which a n (q) < a n _ fl ( '* ) , b n {q) > b n+1 {q) , 
for each value of q. 

If the cells are semi-closed, for the validity of the theorem the condition 
b n iq) > b n ^ x {q) must be satisfied, from and after some fixed value of n , for 
every value of q. 

(Q~P ) £ 

The number j 2 ( b (q) — a (<2) ) 2 > may be called the span, or the length 

of the diagonal , of the cell (a ll) , a i2) , ... a iv) ; 6 (1) , 6 (2> , ... b (v) ). This number 
may also be spoken of as the distance between the two points 

(a (1) , a (2) , (b ( »,b< 2) , ... b <*>), 

and may be denoted by PQ , when the points are denoted by P , Q 
respectively. 

A convergent sequence of closed cells is one in which each cell is in the 
preceding one, and such that the sequence of spans of the cells converges 
to zero; such a convergent sequence defines a unique point in all the ceils 
of the sequence. 

In the case of such a convergent sequence of cells, the convergence of 
the sequence of spans to the limit zero ensures that, for each value of q , 
where 1 s q £ p, b n iq) — a n (q) converges to zero, as n is indefinitely increased. 
A sequence of closed cells, each of which contains the next, may however 
be such that b n {q) — a n (q) converges to zero for some of the values of q , 
but not for all these values. In that case the sequence of ceils may be 
said to be incompletely convergent ; the points of a cell of dimensions lower 
than p will all be contained in each of the cells of the given sequence. 
This cell may be termed the limiting cell (or interval) of the given sequence. 

For example, if p = 3, we may have b n a) — a n (1) and 6 n (2) — a n (2) con¬ 
verging to zero, but 6 n (8) - a n (3) converging to some number greater than 
zero; in that case the sequence converges to a limiting interval. Again, 
if 6 n (1) — a n (1) converges to zero, but 6 n (2) — a n (2) , b n (3) - a n (3> converge to 
numbers greater than zero, the limiting cell is two-dimensional. 
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SYSTEMS OF NETS 

51 . Let the linear interval (a, b) be divided into a number m 1 of parts, 
such that the length of each part does not exceed a positive number 
let D x denote the finite set of intervals so obtained. By introducing further 
points of division of (a, 6), let a new finite set of intervals i) 2 be determined, 
of which the number is ra 2 (> wj, and let the length of each interval of 1) 2 
not exceed a positive number 8 2 (< Si). Proceeding in this manner, there 
can be defined, in accordance with some specified set of rules, a sequence 
1 \, I ) 2 ,... I ) n , ... of finite sets of intervals, such that the intervals of D n are 
all not greater than an assigned number 8 n . Let it be supposed that the 
numbers S x , S 2 ,... S n ,... form adiminishing sequence which converges to zero. 

If we regard the intervals of each set D n as semi-closed, i.e. closed on 
the left and open on the right, except that the extreme interval on the 
right is taken to be closed, each point of the closed interval is in one and 
only one of the intervals of the set I) n , for each value of n. 

The set D n of semi-closed intervals (except that the extreme one on 
the right is closed) may be spoken of as a net , and the nets corresponding 
to n — 1, 2, 3, ... may be called a system of nets fitted on to the interval (a, b). 
The particular net D n may be called the net of order n, or the nth net, of 
the system. The m n separate intervals of the net D n may be spoken of 
as the meshes of the net D n . 

The fundamental property of such a system of nets is that each point x 
of the interval (a, b) is in one and only one mesh of each of the nets D n . 
These meshes, for n — 1 , 2, 3, ..., constitute a convergent sequence of 
intervals of which the point x is the limiting point. In each net of the 
system the rank, or order, of the meshes may be taken as their order 
from left to right in the interval (a, 6). 

In case D l consists of rn meshes of equal breadth, Z> 2 of m 2 meshes of 
equal breadth, and in general D n consists of m n meshes of equal breadth, 
the system of nets will be said to be symmetrical. 

In some cases it is convenient to employ nets in which all the meshes 
are closed at both ends. In that case a point x of (a, b) that is an end¬ 
point of a mesh of D n will, in general, belong to two adjacent meshes of 
D n , and of all the subsequent nets of the system. Such a point will conse¬ 
quently be the limiting point of more than one sequence of meshes. Such 
a system of nets may be spoken of as a system of nets with closed meshes . 
It may or may not be a symmetrical system. 

In case the interval (a, b) is replaced by an indefinite interval (— oo, oo), 
(a, oo), or (— oo, 6), the number of meshes of a net fitted on to such in¬ 
definite interval will not be finite, but as before, the set of numbers 
Sj, S 2 , ••• K> will be taken to converge to zero. 
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The definition of a system of nets can be extended to the case of cells 
of two, or of any number p , dimensions. In the case of the two-dimensional 
cell (a (1) , a {2) ; b {1) , 6 <2) ), this rectangle is divided by straight lines parallel 
to the axes into a set D 1 of rectangular cells, the spans of all of which 
are £ 8 t . Some or all of these cells are again divided into smaller cells, by 
the introduction of fresh straight lines parallel to the axes, so that the new 
set of cells I) 2 is such that their spans are all ^ S 2 . Proceeding in this manner, 
in accordance with a specified set of rules, and the diminishing sequence {8 n } 
being taken to converge to zero, we obtain a sequence I) 1 , J) 2 , ... D n , ... 
of sets of cells. We take each of the cells of I) n to be semi-closed, except 
that those which have a side in common with one of those sides of 
(a (1) , a (2) ; b a \ b {2) ) which do not contain the point ( a (1) , a {2) ) are closed 
along that side; that side itself being taken to be semi-closed, unless it 
has (a (2) , b (2) ) for an end-point, in which case it is regarded as closed. 
Each of the sets D n of rectangular cells is then termed a net ; each of the 
cells of which I) n is composed is termed a mesh of the net D n ; and the 
totality of the nets {D n } is called a system of nets fitted on to the cell 
a< 2 >; 6< 1 >, 6< 2) ). 

Any point x in the fundamental cell is now in one and one only of the 
meshes of D n i for each value of n ; and consequently x is the limiting point 
of a unique sequence of meshes of the cells of the system. 

It is easy to arrange the meshes of a net D n in ascending order or rank. 
Thus we may take those cells which have a side in common with a part 
of (a (1) , a (2) ; 6 (1) , b {2) ) in ascending order from left to right; after these we 
may take in order from left to right those cells which have a side in 
common with one of those just referred to; then we take in order from 
left to right those cells which have a side in common with one of these 
latter, and so on. 

If the nets of D n all have the same span 8 n for each value of n, and if 
the ratio of 8 n+1 to S n is independent of n , the system of nets is said to be 
symmetrical. 

As before, a system of nets may be fitted on to the unbounded plane 
of (*<*>, 

In case all the meshes of D n , for every value of n , are closed, the 
system is said to be a system of nets with closed meshes. In this case 
the sequence of meshes of which the limit is a prescribed point (x (1 \ x m ) 
is in general not unique. 

It is unnecessary to point out in detail the corresponding definition 
and convention for the case of a ^-dimensional cell, when p > 2. It is 
clear that the number of dimensions makes no difference as regards the 
properties of a system of nets. 
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It will appear that a system of nets provides an apparatus which is of 
considerable utility as an instrument of investigation in the theory of sets 
of points, and in that of functions of one or more variables. The employ¬ 
ment of this nomenclature has the advantages of preventing repetition in 
the proofs of various important theorems, and of providing the means of 
giving those proofs in a form applicable not only to linear sets of points 
or to functions of one variable, but also to sets of any number of dimen¬ 
sions, or to functions of any number of variables. 

Although the method, equivalent to the employment of systems of nets, has been in 
use for a considerable time past, the first formulation of the idea of a system of nets 
appears to be due to de la Vall6e Poussin*. His terminology is however not identical with 
that here employed. He speaks of what is here termed a system of nets, as simply a net 
(r^seau), and what is here called a net is termed by him a lattice (grillage). Moreover he 
considers only what are here described as symmetrical systems of nets, and this is 
sufficient for the special purpose for which he employs them. For the more general 
purposes of investigation, it is however convenient not to make this restriction; 
accordingly the wider definition given above has been here adopted. 


THE LIMITING POINTS AND THE DERIVATIVES OF A SET 

52. If a point x be taken in the interval (a, b ), an interval (x -- e 1 , x -f e 2 ) 
which lies entirely in (a, b) is called a neighbourhood of the point x\ and 
this neighbourhood may be made as small as we please by proper choice 
of e 1 and e 2 • An interval (x , x -f e 2 ) is called a neighbourhood of x on the 
right, and (x — , x) is called a neighbourhood of x on the left. The end¬ 

points a and b can only have neighbourhoods on the right and the left 
respectively. A neighbourhood of a point may be open or closed. 

In the case of a point (x (1) , x (2) ) in a plane, the cell 

(a:<i> - ef\x™ - x< x > f £ 2 u>, x< 2 > 4- c 2 (2) ) 
is called a neighbourhood of (x (1) , x (2) ); it may be closed or open. Corre¬ 
sponding to neighbourhoods on the right or left, of a point x of a linear 
interval, there are in the plane four partial neighbourhoods of the point 
(x (1) , x (2) ), viz. the four rectangles 

(x (1 >, x (2> ; x (1 > + € 2 (1 >, x< 2 > + € 2 < 2 >), (x (1 > - e A a) , x< 2 > - €!<«; x (1) , x< 2 >), 

( X U> _ €i (i) } *<*); X ii) § X (2) + €2 (2)) j (*<n x w _ €l <2). x a> + z< 2 >). 

In p dimensions, similar definitions hold good; a point has 2 V partial 
neighbourhoods. The neighbourhoods can always be supposed to be in 
a fundamental cell, to which the point is confined. 

If a set of points O , in any number of dimensions , has been defined , and 
a point P be such that every neighbourhood of it contains a point of G, other 
than P itself , the point P is called a limiting point of the set G, whether P 
belongs to G or not. 


* See his treatise Integrate# de Lebesgue, Paris (1916), p. 16, et *eq. 
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It should be observed that, if every neighbourhood of P contains a point 
of Q , other than P, it must contain an infinite number of points of G. For 
let us suppose that a certain neighbourhood H, of P, contains only a finite 
set of points p Xi p 2) • • • Pm belonging to G. We may suppose P to be the 
limiting point of the sequence {d n } of meshes belonging to the nets {D n } 
of a system of nets fitted on to Z7, and such that P is interior to all the 
meshes d n . Each of the points p Y , p 2 , ... p m can be in only a finite number 
of the meshes d n of the sequence; hence, from and after some fixed value 
of n , d n contains no point of G ; but d n is a neighbourhood of P containing 
no points of G, other than P itself, which is contrary to the hypothesis 
made above. 

A limiting point of G, defined as above, is also called a point of accumu¬ 
lation. Another definition of a limiting point is that it is a point P such 
that G contains a sequence of points {P n } for which the distance PP n is 
less than an arbitrarily chosen number e, for all sufficiently large values 
of n. A point which satisfies this condition is certainly a limiting point, 
in accordance with the definition here adopted, but the question whether 
the converse holds will be discussed in Chap, iv, in connection with the 
multiplicative axiom. 

The fundamental theorem will now be established, that: Every bounded 
set of points G which contains an infinite number of points possesses at least 
one limiting point. 

We may suppose a system of nets to be fitted on to the fundamental 
cell, or interval, in which G is contained. Of the meshes of 1\ at least one 
must contain an infinite set of points belonging to G\ let d l be that one 
of such meshes which is of lowest rank. Of the meshes of I) 2 which are 
contained in d x at least one must contain an infinite set of points belonging 
to G \ let d 2 be that one of such meshes which is of lowest rank. Proceeding 
in this manner a definite sequence of meshes d lf d 2 , ... d n , ... is defined, 
each of which contains an infinite set of points belonging to G. The 
limiting point P of this sequence {< d n } of meshes is a limiting point of G. 
For any assigned neighbourhood of G contains d n , from and after some 
fixed value of n, and therefore contains an infinite number of points of G. 

It has thus been shewn that G has at least one limiting point. It may 
have a finite number, or an indefinitely great number, of limiting points. 
It should be observed that a limiting point of G may or may not itself be 
a point of G. In the case of a linear set of points, if either boundary of 
the set be not a point of G , it is certainly a limiting point of G ; it may 
however be both a point of G and a limiting point of G. y 

A limiting point P of a linear set G is a limiting point on both sides, if 
an indefinitely great number of points of G lie in every neighbourhood 
of P on the right, and also in every neighbourhood of P on the left. 
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Otherwise P is a limiting point of G on one side only. This distinction 
may be extended to the case of a limiting point P of a set G in p dimen¬ 
sions, the 2 P partial neighbourhoods of P corresponding to neighbourhoods 
on the right and left of a point of a linear set. 

53. The theorem of § 52 does not hold for the case of unbounded sets 
of points. For example, the linear set which consists of the points 1, 2, 
3, ..., ft, ... has no limiting point, in accordance with the definition given 
in § 52. The same remark applies to the set of points — 1, — 2, — 3, ..., 
— n, — If we apply to either of these sets the transformation 

£ -- x/x'x 2 -| I, 

where the positive value of the radical is always taken, the sets on the 
interval of £ winch correspond to the above sets are 

1/V2, 2/V5, 3/vTo, ?i/Vn 2 T T, 

and - 1/V2, - 2/V5, - 3/VTo, - v/Vn* TT 

which have as limiting points the points 1,-1 respectively; these points 
1, - 1 being the end-points of the interval (— 1, 1) of To the inde¬ 
finitely great interval of .r, which is an open interval, there corresponds 
the open interval (—1, I) of £, and in the former interval there are no 
points which correspond to the points — 1, 1 of the interval of x. If we 
agree to adjoin to the set of real numbers, two improper numbers — oo, oo, 
which are taken to correspond to the numbers — 1, 1 in the closed interval 
( 1, 1) of £, we now regard (— oo, oo) as the closed interval of x corre¬ 

sponding to the closed interval (— 1, 1) of £. To a neighbourhood (1 — e, 1) 
on the left of the point £ = 1, there will correspond the interval 

(- 1 € -i,oo) 

V(2e-e 2 )* / 

on the left of the adjoined point x = oo in the linear interval of x. The 
point x — oo may be regarded, in an extended sense of the term, as the 
limiting point of the set of points 1, 2, 3, .... It corresponds to the limiting 

point 1 of the corresponding set \jV2, 2/V5, 3/VlO,.in the segment 

(-1,1). 

In this extended sense of the term “limiting point,” the point oo will 
be a limiting point of any set G which is such that, corresponding to any 
arbitrarily chosen positive number A, there are points x , of Q, for which 
x > A . Similarly, the point — oo may be regarded as a limiting point of 
the set G, if there are points of G for which x < —A, for all values of the 
positive number A. When oo or — oo, is, in this extended sense of the 
term, a limiting point of G, the point 1, or — 1, is a limiting point of the 
set of points which corresponds to G in the ^-segment. 
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To a neighbourhood (x — e lf x + e 2 ) of a point x y not oo or — oo, in 
the interval ( ~ oo, oo), there corresponds a neighbourhood (£ — e/, £ -f- t 2 ') 
of the point f which corresponds to x. It is easily seen that sequences of 
values of , e 2 which converge to zero, correspond to sequences of e/, e £ ' 
which converge to zero. Thus a finite limiting point of a set G in the 
interval (— 00 , 00 ) of x corresponds to a limiting point of the corre¬ 
sponding set in ( - 1, 1) which is in the interval. 

It thus appears that, when the two points 4 - 00, 00 are adjoined to 

the indefinite interval of x, so that it becomes a closed interval, the theorem 
as to the existence of a limiting point holds of the closed interval. The 
difference in the form of the condition that either of the points 00 , — 00 
should be a limiting point, from the condition applicable to a finite point, 
is seen to be unessential, as it disappears when the set is transformed into 
a set in the finite closed interval (— 1, 1). 

The intervals ( A , 00 ), (— 00 , — A) may be termed neighbourhoods of 
the points 00 , — 00 respectively. 

The case of a plane set may be considered in a similar manner. Em¬ 
ploying the transformation 

£d) = tftu/vV 1 *) 2 4 i, £ (2) - xw/VixwpTT, 

by which points (^ (1) , x (2) ) of the unclosed infinite cell (— 00 , — 00 ; 00 , 00 ) 
are placed in correspondence with points of the unclosed cell (— 1, - 1; 1, 1) 
in the plane of (£ (1) , i (2) ); we see that, corresponding to a set of points in 
the latter plane which has a limiting point on the boundary of the rectangle 
( — 1, — 1; 1, 1), there will be a set of points in the plane of (x (1) , # (2) ) 
which has a limiting point, in the extended sense of the term, on a 
boundary adjoined to the rectangle (— 00 , — 00 ; 00 , 00 ). In this case also, 
any limiting point of a set in the rectangle (— 1, — 1; 1, l)in which the 
points £ (2) ) are contained, will correspond to a limiting point of the 
corresponding set in the rectangle (— 00 , — 00 ; 00 , 00 ) in which the points 
(£ (1) , x (2) ) are contained. It is clear that there is an arbitrary element in 
the particular transformations employed. 

The set of points (a; (1) , x (2) ) such that a (1) — e ^ x ll) ^ a {1) + e', and 
x i2) ^ A, where €, e', A are positive numbers, may be called a closed 
neighbourhood of the point (a (1) , 00 ). A similar definition may be given 
for an open neighbourhood of the point. The set of* points for which 
x (l) £ A , x {2) s — B, wdiere A and B are positive numbers, may be termed 
a closed neighbourhood of the point ( 00 , — 00 ); if x {1) > A , x <2) < - B , 
the neighbourhood is open. Similar definitions will apply to points at 
infinity of the other types. 

54. Returning to the case of a set G in a finite interval or cell, we 
observe that the limiting points of G form a set of points which may be 
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finite or infinite; this set is called the derived set* * * § , or first derivative of G, 
and may be denoted by G'. In case the set G' contains an infinite number 
of points, it possesses itself a derivative set G ", which is called the second 
derivative of G. If we proceed in this manner, we may obtain a series 

G\ G ", ... <?<»> 

of derivatives of G. If the ?tth derivative G in) contains a finite number 
only of points, then these have no limiting point, and we may say that 
G ln+1) ~ 0. It may however happen that, however large the integer n 
may be, the derivative G (n) contains an indefinitely great number of points; 
and thus a next derivative exists. 

A set G which possesses only a finite number of derivatives is said to be 
of the first species. 

In this case, if G {s) contains only a finite number of points, the set G 
is said to be of orderf s. Thus, for example, a set of the first species and 
order zero contains only a finite number of points; and a set of the first 
species and order 1 has a first derivative which contains only a finite 
number of points. It will be observed that the order of each derivative 
of G is less by unity than that of the one which precedes it. 

A set G which possesses an indefinite number of derivatives is said to be 
of the second species. 

As an example, we may consider the set of rational numbers in the 
interval (0, 1). The first derivative of this set contains every real number 
in (0, 1), and all subsequent derivatives are identical with the first. 


EXAMPLES 


1. Lett 



111 1 \ 

r 3’ 4 n “7 * 


We see that (V consists of the single point 0, which does not belong to G; thus G is of the 
first species and of order ]. 


2. Let§ the points of G be given by 

A _ 1 1 JL 

3 s » + 5*A 7*5 f 11V 

where s lt s 2 , s 3 , s 4 each have all positive integral values. Here G' consists of the four sets 
of points given by 

1 L 1 L 1 1 1 J_ 1 111 

3*» + + 7*3 * 3*i + 5*5 + 11V 3*i + 7*3 + 11 * 4 ’ 5*2 + 7*3 + 1 f* 4 ; 


* The notion of the derivative of a set was introduced by Cantor, Math. Annalen , vol. v 

(1872), p. 128. Du Bois-Reymond contemplated the existence of limiting points of various 
orders, Crelle’s Journal , vol. lxxix (1874), p. 30; in Math. Annalen, vol. xvi (1880), p. 128, Du 
Bois-Reymond defined a limiting point of infinite order. 

f Cantor, Math. Annalen , vol. v (1872), p. 129. % Cantor, Math. Annalen, vol. v (1872). 

§ Ascoli, Ann. di Mat . (2), vol. vi (1875), p. 56. 
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and of the six sets of points 
11 1 J. 

S s i + 5** ’ 3*1 + 7*« 

and of the four sets of points 


1 

1 

1 

1 1 1 

3*1 

+ 11*4’ 

5^2 

+ 7*S ’ 5*2 + i 1*4 

1 

1 

1 

1 

3*i 

’ 5*a * 

7*3 

’ 11*4 ; 


1 1 

7*» + i i *4 : 


together with the single point 0. O" consists of the last ten of these sets, and of the 
point 0. The third derivative G"' consists of the last four sets, and of the point 0; G"" 
consists of the point 0 only. The set G is of the first species and of the fourth order. 


3. Let* the points of G be given by 

1 1 


... + 


1 


a l a 2 a n 

where n is a fixed number, and each of the numbers 
integral value. In this case G is of order rt. 

J 


. n n takes every positive 


4. The zerosf of the function sin - form a set similar to that in Example 1. The zeros 


of the function sin / — — \ form a set of the second order, 

rv 

a set of the third order, and so on. 


those of sin / ' » ■ ■ ■■ " \ 

I sin —I 
\ sin / 

v r 


form 


5. Letf the points of G be given by 

1 1 1 

- 4- 4- . -f --. 

where m x , m 2 , ...m n have all positive integral values, including zero, and n is a fixed 
integer. It can be seen that G (n) consists of the point zero only. 


DESCRIPTIVE TERMINOLOGY 

55. If G x , G 2 , 6t 3 , ... G n denote a number of sets of points (either linear 
or in any number of dimensions), the set which contains every point that 
belongs to one or more of the given sets is called § their least common 
multiple , and is denoted by M (G x> G 2 , ... G n ). In case no two of the given 
sets have a point in common, the common measure of the sets may be 
denoted by G x ~f G 2 + ... -f G n , and it may be spoken of as their sum. 
By some writers the term “sum” is employed for the greatest common 
measure. 

That set which contains all those points which belong to every one of 
the given sets is called their greatest common divisor , and it may be denoted 
by D (G x ,G 2 , ... G n ). By some writers this set is termed the product of 
the given sets, and is denoted by G X G 2 ... 6 ? n . 

These definitions are still applicable in the case of an unending sequence 
of sets G lf G 2i G 3i .... 

* H. J. S. Smith, Proc. Lond. Math . Soc. vol. vi (1875), p. 145. 
f P. Du Boia-Reymond, CreUe's Journal , vol. lxxix (1875), p. 36. 
t Mittag-Leffler, Acta Math. vol. iv (1884), p. 58. 

§ Cantor, Math. Annalen , vol. xvii (1880), p. 355. 
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If all the points of a set 77 are points of a set G, H is said to be con¬ 
tained in G, or to be a part, or component, of G. The set G is said to 
contain H. Those points of G that do not belong to H form a set which 
may be denoted by G — II. The set G — II is said to be the complement 
of H with respect to G , and is sometimes denoted by Cq (H). If the set G 
consists of all the points of the fundamental interval, or cell (open or 
closed), which contains //, the set G — II is called the complement of H, 
and is denoted by C (II). 

A set, all of whose limiting points belong to the set itself, is said to be closed. 

Thus, in a closed set, the derivative G' is a component of G. 

Although the definition of a closed set would be applicable to an un¬ 
bounded set, in case limiting points in the extended sense of the term are 
admitted, as explained in § 53, it is usually convenient to restrict state¬ 
ments as to closed sets to the case in which they are bounded. Accord¬ 
ingly, it will in general be assumed that a closed set is a bounded one. 

A set of points G is said to be an isolated set when no point of the set is 
a limiting point of the set. 

Thus, we have, for an isolated set G, the condition D (G, G') = 0. 

If, from any set G, we remove those points which also belong to its 
derivative, the remainder is an isolated set; thus G — D (G, G') forms an 
isolated set. Any set G may be regarded as the sum of an isolated set 
and of a component of the derivative G'. 

If a component II of the set G is such that every point of G is a limiting 
point of H, the set H is said to be dense in G. 

If we consider the case in which II is identical with G, we obtain the 
definition: 

If every point of G is a limiting point of the set, G is said, to be dense in 
itself. 

For a set G, dense in itself, G is a component of the derivative G f . The 
rational numbers of the interval (0, 1) form a set that is dense in itself. 

A set G which is both closed and dense in itself is said to be perfect *. 

Thus a perfect set G is identical with its derivative. It follows that 
every perfect set is of the second species. 

By some writers! the term perfect is applied to sets which, in accordance 
with the terminology of Cantor here adopted, are only closed, without 
necessarily being dense in themselves; what is here called a perfect set 
is then spoken of as an absolutely\ perfect set. 

* Cantor, Math. Annalen , vol. xxi (1883), p. 575. 

f For example Jordan, see Cours <TAnalyse, vol. I, Paris (1893), p. 19. 

J Borel, Lemons sur la thiorie des foncMons (1898), p. 36. 
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In case the set G consists of all the points of a closed interval, or cell, 
a set H which is dense in G is said to be everywhere dense* in the given 
interval, or cell. This is equivalent to the statement that: 

A set H is said to he everywhere dense , or dense in an interval, or in a 
cell , provided that no interval, or cell, contained in the given one, exists which 
contains no points of H. 

Similarly, if a set H is dense in a set G, no interval, or cell, can be 
determined such as to contain points of G and no points of II. 

If H is dense in G, the derivative 77', of H, contains every point of G. 
The derivative G' of a set G wliich is everywhere dense in an interval, or 
cell, must contain every point of the closed interval, or cell. This pro¬ 
perty may be used as a definition*!* of the term ‘‘everywhere dense.” 

By Du Bois-ReymondJ the term pantachisch was used with the same 
meaning as everywhere dense. 

Any interval, or cell, contained in a fundamental interval, or cell, 
may be spoken of as a sub-interval, or sub-cell. 

If, in every sub-interval, or sub-cell (a', //), of the fundamental interval, 
or cell (a, b), in which a set of joints G is contained, another sub-interval , or 
sub-cell (a", b"), can be determined which contains no />oints of G, the set G 
is said to be nowhere dense, or non-dense in (a, b). 

Thus a non-dense set is one such that no interval or cell exists in which 
the set is everywhere dense. 

A component II of a set G is said to be non-dense in, or relatively to G, 
if, in any interval, or cell, that contains points of G, a sub-interval, or sub- 
cell, can be determined which contains a point of G but no point of II. 

A point P of a set G is said to be an interior point of G, if a neighbour¬ 
hood of P can be determined all the points of which are points of G. 

If, however, the set G be bounded, and contained in a fundamental 
interval, or cell, a point of G on the boundary of the interval, or cell, may 
be regarded as an interior point of G relatively to the interval or cell, if 
a neighbourhood of the point exists such that every point of that neigh¬ 
bourhood which is in the fundamental interval, or cell, is a point of G. 
Such a point is however not an interior point of G, relatively to unbounded 
space, or to an interval, or cell, wliich contains the fundamental interval 
or cell in its interior. 

Those 'points of a set G which are not interior points of G, together with 
those points of C (G) which are not interior points of C (G), form a set which 

* Cantor, Math. Annalen , vol. xv (1879), p. 2. 

f Baire, Annali d. Mat. (3), vol. in (1899), p. 29. 

J Math. Annalen, vol. xv (1879), p. 287. 
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is called the frontier of G or of C (G); the set C (G) denoting the complement 
of G in the unbounded interval or space in which G is contained. 

//, however , G is contained in an interval , or cell , and C (G) denotes the 
complement of G with respect to such interval , or cell , the set of points of G not 
interior to G, relatively to the interval or cell , together with those points of G (G) 
which are not interior to C (G), relatively to the interval or cell , forms the 
boundary of G or of C (G) relatively to the fundamental interval , or cell. 

To illustrate the distinction here made, we consider for example the 
linear set consisting of all the points of the closed interval (a, b); the 
boundary of G consists of the two points a, b } which are not interior points 
either of G or of C (G), the complement of G relatively to ( — oo, oo). But, 
relatively to the closed interval (a, b), the points a, b are interior points 
of the closed set (a, 6), in accordance with the convention made above; and 
thus the set has no boundary relatively to the closed interval (a, b). 

Every point of an open interval, or cell, is an interior point of the open 
interval, or cell, regarded as a set of points. For example, every point of 
an unbounded space is an interior point of the unbounded space considered 
as a set of points. 

An open set * is one in which every point is an interior point. A set is 
open relatively to a fundamental interval , or cell , when every point is an 
interior point relatively to the interval , or cell. 

The term open set is however employed by some writers to denote 
any set which is not closed. 

A non-dense set has no interior points, but an everywhere dense set 
may also have no interior points. An open set contains none of the points 
of its frontier. 

It is frequently of importance to consider the properties of sets which 
are contained in a given perfect set G , or which have a part in common 
with G. 

A point P of a set H is said to be an interior point of H relatively to G , if 
it is a point of G and is such that a neighbourhood of P exists for which all 
the points of G in that neighbourhood are also points of H. 

Those points of G which are limiting points both of H and of Co (#), 
where H is a component of G, are said to form the frontier of H and of Cq ( H) 
relatively to G. 

A set H , all the points of which are interior points of H relatively to G, 
is said to be open relatively to G. 

A set H , such that no point which it has in common with G is an interior 
point of H relatively to G, is said to be diffuse relatively to G t or to be diffused 
inG. 

* de la Valtee Poussin, Intdgrcdes de Lebesgue, p. 10. 
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If U has points* in common with G, and is not diffuse in G, it is said to 
be compact in G. 

This property of compactness was formulated by de la Vallee Poussin, 
whose definition is equivalent to that here given. The term “ compact ” 
is employed by some writers with a different meaning. 

PROPERTIES OF CLOSED AND OPEN SETS 

56 . The complement C (G) of a closed set G with respect to a closed' interval , 
or cell, in which G is contained, is an open set, relatively to the interval or cell. 
Conversely, the complement of an open set contained in a closed interval, or 
cell, is a closed set. 

For any point P of C (G) which is not a limiting point of the closed 
set G is such that a neighbourhood of P can be determined which contains 
no points of G. All the points of such neighbourhood (or of the part of it 
in the fundamental interval, or cell) are points of C (G). Therefore P is 
an interior point of C (G). It follows that C (G) is an open set. 

If H be an open set, a limiting point of C (II) cannot belong to II, 
because every point of II has a neighbourhood none of the points of which 
belong to C (II). Since every limiting point of C (H) belongs to C (H), 
the set C (H) is closed. 

A closed set G being essentially bounded, the complementary set C (G) 
with respect to the unlimited interval or space in which G is contained is 
open in the absolute sense. 

An unbounded open set will only necessarily have a closed set as its 
complement, provided the meaning of the term closed set is extended by 
admitting adjoined points at infinity as points of the closed set (see § 53). 
For example, consider the linear set of all points x such that 0 < x ; the 
complementary set with respect to the indefinite interval (— oo, oo) is 
the set for which - oo < x i O\ and this can only be regarded as a closed 
set if the improper point — oo is admitted as part of it. 

The complement Co (H) of a closed set with respect to a perfect set G which 
contains H is open relatively to G. Conversely the complement with respect 
to the perfect set G of a set II contained in it, and open with respect to it, is 
a dosed set. 

The proof of this theorem is precisely similar to that of the last theorem. 

If O x , 0 2 be closed sets, both the sets M (G 1} G t ), D (G x , G 2 ) are closed . 

Let P be a limiting point of M (G x , G 2 ) ; then P must be a limiting 
point of one at least of the sets G X ,G 2 . For, if this were not the case, a 
neighbourhood of P could be determined so as to contain no points of G x 
or of G 2 , other than P, and therefore none of M (G x , G % ). The point P 

* Integrates de Lebesgue , p. 106. 
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consequently belongs to one at least of the closed sets, and therefore to 
M ( G x , 6r 2 ), which is therefore closed. A limiting point P , of I) (G X ,G 2 ), 
is clearly a limiting point both of G x and of G Y 2 ; therefore P belongs to 
both these sets, and thus to D (G x , G 2 ), which is consequently closed. 

The theorem can be readily extended to the case of any finite number 
of closed sets. Thus: 

If G x , G 2 , ... G n be closed sets , both the sets 

are closed. M > G *> • • • G n)< l> ( G i > G f ■■■ (} ») 

If the number of closed sets is indefinitely great, so that they form a 
sequence G x , G 2i ... G n , ... of such sets, the set M (G x , 6 y 2 , ...) is not neces¬ 
sarily closed. For if P x belong to G x , P 2 to G 2 , P a to (r 3 , ..., the point P, 
the limiting point of the sequence P x , P z , ..., does not necessarily belong 
to any of the sets, and thus does not necessarily belong to M (G x , G 2 ,...). 
On the other hand, the set D (G x , G 2y ...), if it exists, is necessarily closed, 
for every limiting point of it is also a limiting point of G m , for each value 
of m, and is therefore a point of G m for eacli value of m. 

If d 1? 0 2 , 0 3 , ... be a finite number , or o sequence , o/ open 
Ifcf (Oj, 0 2 , 0 3 , ...) is also an ope?t set. 

The sets () 1 ,0 2 ,... all being open sets, the complementary sets 
O (Oj), C (0 2 ), ... are all closed sets: and consequently 1) {C (0 X ), C (0 2 ), ...} 
is a closed set, whether the number of the given sets is finite or not. But 
the set 1) {C (Oj), C (0 2 ), ...} is clearly the complement of M (f^, 0 2 , ...). 
which is consequently an open set. 

In case the sets 0 X , 0 2 , ... are all contained in a finite closed interval 
or cell, the complementary sets C (Oj), C (0 2 ), ... are taken relatively to 
this interval or cell. If this is not the case, the sets C (Oj), C (0 2 ), ... will 
only be closed, in an extended sense of the term, when limiting points at 
infinity are admitted. This being taken into account, the proof applies 
to this case. 

7/0 1 ,0 2 , ... 0^ be a finite number of open sets , the set 1) (0 l5 0 2 , ... 0 m ), 
if it exists , is also an open set. 

For the set D (Oj, 0 2 , ... 0 W ) is the complement of the set 
M{C (0,), C (0 2 ), ... <7(0.)}. 

which is a closed set. 

The theorem does not hold for the case of an infinite number of 
open sets. 

It has been seen that the properties* of a set of being perfect, closed, 
open, dense, or non-dense, are invariant for a wide class of transforma¬ 
tions of which examples have been given in § 53; special account being 
taken of the cases in which boundaries at infinity must be adjoined to 

* See a paper by E. H. Neville, Acta Math. vol. xlii (1920), p. 63. 
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the space in which a set exists. These properties of a set are accordingly 
said to be descriptive properties, as distinct from the non-invariant metric 
properties which will be considered in Chapter m. 


PROPERTY OF THE SUCCESSIVE DERIVATIVES OF A SET 

57. The following fundamental property of the successive derivatives 
of a set of points, in any number of dimensions, will now be established. 

All the derivatives G', G", G'", ... G lN) , ... of a given set are closed sets , 
and each of these derivatives , after the first , consists of jtoints belonging to the 
[/receding one , and therefore to G', 

This theorem, usually stated for a bounded set, holds also for an un¬ 
bounded set, provided the extended meaning be given to the terms, 
limiting point, closed set, which has been formulated in § 53 and § 55. 

If a point P of G (n) , where n 2, existed, which did not belong to G', 
then a neighbourhood of P could be determined, so as to contain only a 
finite set of points of G, or no such points; and this neighbourhood would 
therefore contain no points of G', and consequently none of G", G '", ... G ln> ; 
which would be contrary to the hypothesis that P belongs to G (n) . There¬ 
fore every point of G {n) (n ■> 2) belongs to G'. By considering the case 
n - 2, we see that G' is a closed set. 

If we take G in ~ 2) to be the original set, it follows from the above that 
every point of G ln) , the second derivative, belongs to G (n ~ 1) , the first de¬ 
rivative. We have thus shewn that 

f/< w » l)(G\ G", ... G (n) ). 

The derivative G' of a set G which is dense in itself is perfect. 

For 6r' is closed, and every point of G belongs to G' ; thus G' contains 
no point which is not a limiting point of G'. Therefore G', being both 
closed and dense in itself, is perfect... 

ENUMERAELK AGGREGATES 

58. An aggregate which contains an indefinitely great number of elements 
is said to be enumerable *, or countable (abzalbar, denombrable), when the 
aggregate is such that a (1, 1) correspondence can be established between the 
elements and the set of integral numbers 1, 2, 3, .... 

An aggregate of objects is therefore enumerable if the objects can be 
arranged in a series which has a first term and in which any assigned 
object belonging to the aggregate has a definite place assigned by a 
definite ordinal number n. Thus the elements of an enumerable aggregate 
can be represented by a sequence of symbols 

U 1 7 U 2 j • • • 7 • *• * 


* Cantor, Crelle’s Journal, vol. lxxvh (1874), p. 258. 
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It follows from this definition that the elements of two enumerable 
aggregates are such that a (1, 1) correspondence can be established be¬ 
tween them. 

If a new aggregate be defined by selecting, in accordance with a rule 
or finite set of rules, elements from those which belong to an enumerable 
aggregate, an indefinitely great number of such elements being taken, 
then the new aggregate is also enumerable. For such an aggregate selected 
from u 1} u 2 , ... u n , ... is u r , u s , u t , ... (r < s < t ...), which satisfies the con¬ 
ditions of having a first term, and of having each element of the aggregate 
in a definite place in the series. It thus appears that a (l, 1) correspon¬ 
dence can be established between an enumerable aggregate and one which 
is a part of that aggregate, provided this part be not finite. This is the 
characteristic property which distinguishes an aggregate containing an 
indefinitely great number of elements from one containing only a finite 
number of elements. For example, a (1, 1) correspondence exists between 
all the integral numbers and all the odd numbers, or between all the 
integral numbers and all the prime numbers. 

If a finite number of enumerable aggregates be given , or even if the number 
of such aggregates be indefinitely great , but enumerable , then the new aggregate 
formed by combining these aggregates into a single one is itself enumerable. 

We may denote such a composite aggregate by the letters 

^11 > U 12 ' U 1Z > • • • U \n 5 • • • 

^21 9 ^22 ’ ^23 9 • • • U^ n , ... 

W 31 » ^32 ’. 


and we shall shew that the double sequence so formed represents an 
enumerable aggregate. To see this, it is sufficient to write the sequence 
in the form 

u u 

^12 > ^21 

^ 13 » ^22 9 ^31 


U 1 ,n~l9 u 2,n~29 n-3 5 ••• U n~ 1,1 


where the sum of the indices is the same for all the terms which are 
written in one horizontal line. It is now clear that each number u PQ has 
a definite place in a sequence in which u n has the first place; the double 
sequence is therefore enumerable. 

An important particular case of the above result is the following 
theorem: 

The aggregate of all the rational numbers is enumerable. 
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A rational number pjq may be denoted by u Vt q : therefore the aggregate 
is enumerable. It makes no difference that any particular number pjq 
occurs an indefinite number of times as u rPt rq \ since, if all such terms 
except those for which r = 1, and p/q is in its lowest terms, be removed, 
the aggregate left is still enumerable. 

For example, the aggregate of rational numbers in the open interval 
(0, 1) may be arranged in the order £, f, £, f, f, f, J, £, in 
which each fraction that occurs is in its lowest terms. 

A method of placing the rational fractions between 0 and 1 into corre¬ 
spondence with the integers 1, 2, 3 , ... has been given by Faber*. By this 
method the integer corresponding to a given fraction can be calculated by 
means of formulae. Faber shews that there exists a unique representation 
of a given proper fraction pjq, in the finite form 

, a 2 . a 3 , ,_ a n _ 

2 ! , 3 ! 4 ! (?i f 1 )! ’ 

where a r is an integer (or zero) < r 4- 1; whereas every integer is uniquely 
representable in the form 

+ 6 2 .2 -f 63.2.3 + ... ~h b n . 2. 3.4 ... n , 
where b r is an integer (or zero) < r 4- 1. 

To the proper fraction pjq there corresponds the unique integer 
u-i -f a 2 .2! 4 ~ #3 • 3 ! -f- ... -f- a n . n ! 

59 . A more general theorem has also been established by Cantorf. An 
algebraical number is one which is a root of an algebraical equation in 
which the coefficients are all rational numbers, so that the coefficients 
may without loss of generality be taken to be integers. Cantor’s theorem 
is, that all the algebraical numbers form an enumerable aggregate. 

To prove this theorem, let 

PoX n + p 1 x 1t ~ 1 -f ... 4 - p n = 0 

be an equation in which p 0i p 1 , ... p n are all positive or negative integers 
(including zero): and let 

I Po I + I Pi I + ••• + I Pn | + n = N; 

then A is a positive integer which may be called the rank of the equation. 
It is clear that there are only a finite number of equations of any given 
rank, these equations having only a finite number of roots. If then we 
let N = 3, 4 , 5 , ... successively, all the algebraical numbers can be arranged 
in a simple sequence; and thus they form an enumerable aggregate. The 
aggregate which is formed of all the real algebraical numbers is conse¬ 
quently also itself enumerable. 

* Math. Annaien, vol. lx (1905), p. 196. 
t CreUe'e Journal, vol. lxxvu (1874), p. 258. 
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A number which is not an algebraical number is said to be transcen¬ 
dental. The existence of transcendental numbers was first established* by 
Liouville, who shewed how examples of such numbers could be formed. 
No general criterion is known by which it can be decided whether a 
number, defined by a given analytical procedure, is algebraical or trans¬ 
cendental. The first case in which such a number, well known in Analysis, 
w as shewn to be transcendental w as that of the number e, the base of the 
natural system of logarithms; and the first proof that e is transcendental 
was given by Hermite. The next case in which a number, defined analyti¬ 
cally, was shewn to be transcendental was that of the number tt. The 
first demonstration of this important fact is due to Lindemannf, who 
proved the more general theorem that, if v x — ;/, the two numbers x, y 
cannot both be algebraical, except in the case x - 0, y -- l. It follows 
that the natural logarithms of all algebraical numbers are transcendental, 
as also are all numbers of which the natural logarithms are algebraical. 

60. The following fundamental theorem w T ill now be established^: 

The aggregate which consists of the continuum of numbers in a given 
interixil is not enumerable. 

Suppose that , co 2 , co 3 , ... denote the numbers in an enumerable 
aggregate; it will then be shewn that, between any two real numbers a, ft 
whose difference is as small as we please, a number occurs which does not 
belong to the enumerable aggregate. It will then follow 7 that, in the given 
interval (a, ft), there are an unlimited number of points which do not 
belong to the enumerable aggregate, and thus that the latter cannot 
contain all the points of the continuum. If the enumerable set of points 
is not everywhere dense in (a, ft), then smaller sub-intervals inside (a, ft) 
can be taken which contain no points of the aggregate; and thus we have 
only to consider the case in which the given aggregate is everywhere dense 
in (a, ft). Let a> Kx be the first of the points w 1 , oj 2 , ... which lies within 
(a, ft), and w Kt be the next of these points which lies within (a, ft) so that 
Kj < k 2 . Let a be the smaller, and ft' the greater of the numbers o^, to Ki , 
then a < a < ft' < ft, and k 1 < k 2 ; and if fx < k 2 , then does not lie 
within the interval (a', ft'). Considering this latter interval, let co Ks , ca*, 
be the first two of the numbers of the enumerable aggregate which lie 
within (a', ft'), and let a" be the smaller and ft" the greater of these; then 
a' < a" < ft" < ft', and k 2 < k s < ac 4 . Proceeding in this manner, we obtain 
a whole series of sub-intervals each one of which is entirely within the 
preceding one; thus (a {y) , ft {v) ) lies wdthin (a {v ~ l) , ft {v ~ 1) )] and if fx^K Zv , 
then adoes not lie within ( a (v) , ft (v) ); also 

K x < K 2 < K z < ... < f^2v—2 < *2k 1 < *2 V , 

* Liouville’s Journal, vol. xvi (1851), p. 133. f See Math. Annalen, vol. xx (1882), p. 213. 

I Cantor, Crelle's Journal, vol. lxxvii (1874), p. 200. 
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and 2v s /c 2 „; and thus av lies outside (a (, °, /9 (,/) ). Since the numbers 
a", a'", ... are in ascending order, and all lie within (a, /?), they have 
a limit A ; similarly /?', ft'", ... have a limit 17; and a (v) < A £ B < 

If A < B y then, since all the numbers a>„ are outside the interval (A L, B ), 
the given aggregate is not everywhere dense in (a, ft ); which is contrary 
to hypothesis. Hence we have A~B\ and the number A, or 27, is a 
number which does not occur in the aggregate co 1 , o> 2 , ...; thus the assump¬ 
tion that all the real numbers in a given interval can be effectively arranged 
in a simple sequence has been shewn to lead to a contradiction. 

It will be observed that the point of the foregoing proof consists in 
the fact that an everywhere dense enumerable aggregate necessarily has 
limiting points which do not belong to the aggregate. 

A second proof*, also due to Cantor, that the continuum is not 
enumerable is the following:—Without loss of generality, the interval may 
be taken to be (0, 1). Suppose it to be possible to state a set of rules by 
which all the numbers within this interval are arranged in a sequence, so 
that there is a first, a second, a third, and so on; and so that every number 
occurs somewhere in the arrangement. Since certain rational numbers are 
capable of double representation, viz. by means of a decimal in which, 
from and after some fixed place, all the digits are zero, and also by a 
decimal in which, from and after some fixed place, all the digits are 9, we 
shall suppose this last mode of representation excluded, so that each 
number is represented uniquely. Subject to this convention, let the 
numbers, in order, be exhibited as decimals 

7*n V 12 7*13. 

7*21 Viz 7*23. 

7*31 7*32 7*33 . 


where each p stands for one of the digits 0, 1, 2, ... 9. 

It is assumed that we are in possession of a set of rules by means of 
which the rath digit of the wth number can be determined, for each pair 
of values of rn and n f by means of a finite number of applications of the 
given set of rules. Let it be assumed that, if possible, all the real numbers 
in the open interval (0, 1) occur in the above sequence. If now a number 
in the open interval can be determined which does not occur in the above 
sequence, a contradiction will have been shewn to be involved in the 
assumption that all the numbers in the interval occur in the above 
sequence. Now such a number can be defined, for example, by the 
following rules. Let the nth digit of the number be p nn = p nn + 1, unless 
p nn = 8 or 9; if p nn = 8 or 9, let p nn = 0. The number -Pup^pss ••• Pnn ••• 

* Jahresbericht der deutschen math. Ve.re.inig. vol. i (1892), p. 75. 
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differs in respect of at least one digit from every number in the above 
sequence, and therefore cannot occur in the sequence. The contradiction 
in the original assumption is thus established. 

It will be observed that, in general, when any sequence of numbers 
in the interval a < x < b has been defined, the existence of the sequence 
provides the means of defining other numbers in the interval that do not 
occur in the sequence. 

THE POWER, OR CARDINAL NUMBER, OF AN AGGREGATE 

61. A notion of fundamental importance in the theory of aggregates 
is that of the power , or cardinal number , of an aggregate. This notion will 
be considered more generally and fully in Chapter iv, where it will be 
shewn that the power of an aggregate is the generalization of the notion 
contained in the cardinal number of a finite aggregate. At present, an 
account of the notion of the power of an aggregate will be given, so far 
as it is necessary for the application to the case of sets of points. 

Two aggregates of objects are said to have the same power , or cardinal 
number , when a (1, 1) correspondence can be established between them , so that 
each element of either of the aggregates corresponds to one single element of 
the other . 

Finite aggregates have the same power when they consist of the same 
number of elements, i.e. when they have the same cardinal number. Of 
aggregates which are not finite we consider first enumerable aggregates. 
Every enumerable aggregate has the power of the aggregate of integral 
numbers; and this we may denote by a. It has been shewn above that, 
if from an aggregate of power a any elements be removed, then the re¬ 
maining aggregate, provided it contains a non-finite number of elements, 
has still the same power a. It has further been shewn that the composite 
aggregate formed of a finite, or enumerable, set of enumerable aggregates 
has the same power a. It follows, as an interesting case, that the set of 
all those points of a p-dimensional space whose coordinates are rational 
numbers has the power a of the set of integral numbers, or of the rational 
numbers in a given linear interval. 

It is easily shewn that the power of the set of all the points in an 
interval (a, b) is the same as that in any other finite interval, say (0, 1) ; 

u_ a 

for , - = £ establishes a (1, 1) correspondence between the points x of 

o a 

x 

(a, 6) and the points £ of (0, 1). Again, the relation —f estab- 

vx 2 h 2 

lishes a (1, 1) correspondence between all real numbers, and those within 
the interval (— 1, 1); and thus the power of all real numbers is the same 
as that of all those in a finite interval. This power is called the cardinal 
number of the continuum, and may be denoted by c. 
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As regards unenumerable aggregates in general, it can be shewn that 
the power of such an aggregate is unaltered by removing from the aggre¬ 
gate any elements which form an enumerable aggregate. Let A denote the 
given aggregate, and a the enumerable aggregate which is removed; and 
let B denote the remaining aggregate, which cannot be enumerable, for 
otherwise ( a , B) f or A , would be so also. From B, suppose an enumerable 
aggregate a to be removed, leaving the aggregate ( 7 , thus A = (a, a , G), 
B = (a\ C). Now (a, a') and a\ being both enumerable, have the same 
power; and a (1, 1) correspondence therefore exists between their elements; 
and since A and B have the aggregate C in common, it therefore follows 
that A and B have the same power. As an example of this theorem, we 
see that the set of irrational points in a given interval has the power c of 
the set of all numbers in the interval. Again the set of transcendental 
numbers in a given interval has the power c of the continuum; whereas 
the set of algebraical numbers in the same interval has the power a. In 
this proof, the assumption has been made that the unenumerable aggre¬ 
gate B necessarily contains an enumerable part a . 

The known infinite sets of points defined in accordance with the 
methods usual in the theory of sets of points, in a line or in a continuum 
of any number of dimensions, have either the power a or the power c; 
but it has not yet been established that the assumption of the existence 
of an infinite set of points which has neither the power a nor the power c 
leads to a contradiction. Other aggregates have been contemplated which 
have a power higher than c ; these will be referred to later, in dealing with 
the theory of functions. 


62. The p-dimensional continuum has the power c of the one-dimensional 
continuum*. 

To prove this theorem, we use the fact that any irrational number in 
the interval (0, 1) can be exhibited uniquely as an infinite continued 
fraction y j j 

d\ f t ••• 'f + •••) 

where a otj, ... ... are determinate integers for any given irrational 

number x in the interval (0, 1). Let 
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* Cantor, Crelle'e Journal , vol. lxxxiv (1878), p. 242. 
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thus, corresponding to any value of x f a set of irrational numbers x Y , x 2 ,... x p 
is uniquely determined, and conversely, corresponding to any set of irra¬ 
tional numbers ^,x 2 , ... ^,an irrational number x is uniquely determined. 

It has thus been shewn that the irrational points of the linear con¬ 
tinuum (0, 1) correspond uniquely to those points of the p-dimensional 
continuum in which each coordinate is in the interval (0, J), and is irra¬ 
tional. It has been shewn in § 61 that the set of irrational values of x lf 
in the interval (0, 1), has the same power as the set of all the numbers in 
this interval. 8ince this holds also for x 2 , x 3 , ... x p , it follows that a (1, 1) 
correspondence exists between that set of points in the n-dimensional 
continuum, for which x } , x 2 , ... x v all have irrational values, and the set 
in which x x , :r 2 , ... x v have all values rational or irrational; thys these 
sets have t he same power. Hence the set of all points of the /^-dimensional 
continuum, in which each coordinate is in the interval (0, 1), has the 
same power as the set of all points in the linear interval (0, 1). It has thus 
been shewn that the p-dimensional continuum has the same power c as 
that of one dimension. 


THE ARITHMETIC CONTINUUM 

63. The arithmetic continuum being regarded as obtained by ad¬ 
joining to the set of rational numbers the set of all their limiting points, 
the question arises how far it is legitimate to consider the complete set so 
obtained as constituting a single object, determined by means of the 
elements of which it is composed. A finite set of numbers, or points, con¬ 
stitutes a single object determined by means of its parts, in the sense 
that those parts can be exhaustively exhibited by means of a finite number 
of specifications representable by a finite number of symbols. An enumer¬ 
able set of numbers, or of points, in particular the set of rational numbers, 
is not determinate in the sense that the elements of the set can be ex¬ 
haustively exhibited; but it is determinate in the sense that a rule of 
tabulation can be given, such that each particular number of the set 
occupies a determinate place in the table; and each particular number 
can be represented by means of a finite number of symbols. Such a set may 
be regarded as an aggregate, or single object, in the same sense in which 
the natural numbers 1, 2, 3, ... may be regarded as forming an aggregate 
(see § 4). When we come, however, to the case of the continuum, or 
aggregate of all real numbers, the fact that this aggregate is unenumerable 
introduces a new element into the question of the legitimacy of considering 
the set of these numbers as forming a determinate whole, or as constituting 
a single object of thought. The set of real numbers cannot be tabulated 
in such a manner that no number fails to occur at some definite place in 
the table. In fact it has been shewn, in § 60, that the assumption of the 
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existence of such a table leads to contradiction. It thus appears that no 
set of rules or specifications can be given which suffice to determine suc¬ 
cessively all the numbers of the set; and no finite set of symbols can 
exhaustively exhibit- the numbers. The only sense in which the numbers 
of the set are determinate is that- each such number is the limit of a con¬ 
vergent sequence of numbers, taken from the unending table formed by 
the rational numbers. It may fairly be doubted whether such a negative 
specification of elements amounts to a valid synthetical definition of a 
determinate aggregate; this point will however be further discussed in 
Chapter iv, in connection with the general theory of aggregates. It will 
there be shewn that the arithmetic continuum has an order-type possessing 
definite characteristics which, in their totality, uniquely characterize it. 
This expresses the only kind of unity which can appertain to the con¬ 
tinuum, considered as a synthetic arithmetic construction. From the 
point of view of Arithmetical Analysis the existence of the aggregate of 
real numbers, as a single definite object, possessing assigned properties, 
may be regarded as a fundamental postulate; the validity of such postu¬ 
lation being subject to the law of contradiction. If it be held that we 
possess an independent knowledge of the existence of the geometrical 
continuum, derived by a process of idealization from our intuition of 
space, we may regard the function of the set of real numbers to consist, 
not in a synthetical formation of the concept of the continuum, but in¬ 
versely in an analysis of the contents of the continuum. It is difficult to 
see how precision can be introduced into the intuitional notion of the 
spatial continuum apart from some theory relating either to points or to 
infinitesimals; and the language employed in such a theory must be of a 
symbolical character amounting to the use of some kind of arithmetical 
notation. Regarding the geometrical continuum in this way as a single 
object of which we have a direct knowledge obtained from our intuitions 
of space and time, the reduction to a precise abstract form may be re¬ 
garded as being made upon the assumption that the system of rational 
numbers, with their limits adjoined, is adequate to the analytical descrip¬ 
tion of the continuum, in the sense that each point in the continuum is 
represented uniquely by a single real number, and that there is no point 
in the continuum which is not so represented. This amounts to a defini¬ 
tion, in a certain sense, of the contents of the geometrical continuum. 
Such definition is not the only possible definition, but it is a legitimate 
one, provided it suffices for the purposes we have in view in Analysis and 
Geometry, and provided it does not conflict with the concept of Continuity 
as derived from intuition. The generic distinction between a continuous 
geometrical object, and a point, or set of points, situated in that object, 
is not capable of direct arithmetic representation. This does not, however, 
impair the efficiency of Arithmetical Analysis in dealing with geometrical 
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objects. In Cartesian geometry, for example, Analysis is really concerned 
only with the points that can be determined in the geometrical objects 
with which it deals. This does not mean that a continuous geometrical 
object is analysed into points which are of necessity to be regarded as its 
“ parts,” but it does mean that, for the particular purposes of Analytical 
Geometry, an adequate treatment of it is to regard it as a set of points. 

Taking as data the set of rational numbers, an irrational number of 
the continuum is defined in some manner which involves the use of words 
and symbols. Such a symbol may have a unique meaning, or it may be 
of the nature of a variable having as its field some enumerable set of 
rational, or of integral, numbers already defined; for example the symbol n 
may be taken to denote any integer of the sequence of natural numbers. 
By means of a given stock of such words and symbols it is possible to 
define only an enumerable set of elements of the continuum, each word, 
or symbol, being employed a finite number of times only. On this ground 
it has been maintained* by Konig that the elements of the arithmetic 
continuum fall into two classes; the first class consisting of numbers 
capable of finite definition, and those of the second class being inherently 
incapable of finite definition. If this view were justifiable, grave difficulties 
would arise in the whole theory of the arithmetic continuum, which is 
the basis upon which Arithmetic Analysis rests. For an object that is 
inherently incapable of being defined finitely may be held not to be 
definable at all, and such an object is regarded by many, if not most, 
mathematicians as not being an existent object for mathematical thought. 
The fallacy involved in the introduction of this distinction appears clearly 
when the method and implications of Cantor’s second proof, given in § 60, 
that the continuum is unenumerable, are fully scrutinized. It was there 
shewn that if, by means of a given stock of words and symbols, a set of 
numbers forming an enumerable aggregate is defined, the existence of 
such aggregate enables us to introduce a new word or symbol, or to give 
a new meaning to an existing symbol, which will represent the enumerable 
aggregate itself, and can then be employed for the purpose of defining 
new elements of the continuum which do not occur in the aggregate. 
Such a new element has a finite definition in the same sense as that in 
which the elements of the enumerable aggregate in question have finite 
definitions. It must be remembered that an enumerable aggregate may 
itself be denoted by a single word or symbol that can be created or assigned 
ad hoc , whenever the enumerable aggregate has been defined. 

The essence of the proof in § 60 is then that there exists, and can 
exist, at any time, no stock of words and symbols which cannot be increased 

* See Konig, Math . Annaien , vol. lxi (1905), p. 156, also vol. lxiii (1907), p. 217. See also 
Richard, Acta Math . vol. xxx (1906), p. 295, and Hobson, Proc . Land. Math. Soc. (2), vol. iv 
(1907), p. 210, and Whitehead and Russell, Principia Mathemaiica , vol. I (1910), p. 64. 
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for the purpose of defining new elements of the continuum. The theorem 
that the continuum is unenumerable is equivalent to the fact that the 
assumption of the existence of a final and complete stock of words and 
symbols, by means of which alone elements of the continuum can be 
defined by finite definitions, leads to a contradiction. Those elements of 
the continuum which are at the present time, or will be at a future time, 
capable of definition by means of the words and symbols in existence, 
may form an enumerable set; but this does not prove that there exists 
any element of the continuum that is inherently incapable of finite de¬ 
finition. In fact it can be proved that those elements of the continuum 
that remain when the hypothetical elements incapable of finite definition 
are removed, form an aggregate which has all the properties of the con¬ 
tinuum, and can therefore be identified with it. 

Let us consider the continuum of real numbers in the interval (0, 1), 
and let G denote the set of those numbers in it that are capable of finite 
definition. In the first place, the set G is dense in itself, for any element 
of it is the limit of a sequence of rational numbers, all of which belong 
to G. Next, the set G is closed; for if every number of a convergent 
sequence {x n } belong to G, it can be proved that the limit of the sequence 
belongs to G. Denoting the set { x n } by E , the number which is the limit 
of the sequence can be finitely defined as follows: Let the numbers of E 
be represented by non-terminating decimals so that in none of them are 
all its digits, from and after a fixed one, equal to 9. Let the m ih digit of 
a number x be defined as that digit which is identical with the m th digit 
of an infinite number of the elements of E . The number x so defined is 
the limit of the sequence of numbers all of which belong to G y and x, 
being thus finitely defined, itself belongs to G. 

The set G is thus dense in itself and closed, and is therefore perfect; 
it is clearly everywhere dense in the interval (0, 1). It has therefore all 
the properties of the continuum, and therefore the hypothetical non- 
definable elements can be disregarded. 

It will be observed that the elements of the continuum cannot be 
exhaustively represented by any finite set of symbols, each used a finite 
number of times. Thus there can exist no systematic notation which 
suffices to represent all the numbers of the continuum. 

TRANSFINJTE ORDINAL NUMBERS 

64 . The theory of transfinite ordinal numbers had its origin* in the 
investigation of the theory of sets of points. The general abstract theory 
of such numbers, or order-types, will be deferred until Chapter iv; it is 

* An account of Cantor’s earliest presentation of this subject will be found in Math. Annalen, 
vol. xxi (1883), p. 545. 
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necessary however to introduce here the conceptions connected with the 
formation of these numbers, with a view to utilizing them in the theory 
of sets of points. 

Let P 19 P 2 , ... P n , ... denote a sequence of points in a given interval 
AB, representing a sequence a Y , a 2 , a B , ... of increasing numbers, so that 

«i < a 2 < a 3 < ... < a n < .... 

This sequence of points has a limiting point which is not one of the points 
of the sequence, and is on the right of all those points; this limiting point 
we may denote by P w . The symbol w may be regarded as denoting a 
new ordinal number which comes after all the ordinal numbers 1, 2, 3, ... 
n, ...; it is called the first transfinite ordinal number. The number w is not 
contained in the sequence of finite ordinal numbers, but comes after all 
of them; and we shall see that it may be taken as the first of a new 
sequence of ordinal numbers, all of which must be regarded as ordinally 
greater than the finite ordinal numbers. 

iWi _tw„ + _ 

". ' . B 

Fig. 1 

Suppose that beyond the point P m there are other points which we wish' 
to regard as belonging to the same set as the points P,, P 2 , ... P n , ... ; 

then these points will be denoted byP w)1 , P w+2 , ... P w+n , ...; and if these 
points are finite in number, there will be one of them P w1m which is the 
last on the right. The indices of all the points of the set will then be 

1, 2, 3, ... n, ... co, co + 1, co + 2, ... co -f in; 

and the numbers co, co -f 1, ... + m are regarded as a set of transfinite 

ordinal numbers, which commences with the first transfinite ordinal 
number co, and contains the m succeeding transfinite ordinal numbers. 
It may however happen that the set of points P«, P„ + i, P„ t *, ... has no 
last point. In that case, assuming that the points are all contained in a 
finite interval, the set has a limiting point which is not contained in the 
set itself; and this limiting point we denote by P^ or P w . 2 , where co.2 
is an ordinal number which is not contained in the set co, at + 1, w + 2, ..., 
but comes after the numbers of that set. 

If we wish to include further points which are on the right of P w . 2 , 
we must introduce numbers denoted by oj. 2 -f 1, ok 2 + 2, ...; and, in 
case these points form an infinite set in a finite interval, they will have 
a limiting point which will be denoted by Pa,. 2 +a» or P w . 3 . We have now 
the ordinal numbers 

I, 2, 3, ... a> + 1, a> -}- 2, ... co.2, a>.2 -f- 1, co.2 -f 2, ... co.3. 

If we proceed further in this manner, it is clear that we may require 
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numbers to.n,to.n -f J, to.n 4 - 2 , ... c o.n 4 1 , ..., where n denotes any 
finite number. 

Further, it may happen that the set of points P w , P M . 2 , P*. . 3 , • • • P*..«, 
is itself infinite, and has a limiting point on the right of all these points. 
This point we denote by Pj ; and the number to 2 we consider to be a new 
ordinal number which succeeds all the numbers w. n 4- m, where n and m 
have all possible finite values. 

Points on the right of Pj may be denoted by means of the indices 
to 2 4 I, qj 2 4 - 2, to 2 4 - 3, ...; and if these points are infinite in number, they 
may have a limiting point Pj ^. 

Points on the right of P w % w may be denoted by the indices o > 2 4 - to -r 1, 
to 2 f to 4 * 2 , if these have a limiting point, it will be denoted by the 
index to 2 \ to. 2 . Proceeding in this manner, we may have points of which 
the indices are to 2 4 - o>.3, ou 2 f w.4, — If there is an infinite set of such 
points, and the set has a limiting point, on the right of the set, this 
limiting point will have to 2 4 to 2 , or to 2 .2, for its index. 

If we proceed still further, we see as before that we may have to con¬ 
template numbers of the form to 2 .p to.q 4- r, where p , q, r are finite; 
afterwards to 3 , to 3 j- 1, ..., aP.p 4- oj 2 .q 4- to.r 4 - s, &c. The general type 
of ordinal numbers which can be obtained in this manner is represented 
by <o n .p n 4 - co n ~ 1 .p n ^ 1 4- ... -f co.p x 4 - p Q \ and it is clear that, for the re¬ 
presentation of points of a given set, such numbers may be required 
as indices. 

It may happen that the set of points whose indices are to, to 2 , to 3 . ... 
is not finite; then the limiting point of such set will be denoted by the 
index to 10 . Starting afresh wdth this number, we may form numbers such as 

If the points whose indices are to*, to* 2 , to* 3 , ... do not form a finite set, 
their limiting point will be denoted by to*' 0 . 

In a similar manner we may denote by e 1 the number which comes 
after the sequence to, to*, to*' 0 , to*' 0 " ...; and starting from c l5 we may simi¬ 
larly proceed to form further numbers in endless succession. 

65. All the ordinal numbers which can be formed in the manner above 
described are formed by means of the application of Cantor’s tw^o prin¬ 
ciples of generation (Erzeugungsprinzipien). 

(1) After any number another immediately succeeding it is formed by the 
addition of unity. 

(2) After any endless sequence of numbers , a new number is formed 
which succeeds all the numbers in the sequence , and has no number imme¬ 
diately preceding it. 
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All transfinite ordinal numbers which can be formed by means of these 
two principles of generation are said to be ordinal numbers of the second 
class. The finite ordinal numbers are said to be of the first class ; they are 
formed successively, starting with the number 1, by means of the first 
principle of generation alone. 

The numbers of the second class are of two essentially distinct species: 

(1) , non-limiting numbers, those numbers which have each a number im¬ 
mediately preceding them, and from which they are formed by the 
addition of unity; for example co + n, cd 2 . p 4- co.q -f 1, or -I- co -b 1: and 

(2) , limiting numbers, those which have no number immediately preceding 

them, from which they are formed by the addition of unity : for example 
a), (d 2 -f co, to* 4 - 2 are limiting numbers. 

Any particular number of the second class can be denoted by a finite 
number of symbols, but there is no upper limit to the number of symbols 
required to denote such numbers. 

Cantor has further postulated the existence of a number 12 which comes 
after all the numbers of the second class, and is t he first number of a new 
set which is called the third class. The number 12 cannot be obtained as 
the number which succeeds a simple sequence, by means of the second 
principle of generation; for every number which can be so obtained is 
itself a number of the second class. This number 12 can be obtained only 
by means of a third principle of generation, which postulates the existence 
of a new number coming after all the numbers of the complex formed by 
the application of the first and second principles of generation. The validity 
of the postulation of the existence of the number 12, and of the higher 
numbers of the third class, will be discussed in Chapter iv. 


66 . A fundamental property of the numbers of the second class may 
be expressed as follows: 

Let P 1 ,P 2 , P 3 , ... P n , ... P w , P<rti, ... be an infinite set of points such 
that either (1), there is a last point P? , where ft is some number of the second 
class , or (2), there is no last point , but every index occurs which is less than 
some limiting number y of the second class, whereas the index y itself does 
not occur ; the set of points is then enumerable . 


The sets 


Pl, 

P%, ■ 

..Pn,- 

p w , 

PlO+1 > 

• • P kW-n , 

P«.2, 

Put. 2+1, • 

•• ^2 +n 

P «. 3 > 

Pu.tol, • 



P«.r, P„. 





65 , 66 ] Transjinite Ordinal Numbers 95 

where every index less than co 2 occurs, form an enumerable aggregate of 
enumerable sets of points; and this has been shewn, in § 58, to be itself 
an enumerable set. Now consider the sets 


Px.Pt 

, P 3 > 

... 

P w^-1 ) • • • P u , 2 ) * • • P w . 3 

Pj, 

P»+l> 

... Pj+„, 

P wSwflj • - * 

p.‘.t. 

Pw.2U, • 

• • P<u 2 .2+™, 


Ps.„ 

P «*' . 311 > • 

• • Pw . 3 I oi, 



such that in the first set there is every index less than a> 2 , in the second, 
every index less than a; 2 .2, and so on. Each of these sets is enumerable, 
and there is an enumerable set of such sets; hence the whole set, which 
contains every index less than co 3 , is enumerable. In this manner it can 
be shewn that, if every index less than to n occurs, the set is enumerable. 
If the theorem holds for sets which contain every index less than ft, ft, 
ft, ..., then it holds for a set which contains every index less than ft the 
limiting number of the sequence ft, ft, ft , — For the points with indices 
less than ft, with indices > ft and < ft, with indices ^ ft and < ft, &c. 
form an enumerable sequence of enumerable sefys; therefore, by the 
theorem of § 58, the whole set with indices < is enumerable. Since the 
theorem holds for ft ~ co, ft - a; 2 , ft = co 3 , ... it holds for f3 — to By 
continual application of this method, since any number can be reached 
by means of the two principles of generation, and since every number is 
either a limiting number, or is obtained from one by adding a finite 
number, we see that the general theorem holds. 

It will now be shewn, conversely, that if a set of jx>ints P l9 P 2 , ... P n , 
... P*,, ... Pp, ... is enumerable , there must he some definite number y of the 
first or of the second class , such that y does not occur among the indices of the 
points , and such that every number less than y does so occur . 

In case y is a limiting number, there is no last point of the set; but 
if y is not a limiting number, there is a last point, viz. the one of which 
the index is the number immediately preceding y. 

To prove the theorem, we observe that, since the given set of points 
is enumerable, it may be placed in correspondence with a set of points 
Qi, Qzj ... Q?> ••• in which all the indices are numbers of the first class. 
Let us suppose that, if possible, no number y exists; and let P ai be the 
point of {P} which corresponds to the point Q x of {$}. Let Q Vl be the 
point of {$} of smallest index, such that the corresponding point of {P} 
has an index which is > a 1 ; denote this index by a?. Then let be that 
point of {#}, of smallest index, such that the corresponding point of {P} 
has an index > ; denote this index by ctg. Proceeding in this manner, 

we have a set of points Q lt Q Vl , Q Vii ... Q^, ... corresponding in order to 
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a set of points P aj , P a .,, P aa , ... P an , ... where a x < a 2 < a. A < ... < a„ < — 
There exists a number a of the second class, which is the limit of the 
sequence Oj, Og, ... a B , ...; and by hypothesis there exists a point P a , 
which has a for index. Now the set {$} can contain no point which 
corresponds to P a , because each point Q TI corresponds to a point of {P} 
with an index less than a, and thus there is a contradiction in the hypo¬ 
thesis that a occurs amongst the indices of the points of (PJ. Hence there 
exivst numbers of the second class which do not occur as indices in the 
set {P}, and these numbers form a set which is a part of the aggregate of 
numbers of the second class. In this set there must be a lowest number y, 
and this number y is the first which does not occur amongst the indices 
of the set {P}. That every part of the aggregate of numbers of the first 
and second classes has a lowest number will be shewn, in Chapter rv, to 
be a consequence of the structure of the ordered aggregate. 

EXAMPLES 

1. On a straight line AB, let us denote by l\, P 2 , P 3 , ..., those points at which the 

ratio AB/PB has the values 1, 2, 3,_ The point P 1 coincides with vl, and the point B 

can only be represented by P M . Now take any one of the segments P r P ri ,; this may for 
convenience be represented on an enlarged scale. Denote by Qr2' QrS' , the points 
on /vv.. at which P r P r+x /QP ril takes the values 1, 2, .*1, ...; thus P r] 1 can only be repr< - 
sonted by Q l ruJ . Supposing this to have been done with every segment P r P rXl of AB, let 
us imagine all the points Q to be marked on AB, and to be numbered from left to right. 

In P x P 2 , we shall have 1, 2, 3, ... a>, 

in P 2 P 3 there Avill be w + I, w +2, ... to.2, 

and in P 3 P 4 w . 2 + 1, w . 2 -+ 2, ... u >. 3; 

the point B can be represented only by u> 2 . If now we proceed to take each segment 
QrsQr,a+ 1 > an( f to divide it in a similar manner, at points B for which QraQr.s^i/PQr mi 
has the values 1, 2, 3, ..., and then imagine all the points R obtained in every such seg¬ 
ment Q TS Q r S+1 to be marked on AB, and numbered as before, from left to right, it will 
be seen that all the numbers c » z p + tuq + r will be required, and that the point B can be 
represented by o> 3 . The points P x , P.,, ... P w will have for their ordinal numbers 

], oj 2 , or.2, a> 2 .3, ... a> 3 ; 

the point Q rs will be numbered cu 2 .r -+ <v.s; the finite numbers are all used up in the first 
sub-segment of AB. By proceeding to further subdivision, we may exhibit on AB, the 
ordinal numbers w n p n ■+ />„_! + ...+ p 0 > and the point B w ill then he represented 

by o> w+1 . 

2. The properties of the integral numbers in relation to their prime factors may be 
employed to rearrange the series 1, 2, 3, ..., so that the numbers may be made to correspond 
with a series of ordinal numbers of the first and second classes. 

First take the primes 1, 2, 3, 5, 7, 11, ...; these correspond with the numbers of the 

first class 1, 2, 3, ... n, _ Then take the squares of the primes, omitting unity; we thus 

have 2 2 , 3 2 , 5 2 , 7 2 , ll 2 , ..., corresponding to w, w +1, w +2, ... w -t «,_ 

We then take the cubes of the primes, 

2®, 3®, 5®, 7 s , 11®, ..., corresponding to a>.2, a>.2 + 1, ... a*.2 + n, ..., 
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and in general, 2 r+1 , 3 r+1 , 5 7M1 , corresponding to cu.r, a >.r \ J, ... u>.i -* v, — We may 
then take the numbers ah which consist of the product of two prime factors; these, arranged 
in ascending order, correspond to or, m' -+ 1, ... a>- -t n, .... Next take the numbers a l b- y 
which consist of the squares of the last set; these correspond to or j w, w- + aH 1, — We 
then take the successive sets of numbers of the forms n :i h :i , u*b 4 . ...; we thus obtain the 
numbers which may be taken to correspond with 

or | to.2, to 2 4 to.2 -\ 1, ... or i co.3, ... or 1 to./> ; (j . 

all of which are less than or. 2. The sets of numbers of the forms 

a 2 b, (a 2 b) 2 , ... (a 2 b) n , ... a'’b, (u^b) 2 , ... (<i A h)'\ ... n x b, (a 4 6 )-, ..., 
may then be taken. Afterwards, we may proceed with the numbers which contain three 
different prime factors, and so on. It is clear that this mode of rearranging the integral 
numbers in their natural order, so that they correspond in the new order with ordinal 
numbers of the first and second classes, admits of great variety. In every case, there will 
be some lowest number of the second class, which is not employed in the correspondence 
established. 


PROPERTIES OK AGGREGATES OK CLOSED SETS 

67. The following theorem, due to Cantor, relating to a sequence of 
(dosed sets, each of which contains the next, will be established: 

J laving given a sequence of closed sets G l , G 2 , ... G m , ... each of which 
contains the next , there exists a closed set G M each point of which belongs to 
G m , for every value of m. 

The sets may be in any number of dimensions. 

Apply to the fundamental cell (or interval) in which G x is contained a 

system of nets J) 1 , I) 2 , ... D n) _ There must he at least one mesh d lJ of 

the net , that contains points which belong to all of the sets G m : for 
otherwise, for some value of //v, no mesh would contain a point of G rn . 
In case more than one mesh of Dj has this property, we may suppose d l 
to be that one of such meshes which has lowest rank in the ordered set 
of all the meshes of D 3 . Consider next those meshes of l) 2 that are in d A ; 
as before there must be at least one of such meshes which contains points 
that are in all the sets G m ; let d 2 be that one of such meshes which has 
lowest rank of all the meshes of I) 2 that are in d x . Proceeding in this 
manner, there is determined a sequence dj , d 2 , ... d m , ..., of meshes be¬ 
longing to D ly JJ 2 , ... D m , ... respectively, each of which contains the next, 
and each of which contains points that belong to G m for every value of m. 
The sequence {d m } defines uniquely a point P that is in all the meshes of 
the sequence. This point P is a limiting point of G m , for each value of m ; 
and since G m is closed, it belongs to G m . Thus the existence of at least one 
point of G m has been established. The set G„ is possibly finite, and is in 
any case closed. For, in an arbitrarily small neighbourhood of a limiting 
point P of the set <2*,, there exist points of G„, and therefore points of 
G m ; hence P is a limiting point of G m . Since G m is closed, P must be a 
point of G m for each value of m, and therefore P belongs to G u . 
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The corresponding statement does not necessarily hold good for a 
sequence { G m } of sets, each of which contains the next, when the sets G m 
are not closed; for the point P, in the above proof, is then not necessarily 
a point of the unclosed set G m . The set G„ therefore does not necessarily 
exist. 

This is illustrated by the case in which G m is the linear set of points 

1 , , 1 _ ,_ The point zero is such that, in any neighbour- 

m m f 1 m 4 2 1 J a 

hood of it, points of G m exist for each value of m, but that point does not 

belong to G m ; consequently the set G w does not exist. 

The following generalization of Cantor’s theorem has been given* by 
Sierpinski: 

Having given a family F of closed sets , the necessary and sufficient con¬ 
dition that there exists one point at least which belongs to all the closed sets 
G, of F, is that every finite number of the sets G belonging to the family F 
has at least one point in common . 

This may be proved by the use of a system of nets. 

The following is a corollary of tins theorem : 

If F is a family of closed sets G, such that , if any pair of the sets G be 
taken , one of them contains the other , there exists at least one point common 
to all the sets G which belong to the family F. 

THE TKANSIINITE DERIVATIVES OF A SET OF POINTS 

68. If G denotes a set of points in a cell (or interval) (a, b ), it has been 
shewn in § 57 that the derivatives G a) > G {2) , ... G {n) > ... are all closed sets, 
and that all the points of any one of these sets, after the first, are con¬ 
tained in the preceding set. If G is of the second species, G (n) exists for 
ail values of n , and in this case, in accordance with the theorem of § 67, 
the set D (G a) , G {2) , ...) which contains points belonging to every G in) 
exists, and is closed. This closed set may be denoted by G {m \ where w is 
the first transfinite number. It is defined to be the derivative! of G f of 
order o>. In case G (w) contains more than a finite set of points, we can 
proceed to form successive derivatives of G <w) in a manner similar to that 
in which the derivatives G {1) , G l2 \ ... of G were formed. These successive 
derivatives may be denoted by 6 f(u,+1) , G {u>+2) , ... 6r (w+w) , ..., and are re¬ 
garded as the derivatives of G of orders co 4- 1, a> + 2, ... co -} n, _ They 

have the same properties as the derivatives of finite order, viz. that all 
the points of each are points of G {1) , and that all the points of any one of 
them are points of the preceding ones. 

It may happen that one of the derivatives # <w + w) contains no points; 

* Bulletin of Acad, of Sciences of Cracow, April—May, 1918. 
f See Cantor, Math. Annalen , vol. xvn (1880), p. 357. 
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then the process of forming derivatives has come to an end, the last one 
being G f(w+n " 1) . If this is not the case, a repetition of the above reasoning 
shews that the set D (G {Ui+1) , G { “+ 2) , ... G {uj+n) , ...) contains at least one 
point, and is a closed set; this set is denoted by 6r (w * 2) , and is defined to 
be the derivative of G of order to. 2. In the same manner we can proceed 
to form further derivatives, whose orders are numbers of the second class. 

In general, if a li a 2 , a 3 , ... a n , ... denote a sequence of numbers of the 
second class, whose limiting number is /J, the same reasoning as before 
shews that, if all the derivatives G {ai) , G (a *\ ... G (a ^, ... exist, then the set 
D (G (x '\ G M y ... G {a n\ ...) contains at least one point, and is a closed set. 
This is denoted by G {ti) , and is defined to be the derivative of G of order /?. 

If we form the successive derivatives of the set G, whose orders are 
the numbers of the first and second classes, it may happen that there is 
a first number y, of the first or second class, for which G { ^ =0; but this 
number y cannot be a limiting number of the second class. 

It may, however, happen that no number y, of the first or second class, 
exists for which (7 ( > ) = 0, so that derivatives of G exist of orders corre¬ 
sponding to all the numbers of the first and second classes. It will be 
shewn in § 82 that, if G does not vanish, for some number y, of the 
first or of the second class, then there necessarily exists a number /?, of 

the first or second class, such that G (3} = 6 J(a+1) = G (3+2) =_ This set G {3) 

is a perfect set, and it is frequently denoted by G {ll \ where fl is the first 
transfinite number of the third class. The notation G {a) may however be 
employed, independently of the acceptance of the theory of numbers of 
the third class. 

Conversely, if G {n) does not exist, G {y) must first vanish for some 
number y of the first or second class, which number cannot be a limiting 
number. 

EXAMPLES 

1*. Let G denote the enumerable set of points, each one of which is given by 

1 1 _1 1 

2« + 2»*4 rn l t m\ + * *' + 2 n + m i+ w *+ • • m « ’ 

where n has all positive integral values, excluding zero, and Wj, ra 2 , ... m n have all positive 
integral values including zero, independently of one another. 

It is easily seen that in G {n) , the points , ... all occur, and hence that G {tu) 

exists, and consists of the single point zero. 

2*. Let G denote the enumerable set of points, each one of which is given by 
11 1 1 1 
2 + 2 m i+ m » + * * * + 2 m i+ m *+ . • + m n + 2 ^* 1 +m,+ ... + • • ■ + m irHP+«i 

1 1 

+ 2» n i-fm 1 + ... + + 2 w i +m * + • • • +<?«+”".~+7p * 

* These examples were given by Mittag-Leffler, Ada Math . vol. iv (1884), p. 58. 
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where m 2 , ... m n , p, q 19 q 2 ,... q p have all positive integral values, including zero. In 
this case G (w + n) consists of the single point zero. 

3*. Let G denote the enumerable set of points, each one of which is given by 

111 1 J 

-4- 4 .- 4 - . 4 4- 

2 n 2”+ m i 2 n ^ m i+ m t 2 n + m i On-tw^-K. 

1 1 

+ 2 n + m i+... -I V t </, + * * ‘ h 2 n + m t 1 "'a 4- ■ • •"+ "'r.-T JH t/a . . -f Up ’ 

where ??, rn j, m 2> ... m n ,p, q ly ... ( 7 ^, have all positive integral values. In this case 6 ,<w * z> 
exists, and consists of the single point zero. 

SETS OF INTERVALS OR CELLS 

69. The properties of a set of intervals, or of cells, which intervals, or 
cells, are assigned in any manner, are closely connected with the pro¬ 
perties of sets of points, and will therefore be considered here in some 
detail. 

Let us suppose that a finite set of non-overlapping intervals has been 
defined, all in the finite interval (a, b). Denoting these intervals by 
(Oj, bj) (a 2i b 2 ) ... ( a n , b n ), let us suppose that all the intervals (a r , b r ) of 
the set are such that none of their lengths b r - a r are less than a fixed 
positive number e. Then the number n of the intervals cannot exceed 
(b — o)/e. For the 2 n points a lf b l , a 2i b 2 , ... a n , b n divide (a, b) into at 
most 2n -\ 1 parts, the sum of the lengths of which is b — a . These parts 
consist of the n given intervals, and in general of complementary intervals; 
it follows that the sum of the lengths of the n given intervals cannot 
exceed b — a; and thus that the number of the intervals cannot exceed 
(b - a)/<r. 

Next, suppose that any non-finite set of non-overlapping intervals is 
defined, all of the intervals lying in the fundamental interval (a, b). Choose 
a sequence e 1} e 2 , * w , ••• of positive decreasing numbers which converges 

to zero. The number of intervals of the given set which are of length e n 
is finite, since it does not exceed (b — a)/c n . The intervals of the given 
set can now be arranged in order of their lengths, taking first those that 
are ^ e l5 then those that are < € x and e 2 ; and so on; there being only 
a finite number of intervals in each set. In case a number of intervals 
are of equal length their order may be that in which they occur in (a, b) y 
from left to right. Therefore, since all the intervals of the given set can 
be arranged as a simply infinite aggregate, they form an enumerable 
aggregate. 

Next, suppose that the intervals of a given non-overlapping set are in 
the unbounded interval (— oo, oo), in which the position of any point is 
denoted by x. If we consider the correspondence £ = xjVx 2 -f 1, where 
* This example was given by Mittag-Leffler, Acta Math. vol. iv (1884), p. 58. 
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the radical always has the positive sign, the unlimited interval (— oo, oo) 
corresponds to the open segment (— 1, 1) in which the point f lies. The 
intervals of the given set correspond uniquely to intervals in (— 1, 1), 
and the set of these latter intervals is enumerable; hence the given set 
is so also. It has thus been shewn that: 

Every set of intermis in a bounded , or unbounded , interval which is such 
that no two of the intervals overlap is either finite , or forms an enumerable 
aggregate *. 

The corresponding theorem can be shewn to hold for a set of non¬ 
overlapping cells in two or more dimensions. Two such cells may have 
a portion of their boundaries in common. 

it will be sufficient to consider the case of plane cells; the method of 
proof can be extended to the case of cells of any number of dimensions. 

First, let a finite set of non-overlapping cells (a r (1) , a r (2) ; b r a) , b r {2) ) 
(r = 1, 2, 3, ... n) be contained in the cell (a (1) , a (2) ; 6 (1) , 6 (2) ), and let us 
suppose that 6 r (1) — a r {l) ^ c (1) , b r {2) — a r {2) ^ e <2) , for r — 1, 2, 3, ... n ; where 
€ {1) , e (2) denote fixed positive numbers. The number n of the cells cannot 
exceed (6 U) — a (1) ) (b {2) — a (2) )/e (1, e (2) . 

For, let the fundamental cell (a (1) , a (2) ; 6 (1) , b {2] ) be divided into parts 
by constructing straight lines parallel to the x {2) axis through the points 
on the x a) axis which are represented by the 2 n numbers a 1 (1) , 6 1 (l) , 
a 2 (1) , ^ 2 (l) } ••• a n l) > ^n (1, j an d by constructing straight lines parallel to the 
x il) axis through those points of the x (2) axis that are represented by the 
2n numbers a 1 (2) , V 2) , a 2 i2) > V 2) > ••• a n i2) , 

The sum of the products of the sides of these parts into which the 
fundamental rectangle is divided is clearly equal to (b n) — a (1) ) (6 (2) — a (2) ). 
Every one of the rectangles of the given non-overlapping set is either 
identical with one of these parts, or is the sum of two or more of the parts. 
It then easily follows that 

"if (b r a) - a r a) ) (b r {2) - a r (2) ) ^ (6 (1) - a (1) ) (6< 2 > - a< 2) ); 

r 1 

and from this it follows that the number of the rectangles of the given 
set cannot exceed (6 (1) — a (1> ) (b (2) — a (2) )/e (1) e (2) . 

Now let us consider a given non-finite set of cells, no two of which 
overlap, and all in the rectangle (a (1) , a <2) ; 6 (1) , 6 (2) ). Give to e (1) the values 
in a sequence of diminishing numbers e 1 (1) , e 2 (1) , ... € n (1) , ... which con¬ 
verges to zero; e (2) having the values in another such sequence e 1 <2) , e 2 <2) , .... 

The number of cells of the given set which are such that one side is 
c n (1) , and the other s e n {2 \ is finite; for it is not greater than 
(6< l > - a' 1 ') (b< 2 ' - a i2) )/€ n {1) e n (2) . 

* Cantor, Math. Annalen , vol. xx (1882), p. 118 et seq. 
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We can now arrange the cells of the given set in order, taking first 
those for which the sides are £ t 1 (1) and £ e 1 (2) ; then those for which the 
sides are < € 1 (2) and ^ € 2 (1) , e 2 (2) ; and so on; there being only a finite 

number in each set. In each of such finite sets the cells are arranged in 
descending order of magnitude of the products of the side, or, in case of 
equality, in order of rank. Thus, since the whole set can be arranged as 
a simply infinite aggregate, the set is enumerable. 

The case in which the cells of the set are in unbounded plane space 
can be considered as before by employing the correspondence 

- x n) /{(x a> ) 2 4 - 1}*, £< s » - x (v /{(x m ) 2 -h 1}*. 

We have now the general result: 

Any set of non-overlapping cells in p-dimensional space forms an enumer¬ 
able aggregate. 

It will be observed that, although geometrical language is employed 
above, the proof of the theorem is essentially arithmetical. 

70 . Every isolated set is enumerable. 

This theorem may be proved by applying the theorem in § 69. 

Let P (x) be a point of an isolated linear set. The positive numbers h 
can be divided into two classes, the first consisting of the numbers h such 
that the closed interval ( x , x 4 h) contains no points of the given set, 
except the point x , and the second class consisting of those numbers h 
for which this is not the case. The two classes will be separated by a 
number h l9 where x 4- h t is either a point of the given set, or else is a 
limiting point of the given set but does not belong to it. A similar interval 
(x — h 2 , x) may be defined on the left of x\ the interval (x — k 2 , x 4 ^i) 
contains no point of the given set in its interior, except x. Let p be the 
smaller of the two numbers and let the interval (x — p, x 4 p) 

correspond to the point x. Taking the set of all intervals (x — p, x 4 p) 
that correspond to the points of the given set, we see that this set of 
intervals is non-overlapping, and is consequently enumerable. Since each 
interval corresponds to the point x at its centre, it follows that the given 
isolated set of points is enumerable. 

In the case of an isolated set in space of two, three, or more, dimen¬ 
sions, it can be shewn that, with each point x as centre, a circle, sphere, 
or hyper-sphere of radius 2p can be so determined that no point other 
than x is interior to it, and such that this is not the case for any circle, 
or sphere, of greater radius. We then take the circle, or sphere, of radius p, 
corresponding to each point x of the set. The set of all such circles, or 
spheres, will be non-overlapping. In every sphere a cell with equal edges 
can be inscribed; and the set of all such cells will be non-overlapping; to 
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each cell there corresponds the point of the isolated set at its centre. It 
now follows, as before, that the given isolated set is enumerable. 

Any set of points G is the sum of an isolated set and of one which is 
a component of G'. It follows that, if the derivative O' is enumerable, so 
also is Q ; but the converse does not hold. 

Every set of points which is of the first species is enumerable. 

For, if 6* be the order of G Y , the set G (s) contains only a finite set of 
points; and therefore G {8 ~ 1} is enumerable. Consequently G {s ~ 2) , G r(i, " 3) , ... G 
are all enumerable sets. 

A set of points of the second species is enumerable if one of its trans- 
finite derivatives is enumerable. An unenumerable set G cannot have any 
enumerable derivative. 

71. Let us consider a given set of overlapping intervals, not necessarily 
enumerable, contained in a finite segment (a, b). Let $ be the set of those 
points of (a, b) each of which is an interior point of one interval at least 
of the given set. 

If P be a point of S, it is interior to an interval a of the given set, 
and therefore a neighbourhood of P exists, such that every point of it is 
an interior point of a ; and therefore all the points of that neighbourhood 
of P belong to S. Hence P is an interior point of 8; and therefore S is an 
open set. Now it may be shewn that: 

Every open linear set of points consists of the interior points of a finite , 
or enumerable , non-overlapping set of intervals. 

For, if x be a point of 8> it can be shewn, in exactly the same manner 
as in § 70, that an interval (x — h 2 , x + hj can be defined which has the 
property that every interior point of it belongs to S , but that x — h 2 , x + h x 
do not belong to S. In this manner, to each point x , of 8, there is corre¬ 
lated a definite interval 8 (x) ; to all the points interior to 8 (x) the same 
interval 8 (x) corresponds. Now let us consider the set of all such intervals 8 ; 
these intervals are non-overlapping, and thus form a finite, or enumerable, 
set of intervals. Their interior points are identical with the set 8. 

The following theorem has been established: 

Every set of intervals contained in a finite segment can be replaced by a 
set of non-overlapping intervals of which the interior points are the same as 
those of the given set. 

If we consider the given set as a set of open intervals, the theorem is 
equivalent to the statement that the set of points, each of which belongs 
to one or more open intervals of the given set, is itself an open set. This 
is a generalization, for the case of an enumerable set of open sets, of a 
theorem given in § 56. 
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The new net may be spoken of as the set of non-overlapping open 
•intervals equivalent to the given set of open intervals. 

In the case of a p-dimensional set, it is shewn in exactly the same 
manner as in the case p 1, that, having given a set of overlapping cells, 
the set S, of points, each of which is interior to on© at least of the cells, 
is an open set. The nature of an open set in space of more than one 
dimension will be considered later. 

72. The properties of any set of open intervals in a finite segment 
have been shewn to depend upon those of a non-overlapping set of such 
intervals; and we therefore proceed to the consideration of the latter. 

Every point of (a, b) which is not interior to an interval of the non¬ 
overlapping set is either 

(1) , a common end-point of two intervals of the given set; or 

(2) , a point interior to, or at an end of, an interval not belonging to 
the given set, this interval containing no point which is interior to any 
interval of the set; or 

(3) , a limiting point, on both sides, of end-points of intervals of the 
set; or 

(4) , an end-point of an interval of the given set, and also a limiting 
point, on one side, of end-points of intervals of the given set. 

If either a or b is an end-point of an interval, we reckon that point 
as belonging to the points (1). 

The points described in (2) and (3) may be described as external points 
of the given set; and if a or b is a limiting point of end-points, it will be 
reckoned as an external point. 

The points described in (4) may be spoken of as semi-external* points. 

Those points of the segment (a, b) which are not joints of a given set of 
non-overlapping open intervals form a closed set. 

This theorem is that particular case of the theorem proved in § 56 , 
that the complementary set of an open set (relatively to a closed interval 
in which it is contained) is a closed set, which arises when the open set is 
the given set of non-overlapping open intervals. 

It will now be shewn thatf, unless a given set of non-overlapping intervals 
is a finite set , there must be at least one external or semi-external point; in 
other words, the whole interval (a, b) cannot be filled up by an indefinitely 
great number of non-overlapping intervals, each one of which abuts on 
the next, without leaving at least one point over, which is neither interior 

* This term is due to W. H. Young, Proc. Land. Math . Soc. (1), vol. xxxv (1902), p. 260. 
t This theorem was given by W. H. Young, ibid. p. 261. 
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to an interval nor is an end-point of two intervals; the points a, h being 
regarded as end-points of two intervals if they are end-points of one 
interval of the given set. 

If there be any complementary intervals, i.e. intervals in which no 
point belongs to the given set of non-overlapping intervals, then the points 
of these intervals are all external points, and we therefore need only con¬ 
sider the case in which no such complementary intervals exist. We ob¬ 
serve that, when the number of intervals is not finite, their end-points 
must have at least one limiting point P. Now this point P cannot be 
interior to one of the given intervals; for, if it were so, it would have a 
neighbourhood, viz. the interval to which it is interior, within which are 
no end-points. Neither can P be a common end-point of two intervals; 
for it would then have a neighbourhood on the right, and also one on the 
left, within which there is no end-point except P itself. The point P must 
consequently either be an external point, i.e . one which is not an end-point 
but is a limiting point, on both sides, of end-points; or else it must be an 
end-point of one interval, and a limiting point, on one side, of end-points. 
If a, or 6, is not an end-point, it is regarded as an external point. It will 
subsequently be shewn that the external and semi-external points form 
a set which may be either finite, or of cardinal number a, or of cardinal 
number c. 

Having given an overlapping system of intervals , those jx>ints that are 
end-points of intervals of the system and are not interior jmnts of any interval 
of the system form a non-dense enumerable set. 

If P be a left-hand end-point of an interval Pp of the given set, and 
if P is not interior to any interval of the set, Pp cannot have in its interior 
any other left-hand end-point Q of an interval Qq of the given system, 
such that Q is not interior to any interval of the set. Hence all such 
intervals Pp are non-overlapping, and therefore form an enumerable set. 

Their left-hand end-points form an enumerable set, which is clearly 
non-dense. Similar reasoning applies to those points which are right- 
hand end-points of intervals of the given set, and are not interior to any 
interval. 

Thus the theorem has been proved. 

EXAMPLES 

_ l 2” - 1 \ 

L*. In the interval (0, 1) take the intervals (0, }), (f, jj). ... {■— 9 '<m+i ) anc * 

also the intervals obtained by reflecting these intervals in the point The point J is 
external to ail the intervals, and yet the limiting sum of the intervals is equal to 1, the 
length of the whole interval (0, 1) in which the enumerable set of intervals is contained. 

* See W. H. Young, Proc. Lond. Math. Soc. (1), vol. xxxv (1902), pp. 249-251. 
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If instead of reflecting the intervals in the point J, we take the interval (J, 1), the 
point i is now a semi-external point, and the limiting sum of the intervals is the same as 
before. 

( On-1_] on _]\ 

2« » 2 n41 j *** °* ^ erva ^ 8 ’ anc ^ divide each 

interval into a set of sub-intervals similar to the whole. We now have a new enumerable 
set of intervals which has no external points, but of which the semi-external points form 
an enumerable set J, fa , .... 


THE HEINE-BOREL THEOREM 

73. If a set A of intervals, all in the fundamental interval (a, b), be suck 
that every point of the closed interval (a, b) is an interior point of at least one 
interval of the set A (the end-points a , b being regarded as interior to an 
interval when either of them is an end-point of such interval), then a finite 
set of intervals, all belonging to A, exists, which has the same property as the 
set A, viz. that every point of the closed, interval (a, b) is interior to at least 
one interval of the finite set. 

This theoremf, which is known as the Heine-Borel theorem, is of great 
importance in the Theory of Functions, and may be proved as follows: 

Apply a system of nets with closed meshes to the interval (a, b). There 
must exist some value of n such that every mesh of the net D n is interior 
to one interval at least of the set A. For, let us suppose, if possible, that 
this is not the case. Let a n denote those meshes of D n each of which is 
not interior to any interval of the set A. By hypothesis, a n exists for 
every value of n ; and it is clear that the closed sets a 1 , , ... a n , ... are 

such that each set contains the next. In accordance with the theorem 
established in § 67, there is at least one point x that is contained in all the 
closed sets {a n }. The set of all such points x is closed; let ./• be the lower 
extreme of the set. The point x is interior to some interval (x — e, x + c') 
of A; or to an interval (a, a + c) in case x =-= a; or to an interval (b — c', b) 
in case x — b. 

When n is sufficiently large, d n the maximum breadth of the meshes 
of D n is less than both € and e ; or, when x = a, is less than e'; or when 
x = b, is less than e. For such a value of n, x is either interior to a mesh 
of Z) n , which mesh is interior to the interval (x — e, x + e'), or else x is 
a common end-point of two meshes of D n both of which are interior to 
that interval; or x may be at a or at b. In any case a mesh of a n is interior 
to an interval of A; contrary to the supposition made above. 

♦ See W. H. Young, Proc. Land. Math. Soc. (1), vol. xxxv (1902), pp. 249-251. 
t The first statement of the theorem was given by Borel, Annales sc. 4c. normals (3), vol. xii 
( 1895), p. 51. See also W. H. Young, Proc. Loud. Math. Soc. (1), vol. xxxv (1903), p. 301 and 
Jlendiconti di Palermo , vol. xxi (1906), p. 125. The most general statement is that given by 
Gross, Wiener Sitzungsber. vol. cxxm i a (1914), p. 809. 
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It has thus been shewn that, for a sufficiently large value of n, each 
of the meshes of D n is interior to one at least of the intervals A; if, with 
each mesh of D n there be associated an interval of A to which it is interior, 
the finite set of such intervals of A is a set such as is required. 

In the above theorem it is not assumed that the given set of intervals A 
is enumerable. It has been proved that the interval (a, 6) can be divided 
into a finite number of parts, each of which is interior to a set of intervals 
belonging to A. Each such set of intervals may be finite, or infinite 
(enumerable or not). The effective selection of a finite set of intervals 
of A, which satisfies the condition of the theorem, presupposes a know¬ 
ledge of the mode in which the set of intervals A is defined. The theorem 
does not by itself provide any general method of making such effective 
selection. In the most general case, the interval (a, b) is divided into a 
finite number of parts, each of which has associated with it an (in general) 
infinite set of intervals of A, to all of which it is interior; the effective 
selection of the required finite set of intervals involves the choice of one 
of the possibly infinite set of intervals all of which contain one of these 
finite parts of (a, b). The number of such choices to be made in each 
particular case, being finite, the principle known as the “ multiplicative 
axiom/’ which will be discussed in Chapter tv, is not required in its most 
general form. 

It is clear that the above proof is applicable to the case in which cells 
in any number of dimensions take the place of linear intervals. The 
method of employment of systems of nets is precisely the same as in the 
case of the linear system. Accordingly we have the Heine-Borel theorem 
for the case of p-dimensionai cells. 

If a set of p-dimensional cells A (p) , all in the fundamental cell (a, 6), be 
such that every point of the closed cell (a, b) is an interior point of at least 
one cell of A (p) (a point on the boundary of (a, b) being regarded as interior 
to a cell to whose boundary it belongs ), then a finite set of cells , all belonging 
to A (p) , exists , which has the same property as the set A (p) . 

74 . The Heine-Borel theorem can be generalized so as to apply to the 
case in which the points that are to be interior to a finite number of 
intervals, or cells, form any given closed set G , instead of consisting of 
all the points of the closed cell (or interval) in which the given set A (p) 
is contained. The generalized theorem may be stated as follows: 

Having given a ( bounded) closed set G , and a set of cells A (p) , such that 
each point of G is interior to one cell at least of the set A (p) , there exists a 
finite set of cells , all belonging to A (p) , such that each point of G is interior 
to one at least of the cells of the finite set . 

To establish this theorem a slight modification only of the proof of 
the theorem when G consists of all the points of (a, b) is required. We 
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may choose the cell (a, h) so that it contains G , and we can neglect any 
part of the set A (v) that is not in {a, b). Those meshes of the nets D l , D 2 , ... 
which contain no points of G are throughout disregarded. The finite set a n 
consists of those meshes of D n that contain points of G and are not interior 
to any cell of A (p) . It is then clear that the point x is necessarily a point 
of the closed set G. 

If the given set G were not closed, the point x might be a limiting 
point of G which did not belong to G, and thus the argument would 
break down. 

The following theorem may be established by employing the generalized 
Heine-Borel theorem: 

Having given a closed set, an enumerable set can be determined which has 
for derivative the given set. 

With each point of the closed set G as centre let there be taken a cell 
of span d. A finite number of these cells are such that every point of G is 
interior to one of them; let this set be C d , and let P j (d) , P 2 id) , ... P nd id) 
their centres, all points of G. Let d have a sequence of decreasing values 
d x , d 2 , ... converging to zero. 

The totality of the points Pf d) , ... P nd d) y taken for every value of d 
in the sequence, is an enumerable set Ii of points of G. Since H consists 
of points of the closed set G, every limiting point of H belongs to G. 
Every point of G which does not belong to H is interior to all the cells 
of a sequence of cells of spans d x ,d 2 , ... and is therefore a limiting point 
of the centres of the ceils of this sequence; therefore a point of G which 
does not belong to // belongs to H'. Considering those points of G which 
belong to H, but not to H ', we see that these are all isolated points of G , 
and each one of them is the centre of some cell which contains no point 
of G other than its centre. In each such cell place an enumerable set of 
points having the centre of the cell for its sole limiting point. Let the 
totality of all these enumerable sets be the set K, which is enumerable, 
since the isolated points of G form an enumerable set. The set K' consists 
of the points of G which belong to H but not to H', together with the 
limiting points of the centres of the cells employed, and these also belong 
to G. The enumerable set II -f K has G for its derivative. 

75. It has been pointed out* by de la Vallee Poussin that the generalized 
form of the Heine-Borel theorem still holds good in case A (p) is replaced 
by an aggregate S of open sets. More generally, instead of A (p) , any 
aggregate of sets may be taken, such that each point of G is an interior 
point of one or more of the sets of the aggregate. This may be stated as 
follows: 

* See Integrates de Lebesgue, pp. 15, 112. 
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If a closed (bounded) set G and an aggregate S of sets be such that every 
point of G is an interior point of one at least of the sets of 8, a finite number 
of the sets of 8 exists such that each point of G is an interior point of at least 
one of them. 

To prove this theorem, it may be observed that, with each point P 
of G a eeJl may be associated, with P in its inferior, and such that the 
cell is interior to one or more of the sets of S. The aggregate of all such 
cells may be taken to be the set A (p) in the generalized form of the Heine- 
Borel theorem. A finite part of the set A (J>) exists such that each point 
of G is interior to one or more of the cells of this finite set. Each of these 
cells is contained in a set belonging to 8. Hence there exists a finite 
aggregate of the sets of 8 which has the required property. 

In case the sets 8 form an aggregate H lt // 2 , ... H m , ... IIp , ... such that 
each one is contained in the next , the closed set G, each jx>int of which is in 
one or more of the open sets, must be contained in II $, where ft is some fixed 
number of the first or of the second class. 

For a finite number of the sets II exists such that every point of G is 
contained in one or more of them. If p be the index of that one of tills 
finite set which has the highest index, G must be contained in Hp. 

From this theorem, the following may be deduced: 

If G 1 , G' 2 , ... G it)y ... Op , ... are closed sets such that each conta ins the next , 
and F is a closed set of points such that no point of F is contained in all 
the sets G, there exists a definite number of the first, or of the second , class , 
such that no point of F belongs to Gp . 

To prove this, we consider, with respect to some cell containing G 1 , 
open sets C (Gf), C (G 2 ), complementary to G Xi G 2 , .... Each of these open 
sets is contained in the next. Any point of F is contained in some of 
the sets C (G x ), C (G 2 ), ...; and the set F is therefore contained in C (Gp) 
for some value of fS, and is therefore not contained in Gp. 

It has already been pointed out that, when G is a set that is not closed, 
and each point of G is interior to at least one of the cells of a given set, 
there does not necessarily exist.any finite part of the given set of cells 
such that every point of G is interior to one of them. However, in any 
case, an enumerable part of the given set of cells exists which has this 
property. Thus: 

If G be any set of pomts , not necessarily closed , and, each point of G be 
interior to one at least of the cells of a given unenumerable set A (p) , there 
exists an enumerable set of cells all belonging to A (p) , such that every point 
of G is interior to one at least of the cells of this enumerable set. 

This theorem can, as in the case of the Heine-Borel theorem, be proved 
by means of a system of nets applied to the finite, or infinite, interval, or 
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cell, in which G is contained. Moreover*, instead of A (J>) , an aggregate of 
sets can be substituted, such that every point of G is an interior point of 
at least one of the sets of the aggregate. 

Of a net D n , for any large enough value of n , say n l9 some of the 
meshes containing points of G will be interior to one or more of the cells 
(or intervals) of the given unenumerable set. Let these meshes be 
denoted by ft. For values of n greater than n x , we consider only those 
meshes of D n that are not contained in ft. For some such value of n , 
say n 2i there will be meshes ft, containing points of G, not contained 
in ft, each of which is contained in one or more of the intervals, or cells, 
of the given set. We obtain in this way a sequence ft, ft, ft, ... of finite 
(or enumerable, in case G is unbounded) sets of meshes, each of which 
contains points of G, and is contained in one or more of the intervals, or 
cells, of the given set. Each point of G must be contained in one of 
the meshes belonging to this set {ft. The meshes of ft, ft, ... form an 
enumerable set, and corresponding to each of such meshes, there exists a 
cell (or interval) of the given set. Hence there exists an enumerable part 
of the given set such that every point of G is interior to one or more of the 
intervals, or cells, of this enumerable part. 

In the choice of one out of a possibly infinite number of cells which 
have a given mesh in their interiors, the number of choices being in this 
case indefinitely great, the '‘multiplicative axiom” is in general involved. 

However, the following theorem may be established!, which is suffi¬ 
cient for applications, and in the proof of which no use is made of the 
multiplicative axiom: 

If each 'point of a given set is interior to a definite cell corresponding to 
that point , an enumerable set of these cells can be so determined that all the 
points of the set are interior to one or more of the cells of the enumerable set. 

If the set G be not bounded, we may place it in correspondence with 
a bounded set, so that, to the given cells, there corresponds a set of cells 
having the corresponding property with reference to the bounded set of 
points. There is accordingly no loss of generality in considering a bounded 
set only, which we can suppose to be contained in a fixed cell with equal 
sides. Instead of the cell of the given set A (P) corresponding to a point P 
of the given set G f we may substitute a cell A' (P) with the point P as 
centre and with its sides all equal, and take A' (P) to be the greatest such 
cell which has no point exterior to A (P). We have now, corresponding 
to each point P of the given set, a cell A' (P) with P as centre. Take a 
set of positive diminishing numbers {e n } which converges to zero, and 
let G n be that part of G which consists of points for which the spans of 

* See Lindelof, Comptes Rendus, vol. cxxxvh (1903), p. 697. 
f See Borel, Lemons sur lea fonetions monogenes, p. 12. 
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the corresponding cells A' are > e n . Since every point of G n is in a cell 
(with equal sides) of spans all > e„, every point of G n ' is in a cell of this 
system. Hence a finite set of these cells can be determined which contain 
in their interiors all the points of G n ' , and therefore all the points of G n 
with the possible exception of a finite number. By adding the cells corre¬ 
sponding to this finite number of points of G n , we obtain a finite set of 
the cells A' which contain all the points of G n . We have now a finite set 
of cells which contain all the points of G x , and a finite set which contain 
all the points of G 2 that do not belong to G x \ and generally, a finite set of 
cells which contain all the points of G n that do not belong to G l9 G 2 , ... 
or G n . Taking every value of n , we have now an enumerable set of the 
cells A' that contain all the points of G , and we take the set of cells A 
which correspond to the enumerable set of cells A' as the required set. 

76. J n a proof given by Heine* that every continuous function is 
uniformly continuous there is contained the germ of the theorem now 
called the Heine-Rorel theorem, as it was first explicitly stated and proved 
by Borelf, for the case of a linear interval (a, b). Various proofs of the 
theorem have since been published J. It was pointed out by H. F. Baker§ 
that the proof by Goursat|| of Cauchy’s fundamental theorem in the 
theory of functions of a complex variable contains a general method of 
procedure, equivalent to the use of nets made in § 73. The extension to 
the case of any closed set G was given^J by W. H. Young, and by Borel**. 

By means of an analysis of the reasoning of Heine, W. H. Young was 
led|t to the discovery of the following more general theorem*, called the 
Heine-Young theorem: 

If, with each point x of a closed interval (a, b), we have associated a pair 
of intervals r x and l x , such that, (1), x is the left-hand end-point of r x and the 
right-hand end-point of l x and, (2), if x' is an internal point of l x , then x is 
an internal or end-point of r x >; there then exists a finite number of the intervals 
r x abutting end to end and covering the whole segment (a, b). 

It is unnecessary to suppose r b and l a to be defined. 

Starting with a as end-point, we take the interval r a , or (a, a^); we 
then take the interval r Xl or (x ly -x 2 ); and so on. We thus obtain a series 
of intervals of the given set, (a, x x ) (x 1 , x 2 ) ... (x n _ lt x n ) _ 

* Journal J. reine und angewandtc Mat. vol. lxxiv (1871), p. 188. 
f Annqles sc. 4c. normals (3), vol. xn (1895), p. 50. 
f See for example Borel’s Lemons sur Vintegration, p. 105. 

§ Proc. Lond. Math. Soc. (1), vol. xxxv (1902), p. 459, and (2), vol. I (1903), p. 24. 

|i Trans. Amer. Math. Soc. vol. I (1900), p. 15. 

Proc. Lond. Math. Soc. (1), vol. xxxv (1902), p. 387. For a further extension of the theorem, 
see W. H. Young, Mess, of Math. vol. xxxin (1904), p. 129, and also Proc. Lond. Math. Soc. (2), 
vol. n (1905), p. 67. 

** Comptes Rcndvs, vol. cxl (1905), p. 298. 
ft Proc. Lond. Math. Soc. (2), vol. xiv (1914), p. 114, 
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If, for some value of n, # n == b, the condition of the theorem is satisfied 
by the n intervals thus obtained. Otherwise, the sequence x x , x 2 , ... has 
a limiting point X on the right of all the points {x n }. There is then an 
interval lx with X as right-hand end-point. 

All the points {r„}, except a finite number of them, are interior to the 

interval lx ', let these interior points be x n> x n+1 , _ By the hypothesis 

(2) in the statement of the theorem, r Xn must reach at least as far as X , 
and this is not the case, as it reaches only to x n+1 . Hence the sequence 
{x„} cannot be infinite. Thus the theorem is established. 

In case there exists no interval l b , so that the conditions of the theorem 
hold for the half-open interval (a, b), there exists an enumerable set of 
the intervals r x abutting on one another and covering the half-open 
interval (a, b). Por it is clear that in this case the point X must coincide 
with b. 

The following theorem, originally due to Lusin*, and employed by 
him in the theory of derivatives of a function, follows from the Heine- 
Young theorem: 

//, associated with every point x of a closed interval (a, b), we have all the 
intervals with x as end-point that lie in a certain neighbourhood of the point x 
on both sides (except for the points a, b where the neighbourhood is on one 
side), then a finite number of these intervals exists , abutting end to end , and 
covering the interval (a, b). 

At each point x we take for r r the smallest interval with x as left-hand 
end-point containing all those of the given intervals which have x for 
end-point, whether originally associated with x or not; and for l x the part 
of the given neighbourhood on Hie left of x. 

There is then a finite set of these intervals r x covering (a, 6), and 
abutting on one another. Let these be (a, x Y ) (x lf x 2 ) ... (x n _ 1} 6); and con¬ 
sider (x r _ ly x r ). We may choose an interval of the given set with x r _ 1 as 
left-hand end-point; if this interval does not reach to x r we add one of 
the given intervals associated with the point x r , and thus two of the given 
intervals make up (x r _ lt x r ). Doing this for each interval (x r _ ly # r ), by a 
finite number of choices, we obtain the required finite set which covers 
(a, b). 

By means of Lusin's theorem the Heine-Borel theorem can now be 
deduced. If we replace those intervals of A which contain x as interior 
point by all intervals with x as end-point contained in each one of the 
intervals of A, we have the condition in Lusin’s theorem satisfied. The 
finite set of abutting intervals which covers (a, b) having been determined, 

♦ Recueil de la soc. mat. de Moscou , vol. xxvm, 2 (1911). 
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we replace each one of the intervals of that finite set by an interval of A 
that contains it; this requires only a finite number of choices. Thus the 
Heine-Borel theorem is deduced. 


77. The following theorem will now be proved: 

If any unenumerable set of overlapping intervals in (a, b) be given , then 
an enumerable set of intervals all belonging to the given set exists , of which 
the interior points are the same as those of the given set . 

It has been shewn in § 71 that the given set can be replaced by a non¬ 
overlapping set of intervals with the same interior points. An interval of 
this second set is however not in general an interval of the given set. 

Let PQ be an interval of the equivalent non-overlapping set; then 
every internal point of PQ is an internal point of one interval at least of 
the given set. The point P is either an end-point of some interval Pp of 
the given set, or else it is a limiting point of end-points of an infinite number 
of intervals of the given set. In the latter case there exists an enumerable 
sequence P x p x , P 2 p 2 > P 3 Jh > ••• of intervals of the given set such that P is 
the limiting point of the sequence of points P l9 P 2 , ... P n , .... Similarly, 
unless Q is an end-point of an interval qQ of the given set, it is the limiting 
point of a sequence Q x , Q 2 , ... Q n , ... of end-p dnts of intervals q x (?,, 
q 2 Q 2i ••• InQm ••• of the given set. Consider the intervals P X Q X , P 2 Q 2y ..., 
P n Q n , • • • > where P ly P 2 , ... may be taken all to coincide with P in case 
the interval Pp belongs to the given set, a similar convention being made 
as regards Q. Since every point of P 1 Q x is interior to some interval of the 
given set, therefore, in accordance with the Heine-Borel theorem, a finite 
number of intervals of the given set exists, such that every point of P X Q X 
is interior to one at least of them. Let a similar determination of a finite 
set of intervals be made for each of the intervals P 2 Q 2 , PzQa, ••• P n # n > •••; 
we have then altogether an enumerable set of finite sets of intervals. The 
totality of these intervals forms a finite, or an enumerable, set of intervals 
belonging to the given set, which contains every point in the interior of 
PQ as an interior point. Applying the same process to each interval PQ 
of the equivalent non-overlapping set, and remembering both that the 
intervals PQ form a finite, or an enumerable set, and that an enumerable 
set of finite or enumerable sets is itself enumerable, we derive the con¬ 
clusion that an enumerable set of intervals exists, all belonging to the 
given set, such that their internal points are identical with those of the 
given set. 

The assumption, here made, that, if P be a limiting point of end-points 
of intervals of the given set, a sequence of such end-points exists which 
converges to P, will be discussed in Chap. iv. 
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THE LEBESGUE CHAIN OF INTERVALS 

78. K or the purposes of the Theory of Functions it is convenient to 
consider a set of intervals in a given linear segment, of the kind known 
as a Lebesgue chain. 

Jf (a, b) be a given linear interval , a set C a b of non-overlapping closed 
intervals such that every point of the closed interval (a ^ x - b) is either a 
left-hand end-point of one of the intervals or an internal point of one of these 
intervals , is said to for?n a chain stretching from a to b. The chain is sup¬ 
posed to have a last interval which contains b as interior point or as left-hand 
end-jK)int. 

To consider the nature of such a chain, let us suppose that we have 
defined an interval (a, with a as left-hand end-point, then a set of 
intervals (^,^ 0 ), {x 2 , xf), ... from left to right, in case the right hand 
end-point of one of these intervals is b, we have, by adding one interval, 
a chain consisting of a finite number of closed intervals, reaching from 
a to b. But, if this is not the case, the points x y , x 2 , ... x v , ... will have a 
limiting point £ x on their right, which may coincide with b. in which case 
we have, by adding one more interval, a chain reaching from a to b con¬ 
sisting of an enumerable set of intervals. But if t )1 < b\ starting from £ 1? 
a set of intervals (£ ly £ 2 ), (£ 2 , £ 3 ), is supposed to be defined. If these 
reach from £* to b ; either by taking a finite number, or by taking an 
enumerable set, together with one more interval, 8 , we have again a chain 
from a to b, consisting of the intervals 

(* 1 . *z)> (* 2 > *3) ••• (ll, ii), (ii, ii) ••• > 8 . 

But if £j, £ 2 , ... have a limiting point rj 1 (< b), we proceed as before to set 
up new intervals proceeding from 7 ^. This process may be carried on 
indefinitely, until a chain is defined reaching from a to b . That this will 
always happen when any set of rules is prescribed for the definition of 
the intervals is asserted in the following theorem*: 

If, far each point of the closed interval (a, b), there be assigned, by some 
prescribed set of rules, one single interval with the point as left-hand end-point , 
then, if X be any point in the closed interval (a, b), there is one and only one 
chain stretching from a to X, composed of intervals of the given set. 

It will be observed that the theorem asserts ( 1 ), the existence of a 
chain from a to X, and ( 2 ), its uniqueness. Also, if a chain stretching from 
a to X exists, and X' be any point on the left of X , a chain from a to X' 
exists which is a part, or the whole, of the chain from a to X. 

* See Pal, Iierid. di Palermo , vol. xxxiii (1912), p. 352; also G. C. Young, Quart Journ. of 
Math. vol. xlvh (1916), p. 142, and W. H. & G. C, Young, Proc. Lond. Math. Soc. (2), vol. xiv 
(1915), p. 128 
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If (a, x x ) be the interval corresponding to a, the theorem is clearly 
true for all points x such that a % x & x x . 

Let us assume, if possible, that there exists in (a, b) a set of points Q , 
for which the property in the theorem does not hold good. The set G 
must have a lower boundary point A (> x Y ), which may or may not 
belong to G. Let us first suppose that X does not belong to G ; it is conse¬ 
quently a limiting point of G. There is, by hypothesis, a unique chain 
C a x reaching from a to X . Since C a x has a last interval (X — e', X 4- e), 
or (X, X + c), the chain G a x reaches from a to £, where £ is such that 
0 < £ - X < e. Now C a x is the only chain that can reach from a to £; 
for any chain that reaches from a to £ must contain some chain that 
reaches from a to A r , and this can only be the chain C a x . Jt follows that, 
in a certain neighbourhood of X on its right, there are no points of 6r, 
which is contrary to the hypothesis that X is a limiting point of G. 

Next, let us suppose that X belongs to G, whether it be a limiting 
point of G or not. Consider a sequence of points x 1 , x 2i ... x n , ... of which 

X is the limiting point, and such that x 1 < x 2 < x 3 < ... < x n <_ A unique 

chain C a x » stretches from a to x n ; also if n > n, C a x n is a part, or the 
whole, of C a x »'. Consider the complete set C of intervals each of which 
belongs to C a *n, from and after some value of n. Then C\ together with 
the interval (A, X + e), corresponding to the point A", constitutes a chain 
reaching from a to A. For each point on the left of X is reached by C a x n 
for some value of n, and is therefore an interior point, or a left-hand end¬ 
point of an interval of C. Any chain reaching from a to X contains as 
a part a chain from a to x n , and this can only be C a x ». As this holds for 
every value of n, any chain from a to A r must contain C a x » for every value 
of n ; and therefore it contains C. Also C must be independent of the 
particular sequence {x 71 }; for any point x' m of another sequence must lie 
in an interval (x n , and the unique chain from a to x', n must be a 

part, or the whole, of the chain from a to x n +1 . Any chain from a to X 
can thus only consist of C and the interval (A, A + e); thus there is a 
unique chain reaching from a to A, contrary to the hypothesis that X 
belongs to G. It now follows that no such set as G can exist, and the 
theorem is therefore established. 

79. The reasoning by which Lebesgue established the existence of a 
chain from a to b depends upon the employment of transfinite numbers 
of the second class, and upon the principle that every enumerable ordered 
aggregate must be exhausted before some particular number of the second 
class is reached. If a set of intervals 

(a, x Y ) (x l9 x 2 ) ... (x n _ lf x n ) ... x» +1 ) ... (Xfi 9 x fi+l ) ... 

be numbered from left to right, the numbers employed will all be of the 
first, or of the second, class. If the process of construction does not cease 
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before some particular number of the second class is used, the set of 
intervals would be unenumcrable; and, as they are non-overlapping, this 
is impossible (§ 69). Thus, by a set of such intervals, starting with a , the 
point b must be reached, and the existence of a chain reaching from a to b 
is established. 

It may happen however that, corresponding to each point x of (a, 6), 
not a unique interval, but a finite or infinite set of intervals is defined, 
with x as left-hand end-point. In that case, the theorem as to the existence 
of a Lebesgue chain takes a more general form than that of Pal, given 
above, and the proof given in § 78 fails*. For chains from a to x 1 and 
from a to x 2 , where x 2 > x x , may both exist, but it does not follow that 
the former is part of the latter. The general existence theorem may be 
stated as follows: 

//, to each point x of the interval (a, b) (except b) there be assigned a set of 
intervals (finite or infinite) with x as left-hand end-point , an enumerable 
non-overlap ping set of the intervals can be so determined as to contain every 
point of the semi-open interval (a, b) as interior or left-hand end-point. 

A proof of this theorem not involving an infinite number of acts of 
choice is a desideratum. 

For most of the purposes of the theory of functions the more restricted 
form of the theorem suffices, in which a single interval is defined, with 
each point of (a, b) as left-hand point. 

CLOSED AND PERFECT LINEAR SETS 

80. The following theorem may be stated: 

Those points of the linear segment (a, b) which are not points of a given 
set of non-overlapping open intervals form a closed set of points. 

This theorem has been proved in § 56, where it was shewn that the 
complementary set of an open set, relatively to a closed interval in which 
it is contained, is a closed set. In case a, or 6, is a point of the set of open 
intervals, it is regarded as an interior point. 

Conversely, it has been shewn in § 56, that: 

Every closed set of points in the linear segment (a, b) is the complement 
of a non-overlapping set of open intervals. 

In accordance with the classification given in § 72 of the points which 
do not belong to a set of open intervals, it appears that: 

The most general linear closed set of points in an interval (a, 6) consists 
of (1), the end-points of a set of non-overlapping intervals , (2), limiting points 
of such end-points , and (3), the points interior to intervals every point of 
which belongs to the closed set . 

♦ W. H. & G. C. Young, Proc. Lond. Math. Soc. (2), vol. xiv (1915), pp. 128-130. 
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The open intervals belonging to the set C ( G ), complementary, rela¬ 
tively to (a, 6), to a given closed set G in (a, b) are said to be the intervals 
contiguous to, or complementary to, the said G. The set of all such intervals 
may be spoken of as the set of contiguous intervals , or complementary 
intervals , for the closed set 0. 

If the set G is non-dense, no interval exists in (a, b) which consists 
entirely of points of G ; and the set of contiguous intervals is then every¬ 
where dense in (a, 6), since no interval can be determined in (a, b) so as 
to contain no points of the set of contiguous intervals. Thus: 

Every linear non-dense closed set * in an interval (a, b) consists of the 
end-points of the intervals of an everywhere dense set of open intervals and of 
the limiting points of such end-points. 

The most general type of closed set is obtained by adding to a non- 
dense closed set all the points of some of the contiguous intervals. 

The points of a non-dense closed set G consist in general of three classes: 

(1) those which are common end-points of two contiguous intervals 
abutting on one another; 

(2) semi-external points of the set of contiguous intervals; i.e. points 
which are end-points of one interval, and also limiting points, on one side, 
of end-points; and 

(3) external points, i.e. such as are not end-points of any contiguous 
interval, but are limiting points, on both sides, of such end-points. 

The point a, or the point 6, if it belongs to G , may be regarded as 
belonging to (1) or (3), according as it is, or is not, an end-point of a con¬ 
tiguous interval. 

Those points which belong to (1) are clearly isolated points of G. 
Hence, if no such points exist, every point of G is a limiting point; and 
therefore G is perfect. It follows that: 

Every non-dense perfect linear set G consists of the end-points of an 
everywhere dense set of non-overlapping intervals (contiguous to G), no two 
of which abut on one another , together with the limiting points of these end¬ 
points. 

If the closed set G is such that no semi-external points exist, then 
every contiguous interval abuts on another one at both its ends. In this 

* This relation between everywhere dense sets of intervals and closed sets was discovered by 
Du Bois-Reymond and by Harnack. See Du Bois-Reymond’s Allgemeine Functionentheoric (1882), 
p. 188; also Math. Annalen, vol. xvi (1880), p. 128, where everywhere dense setfc of intervals are 
introduced. See also Harnack, Math. Annalen , vol. xix (1882), p. 239, and Bendixson, Acta 
Math. vol. ii (1883), p. 416, and 0/v. af Svensk. Vet . Fork. vol. xxXix (1883), part 2, p. 31. 
Proofs of the fundamental theorems based on the amalgamation of abutting intervals have been 
given by W. H. Young, Proc. Lond. Math. Soc. (2), vol. i (1903), p. 240, and by SohoenfUes, 
OdUinger Nachrichten, 1903. 
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case, all the points of G are either end-points of adjacent intervals, or 
limiting points, on both sides, of a sequence of such end-points; unless 
a or b be a limiting point, in which case it belongs to G. Such a set may 
be either enumerable, or it may contain a perfect set*, in which case it 
is unenumerable (see § 83). A closed set may be enumerable, and yet 
there may be contiguous intervals which do not abut on another con¬ 
tiguous interval. For example, let the contiguous intervals of a set-<7 in 
the interval (0, 1) consist of the open intervals 


( 0> 2 ) an<1 (2 


1 


1 


-f 


1 


f- 2 ’ 2 1 n 4- 1) 


where n — 1 , 2, 3,_ The interval ^0, ^ does not abut on another 

interval, so that J is a semi-external point. This set G is enumerable. 

To construct an unenumerable closed set which has no semi-external 
points, let 11 be a non-dense perfect set. In each interval ( a , p) contiguous 

k k 

to H , place the enumerable set consisting of the points a + n , p- n , 

where n — 1 , 2, 3, ..., and k is a number less than & a . The totality 

of these sets for all the intervals (a, ft) is an enumerable set K. The set 
II j K is closed and unenumerable; each of its contiguous intervals abuts 
on another one at both its ends. A semi-external point a, of II, is an 
external point of the closed set II 4- K. 


81 . Every non-dense. linear closed set is, in general, made up of an 
enumerable set and of a perfect set. 

Let the intervals complementary to the set G be arranged in enumer¬ 
able order, that of descending magnitude; we may denote them by 
Sj, 8 2 , ... 8 n , — If G is not perfect, it contains isolated points, each of 
which is the common end-point of two adjacent intervals; let 8 Pl be the 
first of the intervals {8} at an end of which there is such a point; let 8 JV 
be the interval which abuts on 8 Pl at that end. It may happen that the 
other end-point of 8 P , is also a common end-point of two intervals. If so, 
let 8 P „ be the interval which abuts on 8 V >, and so on: after a finite, or 

enumerable, set of such intervals 8 P ,, 8 P „, 8 P . we must arrive at an 

interval of which the end-point does not belong to G if the set of isolated 
points of G, or else at an end-point of the domain of G. It may happen 
that 8 Pl at its other end abuts on another interval; in that case we 
proceed, in the same manner as before, to find the intervals 8 qt , 8 g ,, ... 
each of which abuts on another one. Now conceive all the intervals 


* It was incorrectly stated by Schoenflies, in his Bericht , vol. 1 , p. 78, and also in the second 
edition of the present work (vol. I, p. 113) that a closed set is necessarily enumerable when each 
contiguous interval abuts on another one at both its ends. 
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8 P ,, 8 P „, ..., and, if they exist, 8^, 8 tf „, ... to be amalgamated with 8 Pl 
into one interval 8 Pl (1) , by removing all the common end-points. If any 
isolated points of G now remain, let 8 P2 be the first interval of {8} after 
8 Pl , of which an end-point is such a point; proceed as before, we then 
have an interval 8 1>2 (1) formed by amalgamating a finite or enumerable set 
of intervals. We proceed in this way, and thus form a set of intervals 
8 J>1 (1) , 8 Pa n) , ... no end-points of which are points of G t . 

Since G G, \ G { ' \ where G il) is the derivative of G, the set of intervals 
{8 (l) } complementary to G {1) consists of the intervals 8 Pi (1) , 8 P2 (n , ... and 
of any intervals {8} which remain after such intervals as 

8 P ,, 8„, , ... 8 0 ,, & Q ,, ... 

have been removed, and the 8 v a) substituted for the 8 P . 

We proceed in a similar manner with G il) ~ G t {1} 4- <7 (2) , again re¬ 
moving a finite or enumerable number of the set {8 (1) }, and again with 
(r (2) , and so on. It may happen that the process comes to an end after 
a number n of such stages, either if G t {n) does not exist, in which case 
G in) -- 6 Y(nfl) , and thus G (n) is perfect; or else, if G {n) does not exist, in 
which case G , being the sum of a finite number of enumerable sets S6V r) , 
is itself enumerable. If the process does not come to an end for any finite 
value of n , we form the derivative G {u)) - I) ( G n) , G i2 \ ... G in) , ...), which 
contains all the points common to all the derivatives of G of finite order. 
This set has been shewn, in § 68, to exist, and to be a closed set; G {UJ) is 
then resolved as before into G t (oi) f G {uJ: l) , and we proceed further as 
before. 

We obtain, by proceeding in this manner, 

G - G t 4- G}" 4- ... 4 G}^ 4- 6 y l (tofl) h ... 4- Qp' 4- 
where ft is a number of the first or second class. It will now be shewn 
that there must be some definite number ft of the first or second class, 
for which this process comes to an end, either by Gp' 1 containing no 
points, in which case G ip) — G ifi fl) , so that G^ ] is perfect; or else by G {fi+1) 
containing no points, in which case G, being the sura of an enumerable 
set of finite, or enumerable, sets, is itself enumerable. The {8} contain all 
the indices 1, 2, 3, ... n 9 ...; from these indices we must remove a finite, 
or an enumerably infinite number, to obtain those indices which occur in 
the {8 (1) }; and again an enumerable set of indices must be removed from 
those which occur in the {8 (1) }, to obtain those which occur in the {8 <2) }. 
Now as the indices 1 , 2, 3, ... n, ... are enumerable, the process of re¬ 
moving successively a finite, or enumerably infinite, set of them must 
cease for some order ft of 8 (£J) , for otherwise a more than enumerable 
infinity of indices could be removed from the set 1, 2, 3, ... n } ..., which 
is impossible; hence, for some fixed number ft of the second class, all the 
indices must have been removed. 
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It has thus been shewn that, unless the given set O is enumerable, for 
some number ft of the first or second class, G {U ~ G { ? +u \ and therefore 
G { * } is perfect. Thus G has been resolved into an enumerable set and a 
perfect one, which may be called the nucleus of G. 

If, for any value of ft, G ifi) = 0, the set G is enumerable. 

82. The following theorem*, more general than that of § 81, includes 
the latter as a particular case. The proof here given may be taken as 
alternative to that of § 81. 

If P 2 , ... P n , ... Pp , ... P a , ... are all closed linear sets of points such 
that (1), if a x < a 2 , all the points of P ^ belong to P ttl , and (2), if in any 
interval , any set P a contains only a finite number of points , the set P a+1 
contains no points in that interval ; then either Pp must vanish for some 
definite number ft of the first or second class , or else there is a definite number ft 
such that Pp is a perfect set. 

If, for some number ft, the set Pp vanishes, then P y vanishes for all 
values of y which are > ft. 

Let us now suppose that there exists no number ft such that Pp vanishes. 
In this case there exists a set of points, which may be denoted by P n , 
such that each point of the set belongs to Pp , whatever number ft may be. 
The set P a is closed, for if p be a limiting point of the set, in an arbitrarily 
small neighbourhood there are points of Pp , whatever number ft may be; 
hence p belongs to Pp , whatever ft may be, and thus p itself belongs to P (1 . 

It will now be shewn that P fi contains no isolated points, and is there¬ 
fore dense in itself. If P n contains an isolated point p , a neighbourhood 
of p can be found which contains no point of P n except p; let Q be that 
part of P x which is contained in this neighbourhood. In the neighbour¬ 
hood considered, let us suppose a sequence of intervals 8 lf 8 2 , ... 8 n , ... 
constructed, each one containing the next one and the point p , and such 
that 8 n converges to zero as n is indefinitely increased. Let Q (n) denote 
that part of Q which lies in 8 n but not in 8 n+1 , then 

Q - Q {1) + 4> (2) + ^ -f- Q {n) 4- ... + p. 

There must exist a number ft x , of the first, or of the second, class, for 
which Q {1) contains no point of Pp x ; otherwise Q (l) would contain points 
which belong to P n , and this is not the case. Similarly, there exist numbers 
Ao A*, ... ft n , ... such that Q i2) contains no points of Pp 2 , and Q l3) contains 
no points of Pp z , and so on. Of the numbers ft x , ft 2 , ... ft n) ..., let y x be 
the first which is > ft x , then let y 2 be the first which is greater than y x , 
and so on; we have therefore a sequence y Xi y 2 , ... y„, ... of increasing 

numbers all of which belong to the set ft x , ft 2 , ... ft n ,_ This sequence 

7i> yn ••• 7n> ••• is either finite, with say y as the last, or else there is a 

* See Baire, AnncUi di Mat. (3), vol. in (1899), p. 46 et seq. 
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limiting number y of the second class which is greater than all of them, 
and therefore greater than all the numbers ft, ft, ... ft, — The se ^ 0 
can have no point, except p , which belongs to P y , hence, since P y contains 
only one point in a certain interval, P y + X contains no point in that interval, 
and does not contain p\ which is contrary to the hypothesis. 

It has now been shewn that P fi is closed and dense in itself; it is 
therefore perfect. Let us next consider the enumerable set of intervals 
which are complementary to P n . For any one of these intervals there 
exists a number y such that P y contains no point in the interior of the 
interval. As before, it is seen that there exists a number, of the first or 
the second class, which is greater than all these numbers y; if this number 
be ft the set P^ contains no points which do not belong to P a . It is thus 
seen that Pp is perfect, and P p = P fi+ 1 = ... == P n . The theorem has now 
been completely established. 

If, in the above theorem, the condition (2) be omitted, the set P 0 is 
closed but not necessarily perfect. The set P p , as before, contains no 
points that do not belong to P n ; it then follows that 
P0 = Ppil = Pfl+ 2> “ . 

We can thus state the following theorem: 

If P x , P 2 , ... P n , ... Pp , ... P a , ... are all closed sets stick that any one 
contains all the others with a higher index; then, either Pp vanishes for some 
definite number ft of the first , or the second , class , or else there is a definite 
number ft, from and after which all the sets are identical. 

83. Every perfect linear set* has the cardinal number c of the continuum; 
and every closed infinite set has the cardinal number c, or else the cardinal 
number a of the rational numbers. 

Let the intervals whose internal points are the set C ( G ), the comple¬ 
ment of the perfect set G, be denoted by {8}; and let A denote the greatest, 
or one of the greatest in case of equality, of the intervals {8}. Let l, the 
whole interval (a, b) in which G lies, be divided into the three parts 
4, A, 4 so that Z = l Q + A -j- Z x , where l 0 is on the left, and Zj on the right 
of A, the greatest interval of {8}. Denote the greatest of the intervals {8} 
in Z 0 , by A 0 , and the greatest in 4? by A x ; then the interval Z 0 is divided 
by means of A 0 into three parts 4o > Ao, l 01 in order from left to right, and 
the interval l x is divided by means of A x similarly into Z 10 , A l5 Z n . Pro¬ 
ceeding in this manner to a further subdivision, let A pq be the greatest of 
the intervals {8} which lie in l pqi where p, q each has one of the values 
0 or 1; then l PQ is divided into three parts l PqQ , A„ a , l PQl ; and so on in¬ 
definitely. The intervals {8} are thus arranged in the order 
A, Aq, A x , Aqq, A oi, A 10 , A u , ... 

* Cantor, Math. Annalcn, vol. xxiii (1884), pp. 486-488. 
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and each interval of {8} occurs at a definite place in the sequence. Con¬ 
sider a sequence of intervals Z, l P , l pa , l VQr , ..., where p , q , r, ... all have 
definite values, each of which is either 0 or 1. Each of these intervals is 
contained in the preceding one, and has one end-point in common with it; 
and the sequence determines a single point P which is interior to all the 
intervals of the sequence, unless, from and after some fixed index, all the 
indices are identical, in which case P is a common end-point of all the 
intervals after a fixed one. Hence, since the point P is not interior to any 
of the intervals {8}, it is a point of G. Conversely, every point of G can 
be so determined by means of a sequence of intervals; for every point 
of G belongs either to Z 0 or to , and also to one of the four intervals 
/oo, Z 01 , Z 10 , Z n , and so on. The point P is the limiting point of the end¬ 
points of the intervals A pq , A var , ... with the indices the same as those 
of the sequence l v , l pq , l PQr , ... which determines the point. 

Every number of the continuum ( 0 , 1 ) is expressible in the dyad scale 
by means of a sequence -p, - 7 * 7 , *yx/r, ..., where each of the numbers 
p, q , r , ... is either 0 or 1 ; and all numbers are expressed uniquely in this 
manner, except those for which all the digits after some fixed one are 1 , 
these numbers being also expressible by a sequence in which only 0 occurs 
after some fixed place. The numbers last mentioned correspond as indices 
of l p ,l PQ ,l pgr , ... to a point of G which is an end-point of one of the 
intervals { 8 }; but in every other case a number in the dyad scale corre¬ 
sponds to a point of G which is not an end-point of the intervals { 8 }. 
{Since the set of numbers of the continuum ( 0 , 1 ) has the cardinal number c y 
it follows that the points of G form a set of the same cardinal number, 
because each point of G corresponds uniquely to a single number of the 
continuum, except that two points of G which are end-p.oints of one con¬ 
tiguous interval correspond to a single number of the continuum. Every 
closed set which is not enumerable has been shewn to contain a perfect 
set as component; such a set has therefore the cardinal number c. 

It will appear from the theory of order-types which will be discussed 
in Chapter iv that the set of intervals { 8 } which define a perfect set G, 
when taken in their order of position from left to right, have an order-type 
which is the same as 17 the order-type of the rational numbers which lie 
between 0 and 1 , excluding 0 and 1 themselves, taken in their natural 
order in the continuum. Tt follows that a correspondence can be estab¬ 
lished between the intervals and the rational numbers, in which any two 
intervals correspond to two rational numbers that have the same order. 
If we take each rational number to correspond to the end-points of the 
corresponding interval, then each irrational number corresponds to a 
point of G which is a limiting point of end-points of intervals. 
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EXAMPLES 


L Let :r be a number given by x - J ^ -4 ... \ -J 


where the numbers 


3 3 2 3 3 3 n 

c,, c 2 , ... r M , ... have each one of the values 0, 2. The set J.rJ is a non-dense perfect set. 
No number of the set lies between 


c x r 2 0 2 2 

3 + 3 2 + "’ f 3 w+ 3 n + 1+ 3 7, + 2 


a + s- " 


i 

4 - 

3 W 5 


and 


r i c 'i 
- 1 + z + . 
3 :p 


3 n ’ 


these two numbers determine a complementary interval of the set, the interval being of 
length . ’J'he number of complementary intervals of length is 2 n ~ 1 , hence the sum 

0C on—1 

of all the complementary intervals is 2 , which is unity. It is clear that the set of 

v l 3 n 

complementary intervals is everywhere dense, and thus the set of points is non-dense. 
This example was constructed* by Cantor, and is the first example of a perfect non-dense 
set which has been purposely constructed. 


2. Let us suppose that the numbers of the interval (0, 1) are expressed in the dyad 
scale, in the form a x a. z a A ... a n ...; where each a is either 0 or 1. Each number for which 
the all vanish, after some fixed one <* n , which must be 1, is also representable as an 
unending radix-fraction, in which a n is 0, and all the subsequent digits are 1. Let the 
numbers now be interpreted as if they were in the decimal scale. To each irrational number 
in the dyad scale, there corresponds a single number in the decimal scale, represented by 
the same digits. Of each rational number, not represented by a recurring radix-fraction, 
there is a double representation in the dyad scale, and there correspond two numbers in 
the decimal scale, which define a complementary interval of the set of points which re¬ 
presents the numbers in the decimal scale. A perfect non-dense set of points is thus defined. 


3. Taking a positive integer m (>2), let the interval (0, 1) be divided into w equal 
parts, and exempt the last part from further subdivision. Divide each of the remaining 
m 1 intervals into w equal parts, and in each case exempt the last part from further 
subdivision. Let this operation be continued indefinitely. The points of division form a 
non-dense set; for if an interval d be taken anywhere in the interval (0, 1), k may be so 

chosen that f, and a segment ( a ut entirely within d , (ran be determined. 

m 1 ' 2 \m k / 

This segment is either an exempted interval, or its tn th part is one. The end-points of the 
intervals, together with their limiting .points, form a non-dense closed setf, of cardinal 
number c. 


4. As inf Ex. 3, let the interval (0, 1) be divided into m equal parts, and the last be 
exempted from further division. Then let the remaining m ~ 1 parts each be divided into 
m 2 equal parts, the last of each being exempted from further division. Let the remaining 
parts be then divided into m 3 equal parts, the last of these in each case being exempted 
from further division. If this process be carried on indefinitely, the end-points of the 
divisions, together with their limiting points, form a non-dense closed set, of cardinal 
number r. 


* See Math. Anvalen , vol. xxi (1883), p. 590. 

+ See H. J. S. Smith, Proc. Lond. Math. Soc. (1), vol. vi (1875), pp. 147, 148. 
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5. Let k lf k 2 , ... k n , ... be a sequence of positive integers, each of which is greater than 
unity, and defined according to any law. 

It can be shewn* that every irrational number x , in (0, 1), can be uniquely represented 
in the form 

Ci Co C n 

x = ? + ,j + , +..., 

k x k x k 2 k x k 2 ... k n 

where c n < k n , and not all of the numbers c„, c n+1 , ... are zero, for any value of n. 

It can further be shewn that 


_ | _ _ V2 _ _ Vn 

k* ~k,k 2 - k,k a ... k n **” 

where r} n — k n - 1 - c n . If, from and after a certain value of n, the condition c n = k n - 1 is 
always satisfied, then all the -r\ n vanish, and x is rational. It thus appears that these rational 
numbers are capable of a double representation in the form 


Cl C« Cm | 

x — i + j / + ... + . — j -y- 

ki l\k 2 kjk^ ... k n 




(1) by the vanishing of all the c, after some fixed one, and (2) by the condition c n =fc n -1 
being satisfied, from and after some fixed value of n. 


If we now take those values of x y for which every c does not exceed some fixed integer A, 
these values of x form a non-dense perfect set G k . It is easily seen that the interval of 
which the end-points are 


and 


Cj c 2 c n A A 

k x k 2 kjk 2 ... k n k x k 2 ... k n+1 k x k 2 ... k nJrt 

c i , c 2 , , c n + ^ 

k t k^k^ "' k x k 2 ... k n 


+ ... 


contains no points of the set in its interior, although these points belong to the set. 


A particular case of this set consists of the numbers given by 


. c i , C 2 
10 ‘ 


+ iOi.a.s + — + ]0n: 


where every c is ^ 9. This set consists of the transcendental numbers first defined by 

Liouvillef. 


PROPERTIES OF THE DERIVATIVES OF LINEAR SETS 

84. If a set is dense in any sub-interval of the interval in which it is 
contained, its derivative G ll) contains every point of the sub-interval, and 
is identical, so far as such sub-interval is concerned, with the totality of 
the points of the sub-interval; we confine ourselves therefore to the case 
in which G is a non-dense set, and consequently its derivatives are also 
non-dense. 

The derivatives of transfinite orders have been defined in § 68; and it 
was there shewn that there is either a last derivative, whose order is some 
number of the first class, or non-limiting number of the second class; or 
else that derivatives of all such orders exist, and have a set of points $ <0) 
in common. 

* Broden, Math. Ann. vol. li (1899), p. 299. 
f lAouville’e Journal , vol. xvi (1851), p. 133. 
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It was shewn in § 68, that G ll) being a non-dense closed set, two cases 
arise: 

(1) If G {1) is enumerable, in which case G is also enumerable, then (r (/3) 
vanishes for some number of the first or the second class. A set with 
an enumerable derivative is called a reducible set. 

(2) If G (l) is not enumerable, then there exists some number ft, of the 
first or second class, for which (7 (8) is a perfect set, and is consequently 
identical with G ifi + l) , and with G im as defined in § 68. The set G {1) is the 
sum of an enumerable set and the perfect set G i$) . A set G which has this 
property, is said to be irreducible. 

It should be observed that, when G (1) is unenumerable, and conse¬ 
quently of cardinal number c, the same as the cardinal number of its 
perfect component, we are unable to make any inference as to the cardinal 
number of G itself. This may be a or c, or other cardinal number between 
the two, in case such a number exists. 

CLOSED SETS IN TWO OR MORE DIMENSIONS 

85. In considering the properties of closed sets of points in space of 
two or more dimensions it is sufficient to treat in detail the case of plane 
sets only, because these sets exhibit sufficiently clearly the respects in 
which linear closed sets differ from other closed sets. The results obtained 
for plane closed sets can immediately be extended to the case of such sets 
in any number of dimensions. In the case of a linear set, each point P 
which does not belong to a given closed set is enclosed in an open interval 
which contains no points of the set, and this interval has a maximum 
length in both directions, the end-points of such maximum interval 8 
being points of the closed set (or an end-point of the fundamental interval), 
and this maximum interval is identical with 8, for all points interior to 8. 
But, in the case of a plane set, if we confine ourselves to areas of given 
form and orientation, such as rectangular cells, and these take the place 
of the linear intervals 8, it is not the case that a closed set is defined as 
the set of boundary points, together with the external points, of a unique 
system of such cells. 

If P be a point which does not belong to a given plane closed set G t 
in a fundamental cell, and if we draw through P a straight line parallel 
to the line whose equation is y = mx, then those points of the given set 
which lie on this straight line form a closed set (see § 56), and the* point P 
must be interior to an interval A m (P) contiguous to this closed set. If, 
on either side of P, there are in this straight line no points of G , then on 
this side the extremity of the interval A m (P) may be regarded as the 
point in which the straight line intersects a side of the fundamental cell. 
The interval A m (P) exists for every value of m, and the extremities of 
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the interval, for a fixed P, are points of the plane set, or points on the 
boundary of the fundamental cell. The region of plane space A p , defined 
by all these intervals, for every value of m, when that straight line is 
included for which x has a constant value, is free in its interior from 
points of G. Such a region A m (P) may be regarded as the analogue, for 
plane sets, of the interval 8 (P ) for the case of linear sets. But the analogy 
is not complete, for if Q be an interior point of A w (P), it is not necessarily 
the case that A rn ( Q) is identical with A m (P). 

If however we work only with rectangular cells, there exists in general 
no unique rectangular cell, corresponding to a point P of 0 ( G ), which is 
such that no interior point of it is a point of G, and that on its boundary 
there is a point of G. If we describe a square cell of sides 2p with its 
centre at P, and containing in its interior and on its boundary no point 
of G , we may keep three of the sides fixed, and move the fourth parallel 
to itself until it either contains a point of P, or becomes coincident with 
one of the sides of the fundamental retd-angle. We may then proceed to 
move another side parallel to itself until the same thing happens, and so 
on with the other two sides. The resulting rectangle will in general depend 
upon the order in which the moving of the four sides takes place. Thus we 
obtain, in general, no unique rectangular cell corresponding to the point P. 

86. It is however possible, for a given closed non-dense plane set G , 
to construct an enumerable set, not unique, of cells which is everywhere 
dense, and such that every point of G lies on the boundary of a rectangle, 
or is a limiting point of points which lie on the boundaries of such cells. 
Let us denote by S the fundamental cell in which the whole set G lies, 
and let 8 be a cell constructed, as above, for a point P of the set C (G). 
Produce the sides of 8, when necessary, until they cut the sides of S 9 thus 
dividing S into at most nine different rectangles, of which one is 8, and 
the others may be denoted by S ri where r — 1, 2, ... 8. In each rectangle 
S r take any point P r which does not belong to G , and construct for P r a 
free rectangle 8 r , as before; let the sides of 8 r be produced, when necessary, 
until they meet the sides of 8 ri then S T is divided into at most nine rect¬ 
angles, which consist of 8 r and at most eight rectangles S rs , where 
s ^ 1,2, ... 8. 

Proceeding in this manner, we obtain a set of rectangles 

S, S r , S T$ , & 1 rs i , ..., 

and in them a set of rectangles 8, 8 r , 8 rs , 8 rst , ..., each of which contains 
no points of G in its interior; each of the numbers r, s, t } ... being one of 
the digits 1, 2, 3, ... 8. If p be a point of G which is not on a boundary 
of any rectangle 8 n , it must be in the interior of each of an unending 
sequence of rectangles S r , S r8 , 8 rat , ..., where r, $, t, ... have definite values; 
and this set of rectangles must converge either (1), to a point in the interior 
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of all of them, or (2), to a linear interval, or (3), to a definite rectangle S w 
in the interior of all of them. In case (1), the point to which the rectangles 
converge is a limiting point of those points of G which lie on the boun¬ 
daries of the definite sequence of rectangles 8 r , 8 rs , $ rst , _ In case (2), 

there must be, in the limiting linear interval, at least one point which is 
a limiting point of G: for, if not, the whole interval could be enclosed in 
a rectangle which contains no points of G\ and this is impossible. In 
ease (3), we start with the rectangle and take a point not belonging 
to G inside it, which point exists, since G is non-dense, and construct the 
maximum free reel-angle 8 W . Produce its sides as before to meet those of 

, and proceed as before to construct 8^ rst - a nd 8 wrs/ _ This process 

can be continued until an index is reached which may be any number of 
the second class, but the point p must be readied before some definite 
number of the second class appears as index; this following from the fact 
that the number of non-overlapping regions which are contained in a 
given space must be enumerable. Thus the point p is reached after an 
enumerable set of steps of the process. 

It has therefore been shewn that: 

If G is a non-dense closed plane set of points t an everywhere dense enu¬ 
merable set of rectangles crisis, such that every point of G is on a boundary 
of one or more of the rectangles , or is a limiting point of such points , or lies 
in a linear interval which is the limit of a sequence of the rectangles. 

In case the set G is perfect , the rectangles of the set must either not abut 
on one another , or every common side must contain either no points of G, or 
else a perfect set of points of G. 

87. That a perfect plane set G has the power of the continuum* may 
be proved bv projecting the points of G on a side of the fundamental 
rectangular celt. The linear set of points which are the projections of G 
is a closed set. For if P be a limiting point of the set of projects, let pp' 
be an arbitrarily small neighbourhood of P (on the side), of which P 
is the centre; construct straight lines PQ , yxp pq\ perpendicular to pp', 
meeting the opposite side of the fundamental rectangle in Q , q , q '. In the 
rectangle jjqq'p there are an infinite number of points of G. Let a sym¬ 
metrical system of nets, with closed meshes, be fitted on to this rectangle 
pqqp \ the number rn (see §51), whose square is the number of meshes 
in D 1 , being taken to be odd. There is in at least one mesh of D x an 
infinite number of points of G in the interior or on one of its boundaries 
parallel to pp '; and one such mesh at least must exist with its centre on 
PQ ; for otherwise P could not be a limiting point of the projection of G . 
We take that one d Y of such meshes which is nearest to pp'. Similarly 

* See Bendixson, Bib. Bvensk. Vet. Handl. vol. ix (1884), where the first proof of this theorem 
was given. 
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there exists a mesh d 2 , of X> 2 , contained in d r , with its centre on PQ , and 
containing an infinite set of points of G. Proceeding in this manner, we 
obtain a sequence d lf d 2 , ... of meshes each containing the next, and all 
having the same property as d x . These define a point on PQ which must 
be a limiting point of G , and therefore belongs to G. Therefore P is a 
point of the projection of G; and thus the projection of G is a closed set. 
If a point P be an isolated point of the projection of 6r, we may as before 
construct a rectangle pqq'p' which contains no points of G that do not lie 
on PQ. The component of G in this rectangle must be perfect, and there¬ 
fore the straight line PQ contains a perfect component of G. If the pro¬ 
jected set is perfect, then it has the power c of the continuum; and if it 
contains isolated points, these must be the projections of perfect linear 
components of G ; therefore, in either case, G has the power of the con¬ 
tinuum. 

It is clear that this method of proof can be extended to the case of 
a set in any number of dimensions. 

THE ANALYSIS OF SETS IN GENERAL 

88. With a view to the general analysis of sets of points in any number 
of dimensions, it is necessary to classify the points of a given set according 
to the cardinal number of those points of the set that are in the arbitrarily 
small neighbourhoods of the various points*. 

An isolated point of any set G is such that in a sufficiently small neigh¬ 
bourhood of the point there are no other points of G. For this reason an 
isolated point may be said to be of degree zero in the set. 

A limiting point of G, such that in every sufficiently small neighbour¬ 
hood of the point there is an enumerably infinite set of points of G , is 
said to be a point of enumerable degree in the set , or of degree a in the set. 
Such a point may or may not belong to G. 

In case the point is such that, in every neighbourhood of it, there 
exists a set of points of G that has the cardinal number of the continuum, 
the point is said to have degree c in the set. 

As we are not entitled to assert that every infinite set of points has 
either a or c for its cardinal number, we contemplate the existence of sets 
having a hypothetical cardinal number x different from a ore. A point 
such that, in every sufficiently small neighbourhood of the point, there 
existed a set, of cardinal number x, of points belonging to G, would be 
termed a point of degree x in the set G. 

* An analysis of this kind, for sots in general, was given by Cantor, Acta Math. vol. vn (1885), 
p. 105. A more elementary presentation of the matter, without the use of transfinite numbers, 
has been given by W. H. Young, Quart. Joum. of Math. vol. xxxv (1904), p. 102. 
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A point P, whether it belongs to G or not, is said to be a point of 
unenumerable degree in G if, in every neighbourhood of the point, there 
exists an unenumerable set of points which belong to G . Such a point has 
been termed* by Lindelof a 'point of condensation. As however this ex¬ 
pression is employed by some writers to denote a limiting point, it is better 
avoided. The set of all such points P (belonging to G or not) is closed, for 
any limiting point clearly belongs to the set. It is obvious that, if G be 
enumerable, it contains no points of unenumerable degree in G, for every 
part of an enumerable set is enumerable (finite or infinite). 

Conversely it may be shewn that: 

If no jx)int of G is a poini of unenumerability in G, the set G is enu¬ 
merable. 

Let a system of nets be fitted on to the finite, or indefinitely great, 
cell in which G is contained. Any point P, of G, is defined by a unique 
sequence of meshes of the successive nets of the system. There must be 
a value nj> of n , from and after which all the meshes {d n } of the sequence 
which defines P contain only an enumerable set of points of G. Thus, to 
each point P of G , we can correlate one mesh d np ; but the same mesh 
may be correlated with more than one point P. Taking all the meshes 
d np , for all the points P of G, these form an enumerable set, being part 
of the enumerable set of all the meshes of the system of nets. Since each 
mesh d np contains only an enumerable set of points of G, it follows (§ 58) 
that G is an enumerable set. 

89. Every set G, not enumerable , is the sum of an enumerable set (possibly 
absent) and a set in which each point is of unenumerable degree in the set , 
and which is therefore dense in itself. 

Thus G = H -j- K , where H is enumerable, and K is such that each 
point is a point of unenumerable degree in G. Since every point of K has 
in its neighbourhood an unenumerable set of points of G, and therefore 
of K , the set K is dense in itself. To prove the theorem, let H be that 
part of G , each point of which is a point of zero, or of enumerable, degree 
in G ; it is then clear that H must be a set in which every point is of zero, 
or of enumerable, degree in H ; therefore H is enumerable. The set 
K = G ~ H is then unenumerable because each point of it contains in its 
neighbourhood an unenumerable set of points of G, and therefore of K. 

It is clear that every limiting point of a set of points of unenumerability 
is itself such a point. Since K is dense in itself, it is contained in its de¬ 
rivative K' ; and as K' has no isolated points, it is perfect. 

If G is a closed set, it must contain all the points which are of 


See Comptes Bendus , vol. cxxxvii (1903), p. 697. 
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unenumerable degree in it; thus K' and K are identical, and therefore K 
is perfect. Therefore we have the theorem : 

A closed set is the, sum of an enumerable set and a perfect set , either of 
which may be absent. 

The proof of the theorem here given is due to Lindelof (loc. cit .). 

For the special case of linear closed sets this theorem has already been 
proved in § 81. But the following theorem may here be added*: 

A linear closed set is the sum of an en umerable set and a set which is dense 
in itself , such that each point of the latter set is a limiting point on both sides. 

For, a point of K that is a limiting point only on one side is also a 
point of K' which can only be a limiting point of K' on one side; moreover 
the set of all such points of K\ viz. the extremities of the intervals con¬ 
tiguous to it, is enumerable. Therefore those points of A" that are limiting 
points of K on one side only form an enumerable set; and if these be 
removed from K we have a set K A the points of which arc? limiting points 
of A, and therefore also of K 1 , on both sides. 

It is asserted in the first, theorem given above that every unenumerable 
set contains a component that is dense in itself. An enumerable set may 
contain such a component, and if it docs, its derivative contains a perfect 
set, and thus the enumerable set is irreducible. 

It has been proved by Sierpihskit that every set of points can be 
decomposed into the sum of two sets, the first of which contains no com¬ 
ponent that is dense in itself, and is effectively enumerable, and the 
second of which is dense in itself. Either set may be absent. 

90 . If G is an unenumerable set , those points of G which are of the sam,e 
degree x (> a) in G form a set which is dense in itself, and of which the cardinal 
number is h x. 

Let us fit on a system of nets with closed meshes to the unbounded 
space in which G is contained. Consider any point P that is of degree 
x in G ; P is in a mesh d n (x) in which the set of points of G is of degree x \ 
n may have the smallest value for which this is the case. When all such 
points P are taken, the meshes d n (x) corresponding to them form an 
enumerable (or finite) set, a part of the enumerable set of all the meshes 
of the system of nets. In each of these meshes d n (x) there is a set of 
points of G of cardinal number x. It will be shewn in Chapter iv, § 155, 
in connection with the general theory of cardinal numbers, that an enu¬ 
merable set of aggregates of points, each of cardinal number x, cannot be 
assumed to have the cardinal number x, unless x — c. Since all those points 

* W. H. Young, Quart. Journ. of Math. vol. xxxix (1908), p. 76. 
t Fundamenta Mat. vol. i (1920), p. 1. 
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of G that arc of degree x in the set are contained in the enumerable set of 
meshes so obtained, it follows that the set of such points of G has a cardinal 
number a x, except that it certainly has the cardinal number c, in case 
x = c. 

To shew that the set is dense in itself, we observe that, in a sufficiently 
small neighbourhood (a, /?) of P x , there is a set of points of <7, of cardinal 
number x ; and that none of these points can be of degree in G higher 
than x. For if there were such a point Q of degree higher than x, in some 
interval (a, ft') contained in (a, ft) and containing Q in its interior, there 
would be a set of points of G of cardinal number higher than x\ but this 
is impossible, as all these points would be in (a, ft). Again, the points of G 
in («, ft), other than P Xi cannot be all of degree lower than x ; for if they 
were so, their cardinal number would be lower than x, since they could 
be enclosed in an enumerable set of non-overlapping intervals with P x as 
sole external point. Moreover, since, in any arbitrarily small neighbour¬ 
hood of P X1 there are points of the same degree x in G, P x is a limiting 
point for such points. Therefore the set of points such as P x is dense in 
itself. 

The set of points of degree c is dense in itself, and of cardinal number c. 

91. Any set G consists of isolated points which form an enumerable 
set called the adherence of G, and of limiting points which form a set called 
the coherence of G. Denoting the adherence and the coherence of G by 
Ga, Gc respectively, we have G = Ga 4- Gc. 

The set Gc can in a similar manner be split up into its adherence and 
its coherence, which we denote by Gca and Gc 2 respectively; thus 

Gc = Gca + Gc 2 . 

The set Gca is an isolated set, and therefore enumerable; and if we proceed 
to resolve Gc 2 in a similar manner into its adherence Gc 2 a, and its co¬ 
herence Gc 3 y and then to resolve 6'c 3 , it is clear that the process may be 
continued any number n of times. We thus obtain 

G = Ga + Gca + Gc 2 a f ... 4- Gc*-*a 4- Gc\ 

The set Gc n ~ l a may be named the adherence of G of order n, and Gc n 
may be denominated the coherence of G of order n. 

It may happen that, for some value of n, Gc n vanishes; in that case 
G has been split up into a finite number of enumerable sets, and is conse¬ 
quently itself enumerable. If this be not the case, the process may be 
continued indefinitely, and Gc n then exists for every value of n. We then 

define D (G, Gc, Gc 2 , ... Gc n , ...), 

the set of points common to all the coherences of G, to be the coherence 
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of order co, and denote it by Gc”. It is clear that every point of G which 
does not belong to one of the sets Gc n-1 a, belongs to Gc ", hence we have 

G = Z,Gc nl a + Gc 

the summation being taken for all values of n belonging to the first class. 

We now split up Gc” into its adherence Gc ,u a, and its coherence Gc ” +1 , 
and proceed further to obtain the adherences and coherences of G of the 
orders of the various numbers of the second class. If a ly a 2 , ... a n , ... is 
a sequence of numbers of the second class, which has for its limit, the 
coherence of order ft is defined by 

Gc 8 - D (Gc-*, Gc% ... Gc l », ...). 

We now obtain a resolution of G of the form 
G = ZGcPa 4- Gc?, 

where y is any number of the first or second class, and the summation 
refers to all values of p which are less than y. Each adherence Gc v a is 
an isolated set, and therefore enumerable; and if G contains a component 
which is dense in itself, this component is contained in Gc?. 

First suppose G to be an enumerable set; the process of analysis must 
then cease for some number y of the first or second class. For if Gc v a 
existed for every number y of the second class, we should have obtained 
an unenumerable set of adherences containing no points in common, and 
all belonging to G: thus G could not be enumerable. 

The cessation of the process may take place in two different manners: 

(1) , if for some number y of the first or second class, Gc ? = 0, G has 
been resolved into an enumerable set of adherences, and it contains no 
component which is dense in itself: 

(2) , if for some number y, 

Gc?o — 0, 

in which case Gc? = Gc? +1 , the set Gc? then contains no adherence, and 
every point of it is a limiting point, and Gc? is therefore dense in itself. 
The set G has consequently been resolved into an enumerable component 
which contains no part that is dense in itself, and into a set which is 
enumerable and dense in itself. 

Next, let us suppose that G is an unenumerable set. Then it has been 
shewn that those points of G which are of unenumerable degree in G form 
a set that is dense in itself; and those points which belong to the ad¬ 
herences of all orders are points of zero, or of enumerable, degree, and 
thus form an enumerable set. It follows, since all points that do not 
belong to that part of G which is dense in itself belong to the adherences, 
that the number of adherences must be enumerable; and thus that, for 
some number y of the first or second class, Gc? is dense in itself. The set 
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G& may consist of an enumerable set dense in itself, and of sets of higher 
cardinal numbers dense in themselves. 

It has thus been shewn that any set G may be represented by 
G=U+V a + XF x + F c , 

where U is an enumerable set which contains no component that is dense in 
itself , V a is an enumerable set of points of degree a dense in itself , V c is a 
set , of cardinal number c, consisting of points of degree c, dense in itself; 
V x is a set dense in itself consisting of points of degree x , where a < x < c. 

If, as is probable, no cardinal numbers exist between a and c, the 
sets V x can be omitted. A set such as F 0 , or V x , or V 0 is denominated a 
homogeneous set of degree a, or x , or c, in the set G . 

If G is a closed set, then, as has been shewn in § 89, V e is perfect, and 
X V x cannot exist. 

INNER AND OUTER LIMITING SETS 

92. Let S lf S 2 , ... S ny ... be a sequence of sets of points, in one or 
more dimensional space, such that each set $ n . fl of the sequence is con¬ 
tained in the preceding one S n , then the set S uy or D (S ly ... S n , ...), 
consisting of points each of which belongs to all the sets of the sequence, 
is said, when such set exists, to be the inner limiting set of the sequence 
of sets. A point which does not belong to S n , S n+1 , ... but belongs to 
is said to be shed at the index n. 

When S ly S 2 , ... S n9 ... is a sequence of sets, each one of which S n is 
contained in the next $ n+1 , the set S u , or M (S lf S 2 , ... S n , ...), which 
consists of the set of those points each of which belongs to all the sets, 
from and after some value of n dependent on the particular point, is said 
to be the outer limiting set of the sequence. 

If any sequence Xj, X 2 , ... X n , ... of sets be given, and we take 8 t — X x , 
S 2 = (Si, X a ), £ 3 = D (Xj, X 2 , X 8 ), and in general = D (X x , X 2 ,... XJ, 
the inner limiting set $«, when it exists, also defines D (X 1? X 2 , ... X m , ...), 
and may be regarded as defined by the given sets X x , X 2 , .... 

For a point that belongs to all the sets X!, X 2 , ... X n , ... belongs to all 
the sets S ly S 2y S 3> ... S n , ...; and a point that belongs to all the latter 
sets belongs to all the former sets. 

Thus it has been shewn that the set of points common to all the sets of 
any given sequence of sets is an inner limiting set . 

Again, if we take 8 X — X l5 $ 2 = M (X x , X 2 ), S 3 = M (X x , X 2 , X 3 ), ... the 
outer limiting set S w also defines M (X 1? X 2 , ... X n , ...), and thus the set of 
alt points that belong to one at least of the sets of any given sequence of sets 
is an outer limiting set . 
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If S„ be the inner limiting set of the sequence S 1 , S 2i S 3 , ... of sets, 
each of which contains the next, the sequence C (SJ, C (S 2 ) f C (S 3 ), ... of 
the sets complementary to the first sequence with respect to a cell (or 
interval), whether finite or not, which contains them all, is such that each 
set is contained in the next, and their outer limiting set is C (SJ). 

It has been shewn in § 67 that, when the sets , S 2 , ... are all closed 
sets, the inner limiting set always exists, and is a closed set. 

Every set of points can be exhibited as the inner limiting set of a sequence 
of sets each of which contains the next; and also as the outer limiting set of 
a sequence of sets each of which is contained in the next. 

Let a set G be contained in the interior of a cell (a (1) , a (2> , ... a {J)) ; 
b (1) ,b {2 \ ... b iv) ); and let Go be the component of G that is contained in 
the cell (a (1) , a < 2 >, ... a<*»; 6b a \ 06< 2 >, ... 6b <*>), where 0 < 6 < 1. 

If 0 lf 0 2 , ... 6 n , ... be an increasing sequence of values of 0 that con¬ 
verges to 1, the sets G 0}1 Ge z , ... will be such that each one is contained 
in the next, and their outer limiting set is the given set G. 

Also G is the inner limiting set of the sequence of those sets which are 
complementary to the sets of a sequence for which C (G) is the outer 
limiting set. 

In case G is unbounded, it may be placed in correspondence with a set 
interior to the cell (— 1, 1), and the theorem therefore holds for G , since 
it holds for the set that corresponds to G. 


SETS OF THE FIRST, AND OF THE SECOND, CATEGORY 

93. The outer limiting set of a sequence of non-dense sets G x , G 2 , ... G n , ... 
each of which contains the preceding one is said to be a set of the first category. 

This is equivalent to the definition given* by Baire, that, if P l9 P 2 , ... 
P n , ... be any sequence of sets, non-dense in the fundamental interval, 
or cell, in which they are all contained, the set M (P 1 , P 2 , ... P n , ...), con¬ 
sisting of all the points that belong to any of the given sets, is of the 
first category. For we have only to take G n = M (P ls P 2 , ... P n ); and G n 
is non-dense. The set P l9 P 2 , ... P n , ... can always be replaced by non- 
dense sets, Q lt Q 2 , Q 3 , ... no two of which have a point in common; for 
we may take 

Qi ~ P\ > $2 ~ Pi ~~ D (Pi, P 2 ), (? 3 = P 3 — 1) (Q t , P 3 ) — D (Q 2 , Pg), .... 

Any set which is not a set of the first category is said to be of the second 
category. 

* Annali di Mai . (3), vol. m (1899), p. 65, where the distinction between sets of the first and 
second categories was first introduced 
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It is clear that a set of the first category is enumerable if the sets 
G\, (J 2 , ... G n , ... are all enumerable, and that every enumerable set is of 
the first category. 

An enumerable set of the first category may be everywhere dense. 
For example, the set of rational points in a given cell, or interval, (a, b) 
is of the first category, and is everywhere dense. 

A set of the first category may have the cardinal number of the con¬ 
tinuum, and may then be everywhere dense, or non-dense. 

A set of the first category is not necessarily closed. As an illustration 
of this fact, let us consider a set of cells P X Q X , P 2 Qz * ••• ••• each of 

which contains the next, and such that the sequence converges to a single 
point p interior to all the cells of the sequence. Let G\ consist of a set of 
points on the boundary of P X Q X \ let G 2 consist of G\ together with a set 
of points on the boundary of P 2 Q 2 ; and generally, let G v consist of G n ^ x 
together with a set of points on the boundary of P n Q n . The point p does 
not belong to G n for any value of n, and therefore does not belong toG r w , 
but it is clearly a limiting point of G u) ; and therefore G u) is not a closed set. 

To illustrate the fact that an unenumerable set of the first category 
may be everywhere dense, fit on to the cell, or interval, (a, b) a system of 
closed nets. Let G\ be a non-dense perfect set in (a, 6); in each mesh 
of D l place a non-dense perfect set, and let G 2 be the perfect set made up 
of all such non-dense perfect sets, together with G\. Let G 3 consist of G z 
together with non-dense perfect sets placed in all the meshes of Z) 2 ; and 
so on. 

The set of the first category defined by G l , f/ 2 , ... is clearly everywhere 
dense; for any sub-cell contains a mesh of I) n , for some sufficiently large 
value of n, and thus contains points of G n , and therefore of G u . 

In case is closed, it must be non-dense, for it will be shewn that 
G (GJj is everywhere dense, and therefore cannot contain a cell, or 
interval. 

94. A set of points which is complementary/ to a. set of the first category 
is a set of the second category. 

In particular, a cell, or an interval, is not the sum of two sets of the 
first category. 

To prove this theorem, we shew, in the first place, that the set com¬ 
plementary to a set of the first category is everywhere dense. For let 
(a, /3) be any cell, or interval, in the fundamental cell, or interval, in which 
the set of the first category is contained. There exists a cell, or interval, 
(«i, &) interior to (a, /?), in which there is no point of the non-dense set G l . 
Again (cq, fi x ) contains in its interior a cell, or interval, (a 2 , p 2 ) containing 
no points of G 2 ; and so on. 
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There exists then a point interior to all the cells, or intervals, 

(a, p) (Hj, ft) (a 2 , ft).K, ft). 

which is not a point of 6r w . Hence the set complementary to G„ is every¬ 
where dense. It follows from this that a set of the first category cannot 
contain all the points of any interval, or cell, and is thus a diffuse set. 
Next, let us assume, if possible, that C (QJ) is itself the limit of a sequence 
(ft, (ft, ... Q n , ... of non-dense sets, each of which is contained in the next. 
The sets G x -|- (ft, (ft f ft, ... are all non-dense, and each one of them is 
contained in the next; their limit, which is of the first category, is identical 
with the fundamental cell, and this has been shewn to be impossible. 
Hence the complement of a set of the first category is not of the first 
category. 

The converse of the above theorem does not hold good. It is not true 
that every set of the second category is the complement of a set of the 
first category. Thus there are two kinds of sets of the second category; 
those which are complementary to a set of the first category, and those 
for which this is not the case. If, for example, in a linear interval (a, b) 
the points of a sub-interval (« L , ft) be taken, these make up a set of the 
second category, and the complementary set is also of the second category. 
It has been shewn* by Mahlo that the points of a linear interval (a, b) 
can be divided into two sets, each of which is everywhere dense, and of 
the cardinal number of the continuum, and neither of which is of the first 
category. 

95. A set of the second category which is complementary to a set of 
the first category is saidf to be a residual set. 

A residual set can be obtained by the successive removal from a funda¬ 
mental cell, or interval, of non-dense sets of points belonging to a sequence 
of such sets. It has been shewn in § 94 that a residual set is everywhere 
dense. It will now be shewn that: 

A residual set has the cardinal number of the continuum. 

Let (ft, be a set of the first category, and C (G m ) the corresponding 
residual set. Let a system of nets be fitted on to the fundamental cell, 
or interval, in which (ft is contained. If n be sufficiently large, there is a 
mesh of the net D n that contains no point of (ft; the smallest value of n, 
say n x , for which this is the case may be chosen. Let this mesh be denoted 
by d . There is some smallest value of n, say n 2 , such that d contains in 
its interior two (or more) meshes d 0 , ft, of the net D nz , neither of which 
contains a point of (ft. In case there are more than two such meshes, 
ft, ft are those of lowest rank. Again, there is some smallest value of n, 

* See Leipz. Bcr. vol. lxv (1913), p. 283. 

f See Denjoy, Journal de Math . (7), vol. i (1915), p. 123, who has introduced this terminology. 
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say n 3 , such that both d 0 and d ± contain within them two or more meshes 
of D„ 8 , none of which contains a point of 0 3 . Let the two meshes in d 0 
be denoted by d m , d 01 ; and the two meshes in d A by d 10 , d n . Proceeding 
in this way, we can define a sequence of meshes d p , d VQ , d vqr) ... each of 
which contains the next in its interior; where each of the indices p, q> r, ... 
is either 0 or 1; and where the sequence p, q,r, ... is defined in accordance 
with some set of rules. The point interior to all the meshes of this sequence 
does not belong to G„, and is therefore a point of the residual. But such 
a sequence can be defined corresponding to each number of the con¬ 
tinuum; leaving out those sequences in which all the indices, from and 
after a fixed one, are all 1, the correspondence is unique. Hence the 
residual must have the cardinal number c, of the continuum. Further, it 
can be proved that: 

Any finite , or enumerably infinite , set of residuals have in common a set 
which is also a residual set . 

Let // (1) , ... be residuals obtained by removing successively the 

sets of sequences {6r n m }, {6r n (2) }, ... of non-dense sets from an interval, or 
a ceil; the non-dense sets can be arranged in enumerable order 
a a) a < 2 > fi <n (l o) (i <2) q a) 

If these sets be removed successively from the interval, or cell, we obtain 
the residual jy //m y 

96. It is often of importance to consider the properties of sets that are 
contained in a given perfect set H , which may be non-dense in the con¬ 
tinuum. 

If Pj, P 2 , ••• P n , ••• be a sequence of sets all of which are non-dense in 
the perfect set H, the set M (P l9 P 2 , ...) is said to be of the first category rela¬ 
tively to H. A set which is not of the first category relatively to H is said to 
be of the second category relatively to H. A set whose complement relatively 
to H is of the first category relatively to H is said to be a residual set rela¬ 
tively to H. 

It can be shewn that: 

A set of points which is the complement , relatively to the perfect set H , 
of a set of the first category , relatively to II, is of the second category relatively 
to H. 

In particular H cannot be the sum of two sets both of the first category 
relatively to H. 

This is proved in a manner precisely similar to that in which the 
theorem for the case in which H is a continuous cell, or interval, was 
proved in § 94, Thus it is first shewn that a residual relatively to H is 
dense in H , and then the proof of the theorem is completed as in § 94. 
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It follows that a set of the first category relatively to H is diffused in H. 

It can be proved, as in § 95, that a residual with respect to II has the 
cardinal number of the continuum. 

It has been shewn by Denjoy* that a residual relatively to a perfect 
set II itself contains a perfect set. 

It will appear, from the theory of order-types developed in Chapter iv, 
that the points of a perfect linear set can be made to correspond uniquely 
with points of a continuous interval (a, 6), in such a manner that the 
relative order of two points of the perfect set is the same as that of the 
corresponding points in (a, b ); the end-points of an interval contiguous 
to the perfect set corresponding to one point of (a, b). To a closed set H f 
non-dense in the perfect set, there corresponds a closed set, non-dense in 
the continuum; and sets of the first, or the second, category relatively to 
the perfect set correspond respectively to sets of the first, or of the second, 
category in (a, b). ft thus appears that the properties of sets of the first, 
and the second, category in the continuum can be immediately extended 
to sets of the first, and the second, category relatively to a perfect set. 

This is a particular case ,of the general property of any perfect set; 
viz. that all descriptive properties of sets of points in the continuum 
correspond to identical properties of sets of points in a perfect set, even 
if the perfect set be itself non-dense in the continuum. 

EXAMPLES 

1. Let P 2 , P 3 , ... i J ra , ... be an enumerable set of points in an interval («, b); the 
set may be everywhere dense in (a, b). The finite sets 

(Pi), (Pi, P t ), P*), (Pi,P» Ptf)* - 

are each closed, and the given set is the limiting set, which is therefore of the first category. 
The remaining points of (a, b) form a set of the second category, which is a residual. 

2. Denoting the points of the interval (0, 1), as in Ex. o, § 83, by 

C. Co Cji 

x = 1 4 J— - + ... + +..., 

*1 * 1 « 2 • • • k n 

where c n <k n ; let the fixed integers , k 2 , ... k n , ... form a sequence which increases 
without limit. Iff aj, a 2 , ... a n , ... is any sequence of positive integers which increase 
without limit, let G n denote the set of those numbers x, which are such that the integers 
Cj, c 2 , ... ... are all <a n . The sets G\ , , ... Gr n , ... are a sequence of perfect sets, 

each one of which contains the preceding ones; the set G^ is then a set of the first category. 

3. The numbers of the continuum (0, 1) may be divided into sets, of the first, and of 
the second category, in the following manner: All the numbers in (0, 1) may be expressed 
as endless decimals; the finite decimals being therefore not used. Lett se t> H consist 
of all those numbers in which the digit 9 occurs only a finite number of times, and of those 
numbers also in which, from and after some place, all the figures are 9. The comple- 

* Journal de Math. (7), vol. I (1915), p. 232. 
t Broden, Math. Annalen, vol. li (1899), p. 299. 
t See Schoenflies, Bericht, vol. i, p. 106. 
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mentary set K consists of all those numbers in which 9 occurs an infinite number of times, 
except those in which every figure is 9, from and alter some place. The set H is the limit 
of a sequence of non-dense closed sets H lt H 2 , ... // w , ... each of which is of cardinal 
number c. For, let H x consist of the numbers of the form -afoc ... A:999 ..., in which every 
figure is 9, after some fixed place, and in which none of the figures a, />, c, ... k is 9; together 
with those decimals in which no figure is 9. No number of the set H 1 can lie within the 
interval ('abc ... &899 ..., •abc ... A’999 ...) which is therefore a complementary interval of 
the set. The set 11 n may be taken to consist of the numbers of the form 'abc ... hi t 999 ...» 
in which k is not 9, and not more than n 1 of the figures a , 6, c, ... h, are 9; together with 
those decimals in which 9 does not occur. That each of the sets H n is of cardinal number c, 
follows from the fact that it contains all the decimals in which 9 does not occur; and these, 
if interpreted in the scale of 8, represent all the numbers of the continuum (0, 1). The set 
// is everywhere dense, since it contains that evervwhcre dense set of numbers in which 
every figure is 9, after some place. The set K, being of the second category, is also every¬ 
where dense, and of cardinal number c. 

4. The following method of dividing the continuum (0, 1) into two portions, each of 
which is everywhere dense, and of cardinal number r, lias been given by Brod6n*: Let 
Iq 4 h + 1 2 + • • • +/ n + ... denote a divergent series of positive numbers, such that the limit 
of l n y as n is indefinitely increased, is zero. Let a be a positive number <1, and let 
n 19 n 2y ... n Ly ... be a sequence of increasing positive integers. It is possible to choose the 
divergent series so that each of the ratios / Wj // n , ••• is <«* ; if this be done, the series 

OO l ni ' _ 00 

2 l ni is convergent, its sum being < ^ . Each of the series obtained from 2 l ni , by 

leaving out a finite number of terms, is also convergent. The convergent series, so obtained, 
form an unenumerable set: for they arc obtained by multiplying the terms of the series 

OO 

2 l n , each either by 0, or by 1; and thus there is a series corresponding to each fractional 

i l 

number expressed in the dyad scale. Corresponding to each convergent series there is a 
divergent series which consists of / 0 + /j 4 ... +/ n + ..., with the convergent series removed 
from it. We obtain in this manner an unenumerable set of divergent series. The con¬ 
vergent and divergent series, each of which consists of terms of / 0 4 I 1 -f ... -4 l n -f..., may 
now be correlated with the numbers of the continuum (0, 1). I^et these numbers be ex¬ 
pressed in the dyad scale, in the form • a 1 a 2 a 3 ...» where every a is 0, or 1, and the case in 
which every figure is zero, after some place, is excluded. To one of the series l v + l Q +l T 4 ..., 
we may take that number to correspond in which a p , a q , a r , ... are all 1, and the remaining 
digits 0. The points of (0, 1) are thus divided into two classes; one of these consisting of 
all the numbers which correspond to convergent series, and the other of those corresponding 
to divergent series. 


ORDINARY INNER LIMITING SETS 

97. The inner limiting setf of a sequence of open sets, each one of 
which contains the next, is said to be an ordinary inner limiting set. 

The outer limiting set of a sequence of closed sets, each of which is 
contained in the next, is said to be an ordinary outer limiting set. 

* Crelle'8 Journal , vol. cxvm (1897), p. 29. 

f The term inner limiting set is due to W. H. Young, who has investigated the properties of 
such sets; see Leipz. Her. 1903, p. 287; for further properties see also his papers in the Proc. 
Ijond. Math. 8oe„ (2), vol. i (1903), p. 212, and (2), vol. m (1905), p. 372, where the theory is 
extended to p-dimensional sets. 
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Thus an ordinary inner limiting set is the complement of an ordinary 
outer limiting set. 

In the case of a linear set of points contained in a given interval (a, 6), 
to which the case of a linear set in an unbounded interval may be reduced 
by the method of correspondence, an ordinary inner limiting set is the 
.set of points common to a sequence {A n }, where A n is, for each value of n , 
a set of open intervals contained in (a, b). 

In the particular case in which all the sets A w are everywhere dense 
in (a, 6), the complementary closed sets C (A n ) are all non-dense. In this 
case the ordinary inner limiting set is of the second category, and is of 
that species which is termed (§ 95) a residual set. 

The case of linear sets will here be considered in detail. 

The following theorem will be established: 

Every ordinary ( linear) limiting set is either enumerable , or it has the 
power of the continuum . 

If the sets A n are all everywhere dense in (a, b), the inner limiting set, 
being a residual set (see § 100), has the cardinal number of the continuum, 
in accordance with the theorem of § 95. 

Next, let us suppose that the sets {A n } are not all of them everywhere 
dense in (a, b); and let us suppose further that the inner limiting set 
contains a part H which is dense in itself, so that H ' is perfect. The set H' 
may be placed in correspondence with points of the interval (0, 1), so that 
the order of corresponding points in //' and in the interval (0, 1) is always 
the same. To each point in (0, 1) there corresponds a single point of H f , 
except that the end-points of an interval contiguous to H' correspond to 
a single point in (0, 1). The points of H correspond to the points of a 
set H t everywhere dense in (0, 1); and those intervals of A n that contain 
points of H correspond to a set of intervals A„' everywhere dense in the 
interval (0, 1). The set H therefore corresponds to a residual set in (0, 1); 
and therefore H has the cardinal number of the continuum. 

The only sets of points which contain no components that are dense 
in themselves are enumerable sets, and therefore the cardinal number of 
an ordinary inner limiting set is c or a. 

It has thus been established that : 

An ordinary inner limiting set has the power of the continuum if it 
contains a set that is dense in itself ; and if it contains no such component it 
is enumerable . 

An enumerable set which contains a component that is dense in itself 
cannot be an ordinary inner limiting set. 
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98. If any linear set G is given, we may suppose a point x, of G , to be 
in the interior of each interval of a sequence (x), S 2 (x), ... 8 n (a;), ... of 
intervals, each one of which contains the next; and this will be the case 
for each point x in G. Let d n be the upper boundary of the lengths of the 
intervals 8 n (x), taken for every point x in G\ and suppose the intervals 
so determined that the numbers d lf d 2 , ... d n , ... form a sequence which 
converges to zero. Let A n be the set of open non-overlapping intervals 
which have the same interior points as the set of intervals {S n (x)}. We 
have then a sequence of sets A x , A 2 , ... A w , ... of open intervals, which 
will define an ordinary inner limiting set, to which all the points of G 
must belong; but it may also include limiting points of G that do not 
belong to G. For it is clear that any point of the inner limiting set is at 
a distance from some point of G n that does not exceed d n \ hence any 
point of the inner limiting set which is not a point of G n is the limiting 
point of a sequence of points of G n , since d n ~ 0, as n ~ oo. 

The set of points of G' which do not belong to G , but which belong to 
the inner limiting set defined by the sequence {A n }, will depend in general 
on the mode in which the intervals of the sets A n are defined. In order 
that the given set G may be an ordinary inner limiting set it must be 
possible so to define the sets A n that no point G' that does not belong to 6? 
is in A n for every value of n. 

Some criteria will be here given for deciding in respect of a given set 
whether this is possible or not, that is, whether the set is an ordinary 
inner limiting set or not. 

A point which is in A m , but not in the sets A w41 , A m42 , ..., is said to 
be shed from the sequence {A n } at the index m. 

If all the intervals {8 n (#)} be taken to be of equal length 2 c n , with the 
point x in the centre of 8 n (x), where c n ~ 0, as n ~ oo, then every limiting 
point of G belongs to the ordinary inner limiting set defined by the 
sequence of intervals. 

For, however small c n may be, there are points of G whose distance • 
from a limiting point p is less than c n . 


EXAMPLE 


The following example, given by Borel*, drew attention to the fact that a sequence of 
sets of intervals constructed as above may define an inner limiting set that contains points 
other than those of the given set. 


jp 

Let us suppose that each rational point £ in the interval (0, 1) is enclosed in the interval 

( - - 4, - + 4] , where A has the same value for all the points. In this manner the rational 
q <f q <?J 

2 

points are enclosed in a set of overlapping intervals, whose sum is less than A2 (q - 1) ^, 


* Lemons sur la thdorie des jauctions , p. 44. 
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or than 2A2 ^, which can be made as small as we please by choosing A small enough. The 
eqiiivalent set of non-overlapping intervals defines, by means of the end-points and their 


i , for all points ^ . 

<f g 


limits, a closed set such that for any point of the set j ^ -q 1 1 

Now consider the set of points defined by 

a, a., a.. a,,. 

x ~To + Jo 3, + 10 s ' h+ io* + •**’ 

where each a is ^.9, and the a’s are such that an infinite number of them are different from 
zero. It has been shewn by Liouville that these numbers x arc transcendental. Let 

nr.» 


then 


q 10 10' 

a n+1 


+ 4: + ”* f ijn- thus <7^-10" 


q ]()<n4-i)' + ”* 


i ( a rrt 1 , \ ^ * 

<i n V q ”7 q n ‘ 


It follows that, if x is one of the above transcendental numbers, whatever value A may 
have, it is interior to an interval ( 

provided A £ jQ( W !_ 3 ) n .; anc ^ however small A may be, values of n can be found for which 


A p A 
g 3 q q 


For suppose q = 10 n ; then ~ -x < 


this inequality is satisfied. Therefore rational points ^ can be found, however small A may 


be, such that x lies within the intervals 


original points ^ which the intervals are drawn to enclose, there arc other points which lie 
inside the intervals, for all values of A, when A is diminished indefinitely. 


- ^ + W It thus appears that, besides the 

q g 3 q q* ) 


99. Those limiting points of an enumerable set of points P that do not 
belong to P form an ordinary inner limiting set. 

Let p x , p 2 , ... p n , ... denote the points of P, and let H denote the set 
of those limiting points of P which do not belong to P. 

The points of H can be enclosed in a set A x of open intervals that do 
not include the point p 1 . Interior to A lf a set of open intervals A 2 can 
be defined which include all the points of //, but not the point p 2 ; and 
so on. The sequence A lt A 2 , ... has for its inner limiting set all the points 
of H , but none of the points p l9 p 2 , ...; and H can have no limiting points 
that do not belong either to itself or to P. Hence H is an ordinary inner 
limiting set. 

Every isolated set is an ordinary inner limiting set. 

For each point x , of an isolated set, may be enclosed in an interval of 
a sequence 8 X (a;), S 2 (x) } ... 8 n (x), ..., each interval containing the next, 
and so that 8 n (x) converges to zero; and such that 8 X (x) contains no point 
of the given set other than x. The set A n may then consist of the intervals 
8 n (x) for every point x of the set. It is then clear that the only points 
interior to A n , for every value of n, are the points of the given isolated 
set, which is therefore an ordinary inner limiting set. 
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Every closed set is an ordinary inner limiting set. 

Enclose each point P , of a closed set G , in an interval of length 2p, 
with P as its middle point; any part of such an interval that is not in 
(a, b) may be disregarded. We have thus a set of overlapping intervals 
which contain all the points of G within them. Consider the equivalent 
set of non-overlapping intervals (§ 71). We have thus a set A, of non¬ 
overlapping intervals, each of which is of length p; and therefore A 
must be a finite set, which contains all the points of G. Let p have all 
the values in a sequence of diminishing numbers that converges to zero ; 
the corresponding sequence of finite sets of intervals A l9 A 2 , ... A w , ... is 
such that each contains the next, and every interval includes points of G. 
The inner limiting set which they define must be the set G, since G contains 
all its limiting points. 

100. The necessary and sufficient condition that an enumerable set P 
should be an ordinary inner limiting set is that P contains no component 
that is dense in itself. 

That the condition is necessary has been shewn above, since every 
inner limiting set that has a component that is dense in itself has the 
power of the continuum. 

To prove the sufficiency of the theorem, we employ the mode of 
analysis of a set, given in § 91, into adherences and coherences. 

Any enumerable set P can be resolved into a sum of sets 
P i + P% 4- ... 4- Pp 4- Qp , 

where P 1} P 2 , ... Pp are all isolated sets, and Qp is a component of a 
perfect set Gp\ /3 denotes a number of the first, or of the second, class. 
When P contains no component that is dense in itself, the decomposition 
of P ceases at, or before, some definite number a of the first, or of the 
second, class, when there is no set G a . 

It has thus been shewn that, when P contains no component that is 
dense in itself, it can be resolved into a finite, or enumerably infinite, set 
of inner limiting sets, of which there may, or may not, be a last set. Let 
P y be one of the components into which P has been resolved, y denoting 
a number of the first or second class. We now fix on a sequence of sets 
of intervals enclosing the points of P y , such that all the intervals are 
interior to the intervals complementary to G y \ then the set 

Py+i 4 * ^ y +2 4 * • • •, 

which is contained in G y , has no limiting points in any of the intervals 
which enclose the points of P y , for all its limiting points must be in G y . 
The sequence of sets of intervals having thus been fixed for every P y , we 
can now shew that each limiting point p, of P, which does not belong to P, 
is shed from the whole sequence of sets of intervals, at a definite index. 
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The point p is either a limiting point of P x , belonging to L 1 , the set of the 
isolated points of P', or is contained in G x . In the former case it is shed 
from the intervals enclosing P x at a definite index; and, not being a 
limiting point of P 2 f P 3 + ..., it is shed from the intervals enclosing the 
points of that set, at a definite index; consequently it is shed from the 
intervals enclosing P, at a definite index, the greater of the two former 
ones. In the latter case, unless p is in G 2 or in P 2 ', it is not a limiting 
point of P 2 + P 8 f and does not come into any of the intervals en¬ 
closing the points of P 1 ; it is therefore shed at a definite index. If p 
belongs to G lf G 2 , ... and to every G before G a , but is not in G a , it may 
be a point of P a '. In that case it is not a limiting point of the set 

Pa |-1 i“ Pa + 2 ' P * * * » 

and does not come into the interior of any of the intervals which enclose 
the points of P x , P 2 , ..., or any P with index less than a. It is therefore 
shed, at a definite index, from the sequence of sets of intervals enclosing 
the points of P. The theorem has thus been established. 

A corollary to the above proof is that every enumerable set is the 
sum of an ordinary inner limiting set, and of a set which is dense in itself. 

In particular* it follows that: 

Every reducible set is an ordinary inner limiting set. 

For the derivative of a reducible set being enumerable, the given set 
can contain no component that is dense in itself, since the derivative of 
such component would be perfect, which is impossible when the derivative 
is enumerable. 

Since an ordinary inner limiting set that is everywhere dense consists 
of the points that are common to a sequence {A„} of sets A„, of non-over¬ 
lapping open intervals which are everywhere dense, and since the points 
that do not belong to A n form a non-dense closed set, it follows that an 
everywhere dense ordinary inner limiting set is a residual set (§ 05); the 
complementary set, of the first category, being the outer limiting set of 
a sequence of non-dense closed sets. 

101. Any unenumerable set can, in accordance with the result of § 91, 
be expressed in the form P = U + V a + S V x -f V c ; and we observe that, 
if V t is absent, the necessary and sufficient conditions that P may be an 
inner limiting set are that V a and E V x should both be absent; this follows 
from the preceding results. 

If V c exists, we observe that no point of U + V a -h EV X can be a 
limiting point of V c ; for any limiting point of V c must be a point of 

* See Hobson, Proc . Lond. Math. Soc. (2), vol. ii (1904), p. 316. Another proof of the general 
theorem, applicable to any number of dimensions, has been given by Brouwer, Proc . Roy. Soc. 
Amsterdam , vol. xvm (1) (1916), p. 48. See also vol. xx (1917) for further information on the 
subject. 
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degree c in the set P. If V c is everywhere dense in (a, 6), it follows that 
U 4- V a f 2F* is absent. The set V c may be non-dense in (a, 6), or it 
may be dense in some parts of (a, 6), and non-dense in other parts. 

It will be shewn that V c is in general made up of a part which is non- 
dense in (a, b), and of a finite, or indefinitely great, number of parts each 
of which is everywhere dense in a particular interval in which it lies. 
Suppose that an interval («, P) can be found in which V c is everywhere 
dense; and let x be a point in (a, b) such that x ft. Then those values 
of x for which V c is everywhere dense in (a, x), together with those values 
for which this is not the case, define a section of all the numbers of the 
continuum (p, b); and this section defines a number p x - ft. Similarly, we 
may assign a number a , so that (a lf /?,) is the greatest interval con¬ 

taining (a, p) which is such that V e is everywhere dense in it. If, in the 
parts of (a, b) external to (aj, p x ), the set V c is dense in any interval, then 
we proceed to fix the greatest interval for which it is everywhere dense, 
in this manner we obtain a finite, or enumerably infinite, set of detached 
intervals contained in (a, b), in each of which V c is everywhere dense; and 
the remainder of (a, b) may consist of a set of detached intervals and a 
set of points, in this remainder the points of V c form a non-dense set. 

No point of U I V a 4 ZV X can be in an interval (a lf P ± ) in which V c 
is everywhere dense. If V c is the part of V c which is non-dense in (a, 6), 
every point of (J 4- V a 4- 1,V X must lie in one of the intervals comple¬ 
mentary to the perfect set V c '. It is to be observed that in V c are included 
the end-points of the intervals (a x , ft), in case those end-points belong 
to V c . 

In order that P may be an inner limiting set, it is necessary that the 
part of U 4- V a 4- 2 V x which is in each interval complementary to V c ' 
should be an inner limiting set; and this cannot be the case unless V a 
and 2 V x are absent. 

It has thus been shewn that: 

In order that an unenumerable set of points may be an inner limiting set T 
it is necessary that the set should contain no points whose degrees in the set 
are other than 0, a, or c, and that it should contain no component which is 
dense in itself , and, whose points are of degree a in the set. 

The determination of the necessary and sufficient conditions that any 
given unenumerable set of points, however defined, may be an ordinary 
inner limiting set, has now been reduced to the problem of determining 
the criteria for the case of a set which is dense in itself, and all the points 
of which are of degree c in the set. The case in which the latter set is 
non-dense in its domain may be reduced, by the method of correspondence, 
to that in which it is everywhere dense; and the problem is therefore 
reducible to that of determining the conditions under which a given 
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everywhere dense set of points, all of degree c in the wet, is a residual set, 
of the kind described in § 100. No investigation of all the possible types 
of such sets has yet been carried out, and therefore the problem remains 
as yet unsolved. 


PLANK SETS OF POINTS 

102. It has already been shewn in § 85, in the ease of closed sets, that 
the descriptive properties of sets of points in two or more dimensions are 
in some respects less simple than those of linear sets. The properties of 
plane sets of points are of special importance in the theory of functions 
of a complex variable, but they do not differ in any essential respect from 
the properties of sets in three or more dimensions. The definitions of 
closed sets, open sets, frontiers, etc., given in § 55, for such sets, are 
identical with those for linear sets, but an account will here be given of 
the most important descriptive properties of sets in more dimensions 
than one. Although the proofs of the theorems are, for simplicity of 
language, given for plane sets only, they may be extended to the case of 
sets in more dimensions than two, without material alteration. 

Each point (x, y), of a plane set, is defined by the two real numbers 
which are the rectangular Cartesian coordinates of the point. As has 
already been pointed out in § 53, a correspondence may be established 
between the points in a finite cell, or rectangle, and the points in un¬ 
bounded space, of such a character that the descriptive properties of sets 
of points are unaltered by the transformation; a special convention being 
introduced concerning the points on the boundary of the finite rectangle, 
or cell. It makes no essential difference, in the properties which will here 
be considered, whether the closed or open sets are considered as bounded, 
that is as contained in a finite open, or closed, rectangle, or whether they 
are in the unbounded plane space; provided that, when necessary, an 
adjunct boundary at infinity is postulated which corresponds to the 
boundary of a finite rectangle. 

103. The frontier of a set of points G has been defined, in § 55, as the 
set of points each of which belongs to one of the sets G y G (G), and is a 
limiting point of the other set. It will be shewn* that: 

If the complementary set of G exists , then the frontier of G and C (G) 
exists , and is a closed set. 

Let P be any point of G, and P' a point of C (G), and consider those 
points of G that are on the linear segment PP' y i.e . those points whose 

coordinates are X , where P is (x, y), and P' is (x\ y') t 

and h is a positive number, or zero. The linear set of points of G y on 


* Jordan, Cours d’Analyse, vol. I, p. 20. 
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PP', has, in accordance with the theorem of § 47, an upper boundary Q . 
This point Q , which may coincide with P, is a point of the frontier of 
G and C (G); for if Q is a point of G, it is also a limiting point of 0 (G), 
and if it is a point of 0 (G), it is a limiting point of G. Therefore, if C (G) 
exists, there is always a frontier of G and C (G), Moreover, let 

•• Qn, ... 

be a convergent sequence of points of this frontier. Denoting by Q the 
limiting point of the sequence, Q is itself a point of the frontier; for in 
the set {$„} there is contained a convergent sequence of points all of which 
belong to G, or else such a sequence of which all points belong to C (G). 
If these points all belong to C (G), and consequently to G', then Q belongs 
to the closed sets O' and {C (G)}' \ if they all belong to G, and consequently 
to {C (G)}\ then Q belongs to G' and to {0 (G)}'. In either case Q is a 
point of the frontier; and thus, since every limiting point of the frontier 
belongs to it, the frontier is a closed set. 

If all points of G belong to its frontier, G has no interior points. 

104. The distance* of two points P, or fa, y ), and Q , or fa', y '), has 
been defined in § 50 to be the number {(or — x') 2 f (y -- y ') 2 }^; and this 
may be denoted by PP'. 

If P is a point of a set G Y , and P' a point of another set G 2 , the dis¬ 
tances PP', when every such pair of points is contemplated, form an 
aggregate of numbers which have a lower boundary. In case this lower 
boundary is a positive number d (G x , G 2 ) (> 0), the sets G 1 and G 2 are said 
to be detached, from one another. The number d (G x , G 2 ) is said to be the 
distance between the two sets G x , G 2 . 

If one set G 2 consists of a single point p, d (p, G 1 ), or simply d, is said 
to be the distance of the point p from the set G x . 

If two closed sets , G 1 , G 2 are detached from one another, there exists at 
least one pair of points P, P', belonging respectively to the two sets, such that 
the distance PP' is equal to the distance of the sets from one another. 

Let d denote the distance d (G lf G 2 ), of the closed sets from one 
another; and let , e 2 , ... e n , .. / be a sequence of decreasing positive 
numbers that converges to zero. For each value of n, a pair of points P n , 
or ( x n , y n ), and P w ', or ( x n \ y n '), belonging to G l , G 2 respectively, can be 
determined such that P w P n ' 2 < d 2 - f* e n . A unique point p n , or 

fan > Vn> ) Vn )j 

exists in the four-dimensional continuum, corresponding to each pair of 
points P n , P n '. The set of points p l9 p 2 , ... p n) ... has at least one limiting 
point fa, y, x' , y'); let P, P' denote the two points fa, y), fa', y') in the 

* Instead of the distance so defined, Jordan employs in this connection the “ccart,” defined 
by | x - x' | +1 y - y' j . This makes no difference in the developments. 
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plane space. It will be shewn that P and P' belong to G x , G 2 respectively, 
and that PP f is equal to d. An integer m can be so determined that 
x — x n > y — y„, x ' — x n > y' ~ Vn are all numerically less than an arbi¬ 
trarily chosen positive number 77 , provided nzm. It follows that P is a 
limiting point of the sequence {P n }, and that P' is a limiting point of the 
sequence {P n '}; and thus that P and P' belong to G x , G 2 respectively, since 
these sets are closed. We have, further, 

I X - x' I s I X - x n I + I x n - x n ' I + I x„’ - x’ I 5, 2r, + I I, 

and similarly | y — y' | < 2rj 4 - | y n — Vn |» for n z m. From these ine¬ 
qualities we see that (x — x) 2 + (y — y') 2 < Sf + Arj + P n P n ' 2 , where A 
is some fixed number; hence PP' 2 < Hr} 2 + Arj f- d 2 - he n . Since 77 and e n 
are both arbitrarily small, it follows that PP' 2 u d 2 , and thus that PP' 2 y 
which is certainly not less than d 2 , must be equal to d 2 . The theorem has 
now been established. For the choice* of the infinite set of pairs of points 
P n , P n \ the multiplicative axiom is required. 

105. A closed set of points is said to be connex , or single sheeted, when 
it is not the sum of two or more closed sets. 

It should be observed that the expression “connex” has betm em¬ 
ployed in § 41 in a somewhat different sense. 

A connex closed set which does not consist of a single point is a perfect set. 

For an isolated point of a closed set G can be considered as a set 
detached from the closed set consisting of all the remaining points of G ; 
and hence, if an isolated point existed, G could not be connex. 

ur, P r are any two points of a connex perfect set G, and e be any assigned 
positive number, a finite number of points p x , p 2 , ... p n , of the set, can be so 
determined that Pp x , p x p 2 , p 2 p 3 , ... p n -iP n , p n P' are all < e. Conversely, if 
this condition holds for each pair of points P, P' of G, and for every positive 
number e (h depending on e), then G is connex. 

The condition stated in the theorem is sufficient to ensure the con- 
nexity of G. For if G be the sum of two closed sets G t , G 2 , the distance 
between which is d (> 0), we may choose e to be < d. If P is a point 
of G x and P' a point of G 2 , and p x is a point of G such that Pp x % e, the 
point p 1 belongs to G x ; again if p 2 be a point of G such that p x p 2 s €, the 
point p 2 belongs to G x \ and so on. Since p n belongs to G x , whatever finite 
value n may have, it is impossible that p n P's e, because p n P' z d. Again, 
the condition is a necessary one. For, let us suppose that, for some value 
of c, the condition is not satisfied for every pair of points. If P belong 
to such a pair, the set G may be divided into two parts G x and G t ; where 
G x is such that, for each point P' belonging to it, a definite set of points 
of G l9 viz. p 1 , p 2 , ... p n , exists, such that Pp ly p x p 2 , ... p n P' are all £ c; 


* See Chap, iv, § 197 et seq. 
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and G 2 is such that, for each point of it, this condition is not satisfied. 
It can be shewn that G x , G 2 are both closed sets, and that the distance 
between them is > e. For, if p is a limiting point of G x , it belongs either 
to G x or to G 2 ; and since there are points. p n , of G x , such that pp n < e, 
the point p clearly belongs to G 1 ; therefore G 1 is a closed set. Again, if q 
is a limiting point of G 2 , it cannot belong to'6^; for a point P', of G 2 , 
can be found such that qP' < c; hence, if q belonged to G x so also would 
P' ; thus G 2 is closed. It is clear that no pair of points of G x , G 2 can exist, 
of which the distance is ^ e; hence, for these sets, d > e. It has thus been 
shewn that if, for any value of 6, the condition in the theorem is not 
satisfied, G can be divided into two detached closed sets, and it is there¬ 
fore not connex. 

106. An open set, defined in § 55 as one such that every point is an 
interior point, is also frequently called an open domain. 

When the open domain is such that any two points P, P' of it may 
be joined by means of a finite set of linear intervals Pp l , p x p 2 , •• p n P \ 
such that all the points of all these closed intervals belong to the domain, 
it is said to be a connex open domain. 

A connex open domain is also called a Weierstrassian domain , or a 
continuum. It may be either bounded or unbounded. The terms domain 
and region are also used to denote a connex open domain together with 
some, or all, of its frontier points. 

If, to a connex open domain, all its frontier points be added, it becomes 
a closed set, connex in accordance with the definition of § 105. But a 
closed connex set does not necessarily become a single connex open domain 
when all its frontier points are removed from it. Thus, for example, the 
set of all the points interior to, or on the circumferences of, two circles 
that touch each other externally, is a closed set, but the interior points 
of the circles do not form a single connex open set, but two such sets. 

Each point of an open domain is such that a circle exists with the 
point as centre, and with all its interior points in the domain, but with 
one or more points of the frontier on its circumference. 

For the point P has a definite distance dp from the closed frontier of 
the open domain, and in virtue of the theorem of § 104 there is one point 
at least of the frontier on the circumference of the circle with centre P 
and radius dp . 

Every open set of points is the sum of a finite , or an enumerably infinite , 
set of connex open domains. 

If P be any point of the given open set 0, let the component 0 Xf of O, 
be such that every point Q , of O x , satisfies the condition that P and Q 
can be joined by a set Pp x , p x p 2 , ..., p n Q of straight lines of which all the 
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points are points of O. The set 0 1 is clearly an open set, for a point Q of 
it has a neighbourhood, of which every point q can be joined in the above 
manner to P; and thus Q is an interior point of 0 1 . Therefore any point 
P, of O , is a point of a connex open domain which is a component of O. 
It will be shewn that the number of such connex domains contained in 0 
is enumerably infinite, or finite. Apply to the finite, or infinite, rectangle, 
in which O is contained, a system of closed nets {I) n }. There is a smallest 
integer n x , such that D ny has one or more meshes interior to O ; there is a 
smallest integer n 2 (> nf), such that those meshes of D ni which are not 
interior to O contain one or more meshes of l) fl2 that are interior to 0 ; 
and so on. We obtain in this manner a sequence of sets of meshes d ny , d V2 ,... 
such that each mesh of the sequence is interior to 0, and such that no 
mesh of the sequence contains any other such mesh. Every such com¬ 
ponent of O, as 0 1 , contains meshes of the sequence ; and since the set of 
meshes in the sequence is enumerable, it follows that the aggregate of all 
components of O that consist of connex open sets is enumerable, whether 
finite or not. It follows from this theorem that every closed set of points 
has for its complement, relatively to a finite , or infinite, cell, a finite, or an 
enumerably infinite, set of connex open domains. If the set of connex open 
domains is everywhere dense in a closed domain, the closed set is non-dense 
in that domain. 

These connex open domains are the true analogues, for plane (dosed 
sets, of the open intervals contiguous to a linear closed set of points, and 
they may therefore be said to be the open connex domains contiguous to 
the closed set. In § 86, attention has been confined to domains of a par¬ 
ticular type, viz. cells, and it lias there been shewn that there exists, in 
general, no unique set of cells which have properties in relation to a given 
closed set fully analogous to the properties of the open intervals contiguous 
to a linear (dosed set. If, to a non-dense closed set, there be added all 
the interior points of some of the contiguous domains, we obtain a closed 
set of the most general type. A closed set will be perfect, if no two of the 
contiguous domains abut on one another, or also if they abut on one 
another in such a way that the points common to the two boundaries 
form a perfect set. 

107. A bounded connex domain together with its frontier points forms a 
connex closed set. 

Since the frontier is closed, it is easily seen that the set formed by 
adding all the points on the frontier of the bounded connex domain is 
closed. 

If P, Q are both interior points of the closed set, they are both points 
of the bounded connex domain, and thus satisfy the requisite conditions 
as to the mode in which they may be joined. If P, Q are points on the 
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frontier, interior points P\ Q' may be so determined that PP', QQ f are 
both less than e; and then P', Q' may be joined by a series of straight 
lines Pp 1 , Pipi , ..., p n Q' each of length less than e, and such that every 
point of all of them is an interior point of the connex domain. It thus 
follows that P, Q may be joined so that the requisite condition is satisfied. 
The theorem will also hold good for an unbounded domain, provided those 
points at infinity which are boundary points are adjoined. 

The boundary of a connex domain, although closed, is not necessarily 
itself connex. For example, if the connex domain consists of the points 
between two concentric circles, the boundary consists of the two circum¬ 
ferences, and is not connex. 

The Heine-Borel theorem in its generalized form, proved in § 74, has 
already been emancipated, in § 75, from the condition that the set to 
which the points of the closed set are interior is necessarily a set of cells. 
It has in fact been shewn in § 75 that a set of open domains may be em¬ 
ployed instead of a set of cells. The theorem may then be stated in the 
following form: 

If every point of a {bounded) closed set G belongs to one or more of the 
connex open domains of a set 1) {not necessarily enumerable) of such domains , 
there exists a finite set of connex open domains belonging to 1) such that every 
point of G belongs to one at least of the domains of the finite set . 

In a similar manner it is shewn that, in the last theorem of § 75, 
applicable to any set of points G, open connex domains may be employed 
instead of cells. We thus have the following theorem: 

If each point of a given set of points belongs to a definite connex open 
domain corresponding to that p<mit, an enumerable set of these connex open 
domains can be so determined that all the 'points of the set belong to one or 
more of the. domains of that enumerable set. 

If the enumerable set of connex open domains be {/)„}> it may be 
shewn that, if H be any closed set contained in G, all the points of II 
belong to one or more of the domains of a finite set which is a component 
of {D n }. 

For, since each point of the closed set is interior to one or more of the 
domains {/)„}, the result follows from the generalized Heine-Borel theorem. 

108. The case when the set is a bounded open set O, in which, to each 
point of O, there corresponds a domain which consists of the interior of a 
circle with the point as centre, is of special importance in connection with 
the theory of a complex variable. 

Let dp be the distance of the point P, of a bounded open set O, from 
the frontier of the set. To the point P, the interior of the circle of radius 
dp , and centre P, may be taken to correspond. In accordance with the 
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theorem of § 107, every point of the open set is interior to one at least 
of an enumerable set { C n } of these circles. Each of the circles has on its 
circumference one or more points of the frontier of the open set. If the 
radius of the circle with centre P be Odp , instead of d P , where 6 is some 
fixed number (< 1), an enumerable set of these circles { C n } exists, such 
that each point of the open set is interior to one or more of the circles {<?„}, 
and all the points on the circumferences of these circles are points of the 
open set O. 

The results may thus be stated as follows: 

If O be any bounded open set , there exists (1 ), an enumerable set of circles 
{C n } such that each point of 0 is an interior point of one at least of the circles , 
and such that every interior point of the circles is a point of O, and the circum¬ 
ference of each of the circles contains one or more points on the frontier of O ; 
and (2), there exists an enumerable set of circles {(7 W } such that each point of O 
is interior to one or more of the circles , and every point on the circumference , 
or interior to , any of the circles , is a point of 0. 

If 0 consist of a single connex open domain, any two points P, Q> 
of 0, can be joined by means of a finite number of the circles of the set 
{C n }, or also by a finite number of the circles of {(?„}; so that P, Q are 
interior to circles of the finite set, and each circle overlaps the next. 

For P and Q may be joined by a finite set of segments of straight lines 
all of which segments are points of (). This broken line joining P and Q 
constitutes a closed set //, which, in accordance with the last remark in 
§ 107, is such that each point of // is interior to a finite set of circles be¬ 
longing either to {O n }, or to {(?„}. Whichever of these sets of circles be 
employed, we have a chain of circles such that P and Q are each interior 
to one of them, and such that each circle of the chain overlaps the next one. 

109. The following theorem is the analogue, for a plane set of points, 
of the theorem proved in § 71 for an open linear set. 

Having given a set of overlapping open domains , there exists a set of non - 
overlapping open domains such that an interior jxnnt of a domain of either 
set is an interior jmnt of a domain of the other set. 

The proof of the theorem is the same as that in § 71; it being shewn 
that every point interior to one or more of the given domains is an interior 
point of the set of all such points. The set of all points interior to one or 
more of the given domains is open, and consequently consists of a finite, 
or an enumerable set of connex open domains. 

Every open domain 0 is the sum of an enumerable set of closed cells which 
do not overlap , though any two of them may have a portion of their boundaries 
in common. 
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We may prove the theorem for the case in which the open domain is 
bounded, as it can then be extended to the case of an unbounded open 
domain, by the method of correspondence. 

Apply a system of nets, with closed meshes, to a fundamental ceil 
which contains the open domain. There is a smallest value n ly of n, such 
that D ni has one or more meshes which, being completely closed, are 
interior to O. Let these meshes, arranged in order of rank, be denoted 
by D ni - There is a smallest value n 2 , of n , (> Uj) so that D n% contains 
meshes not contained in D ni , each of which is interior to O. Proceeding 
in this manner, we have a sequence of finite sets of meshes D ni , D ni , ..., 
arranged in order, such that each mesh is interior to O y and no two of the 
meshes overlap one another. 

Any point of O is contained in each of a unique sequence of meshes 
d ly d 2y d 3 , ... belonging to I) x , I) 2 , ... respectively. These are the meshes 
in which the point would have been contained if the nets had been half 
closed instead of completely closed. For all sufficiently large values of 
n > d n is interior to 0. The smallest such value of n must be one of the 
numbers n ly n 2y ...j^say n 9x . Thus the point P y of O y is in a mesh d np 
which belongs to Z) w . It is thus proved that all the points of 0 are 
points of the sequence D nj , D n2 , ..., 

An open .set is the outer limiting set of a sequence of closed sets. 

First let the open set O be bounded, and Jet H be its boundary. Let 
d be a fixed number, and consider the set of points G d which consists of 
all points of O at a distance from H that is > d. This set G dy which cer¬ 
tainly exists provided d be sufficiently small, is a closed set. For let {P„} 
be a convergent sequence of sets of points of G d that has P for its limiting 
point. 

The distance of P from H cannot be less than d, for if it were, for a 
sufficiently great value of n , P n would be at a distance from FI less than d y 
which is not the case. If d have the values in a sequence d ly d 2 , ... d ny ... 
which steadily converges to zero, the closed sets G djy G d%y ... 6?^, ... are 
slJ ch that each is contained in the next, and any point of O belongs to all 
of the closed sets, from and after some value of n. Thus O is the outer 
limiting set of the sequence { G dn } of closed sets. 

In case the open set is unbounded, the theorem may be deduced from 
the case in which it is bounded. In this case the closed sets may have 
parts of their frontiers at infinity. 

The theorem may also be proved by considering the set of circles {(?„} 
such that every point of O is an interior point of one or more of the circles, 
and that every point in, or on the circumference of, each circle belongs 
to O. For we may consider the closed set M (G ly C 2 , ... G n ) = G n ; then O 
is the outer limiting set of the sequence {G n } of closed sets. 
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A closed set is the inner limiting set of a sequence of open sets. 

This is at once deduced from the last theorem by considering the 
complementary sets. 

110. Having given a non-finite family of sets of points , there exists at 
least one jx)int such that , in its arbitrarily small neighbourhood , there are 
points belonging to an infinite number of sets of the given family. Such a 
point may be an adjoined point at infinity. 

In the case in which each of the sets of points consists of a single point 
this theorem reduces to that of the existence of the limiting point of a 
set of points; and thus the theorem may be considered to be a general¬ 
ization of the fundamental theorem of § 52. 

In case all the sets of the family are contained in a finite fundamental 
cell, the theorem is proved exactly as that in § 52. A system of nets being 
applied to the fundamental cell, a sequence of meshes d l , d 2 > ... each con¬ 
taining the next, and belonging to the successive nets, is obtained, and 
each of which contains points belonging to an infinite number of the sets 
of the given family. The point defined by the sequence {d 7t } is a point 
which satisfies the condition of the theorem. In case there is no finite 
fundamental cell, the theorem is proved by the method of correspondence 
with a family of sets in a finite cell. In this case the point determined 
may be an adjoined point at infinity. 

It is easily seen that all the points that satisfy the condition of the 
theorem form a closed set; the points at infinity, if any, being included 
in the set. 

In case all the sets are open domains, we have the following theorem : 

Having given a non-finite (enumerable or urienumerable) family of open 
domains , there exists at least one point such that , in its arbitrarily small 
neighbourhood ,, there are contained domains which are portions of an infinite 
number of the domains of the given family. Such a point may be an adjoined 
\point at infinity. The set of all such points is a closed set. 

111 . The theorems in § 82 can be extended to the case of a set of two 
(or more) dimensions. 

Instead of intervals 8j, 8 2 , ... 8„, ..., each containing the next, and 
converging to the point p , we take a convergent set of cells. Instead of 
intervals contiguous to the perfect, or the closed, set P n , we take the set 
of contiguous domains. The procedure in the proofs is then otherwise 
unaltered. 

In case the sets P 2 , P 3 , ... P a , ... consist of the successive derivatives 
of a closed set P l3 if a set Pp does not exist, the previous set P^ consists 
of a finite set of points, and in case there is a number ft such that 



155 


109—112] Plane Sets of Points 

Pp ~~ Pp n , the set Pp is perfect. We thus obtain a proof of the theorem 
already established in § 89, that every p-dimensional set is the sum of a 
perfect set and an enumerable set. A proof has been* given by de la Vallee 
Poussin of the second theorem of § 82, applicable to closed sets in any 
number of dimensions, in which the theorems of § 75 are employed. 


112. Every set of points can' be expressed as the, sum of a set which is 
dense in itself , and of an enumerable set which is non-dense in every perfect set . 

In the first plat e, if a set is non-dense with respect to every perfect 
set, it is enumerable. For, if a set is not enumerable, there is a set of 
points of unenumerability which is perfect, and contains points of the 
set and, in general, others not in the set. The given set would be every¬ 
where dense in this perfect set, contrary to the hypothesis that it is non- 
dense in every perfect set. 

Let E be a given set ; and consider a point />, whether belonging to E 
or not, which is such that, in every neighbourhood of 7 ;, there is a perfect 
set with respect to which E is everywhere dense. 


Let P be the set of all such points p. Any perfect set in which E is 
everywhere dense must be contained in P ; lienee P has no isolated points; 
moreover P is clearly a closed set, and therefore it is perfect. It will be 
shewn that E is everywhere dense in P. If a perfect set be contained in 
any interval, or domain, contiguous to P, and E were dense with respect 
to such perfect set, it would be everywhere dense with respect to some 
perfect portion of such set, and this portion would therefore belong to P, 
which is impossible; hence E is non-dense in any perfect set contained in 
a domain contiguous to P. If E were not everywhere dense in P, the part 
of P in some interval, or cell, would contain no points of E. Any point 
of P belonging to this part would have a neighbourhood free from points 
of P, which is contrary to the definition of P. Hence E is everywhere 
dense in P. Let E - D (E,P) + E 2 ; 


so that E 2 consists of those points of E which do not belong to P. It will 
be shewn that P 2 is non-dense with respect to every perfect set, and con¬ 
sequently enumerable. If there were a perfect set in which E 2 were not 
non-dense, it would be everywhere dense in a perfect set P, a portion of 
the former one. If R have points not belonging to P, P, and therefore P 2 , 
is not everywhere dense in R. If R is contained in P, P 2 has no points 
in R. Therefore P 2 is non-dense with respect to every perfect set. 

In the case of a closed set, the part that is dense in itself is the perfect 
set which is a part of the closed set; the other part, which is irreducible 
in every interval, or domain, contiguous to the perfect set, is non-dense 
in every perfect set. 


* Integral vs de Lcbvsgnt, |>. 113 . 



156 Descriptive Properties of Sets of Points [ch. ii 

Denjoy*, to whom the above theorem is due, has indicated a method, 
in the general case, of obtaining by a successive process the enumerable 
set that is non-dense with respect to every perfect set. 

113. If G be any plane set of points, and, corresponding to each 
point P, of G , a cell with centre P, and sides all equal to 2A, be contem¬ 
plated, the set of all points which belong to one or more such cells, when 
the cells for all the points P, of G, are considered, is said to be a neigh¬ 
bourhood (A) of the set G. The neighbourhood (A) of the set G is said to 
be complete, or incomplete, according as the cells are taken to be closed, 
or open. 

Every point of the incomplete neighbourhood (A), denoted by //, of 
a given set G, is an interior point of H. For any point Q, of the set H , 
is an interior point of a cell, all the interior points of which belong to // ; 
hence a neighbourhood A' (< A) of Q can be determined so that all its 
points belong to H. Thus the incomplete neighbourhood (A) of any given 
set G is an open set. 

(a) The complete neighbourhood (A), denoted by H, of a closed ( bounded) 
set 6r, is a closed set. 

For, if Q 1 , Qz , ... be a sequence of points of H which converges to a 
point Q (x (l \ x (2) ), and if Q n be denoted by (x n {1) , x n {2) ), there exists a 
point P n (£ n (1) , f* (2) ), of G , such that | x n ™ - £„ (1> |, | z n < 2) - £ w (2) |_are 
both & h. The set of points {P w } has at least one limiting point P (? (1) > f (2, )> 
which is necessarily a point of G. A sequence of increasing integers 
m x , m 2 , m 3 , ... can be so chosen that the sequence P mi , P mz , P mz , ... con¬ 
verges to P as its sole limiting point. 

Since 

aW 1 ’ - h 2 -s + h, and x ma < 2 > -hs 2 x ms m + h, 
for every value of s, we have 

x {1) ~ h £ f (1) x iv -I- A, and x l2) — h -7 f (2) ^ x {2) + h. 

It now follows that Q is in the complete neighbourhood of G. Therefore 
the set H is closed. 

(b) If G 1 , G 2 be two ( bounded ) closed, sets which have no point in common , 
a neighbourhood of G l can be determined which contains no points of G 2 . 

If d be the distance between the sets G x , G 2 , the complete, or the in¬ 
complete, neighbourhood (h), where h< djy/ 2, of G x , contains no points 
ofO,. 

The following theorem can be deduced : 

(c) If the (bounded) closed set G consists entirely of interior points of a 

* See his memoir “Sur les nombres derives,’* Joum. de Math. (7), vol. I (1915), p. 235. 
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set H, a neighbourhood (h) of G, complete , or incomplete , can be so deter¬ 
mined that it consists entirely of interior points of H. 

The points of G have a maximum distance p from the point (0, 0). 
Choose a number p (> p), and let H 1 be that part of H that consists of 
points at a distance $ p from the point (0, 0). 

It is clear that every point of G is an interior point of H 1 . Let L be 
the set of points on the boundary of II X ; L is then a closed set. A neigh¬ 
bourhood (h) of G can be determined which, whether complete or incom¬ 
plete, contains no points of L. Every point of this neighbourhood (//) is 
an interior point of // l5 and therefore of II. For any such point Q is in 
the neighbourhood (h) of some point P, of G. If the neighbourhood (h) 
of P does not consist entirely of interior points of U l9 it must contain 
one or more points of L \ and this is not the case. Therefore Q is an interior 
point of H x *. 

THE CLASSIFICATION OF A FAMILY OF SETS OF POINTS 

114. It has been shewn, in § 56, that the set M (0 lf 0 2 , ... 0 n , ...), 
of points each of which belongs to one at least of the open sets 

is itself an open set. ’ ••• ^n» • ■ • > 

Let us consider a set D (O l9 0 2 , ... O n , ...) which, when it exists, con¬ 
sists of all the points common to the open sets of the sequence. This set 
belongs to a class of sets which may be denoted by 0 {d) ; so that each set 
of the class is obtained from some sequence of open sets. A set which can 
be exhibited as M (0 1 (d) , 0 2 {d) , ...) belongs to a class of sets which may be 
denoted by O dm . In the same way, a set which can be exhibited as 
D (0 1 (dm) , 0 2 (dm) , ...) belongs to a class which may be denoted by O dmd . 
By continuing this process we can obtain an indefinite number of classes 
of sets; of these, the first is the class 0 of open sets, the second is the 
class of sets 0 (d) , the third is the class of sets 0 (dm) , the fourth the class 
of sets 0 (dmd) ; and so on. The class w may be defined as the class of sets, 
each of which consists of the points common to all the sets of a sequence, 
each member of which belongs to one of the classes 1, 2, 3, .... The class 
cj -f 1 consists of sets each of which consists of the points belonging to 
one or more of the sets belonging to a sequence of sets of class a >; the 
class a) -f 2 consists of sets each of which consists of the points common 
to all the sets of a sequence of sets belonging to class w + 1. 

Proceeding in this manner, we may contemplate the existence of a 
class corresponding to any ordinal number of the second class. 

* See Hobson, Proc. Lond. Math. Soc. (2), vol. xiv (1914), p. 150. The theorem ( c) was first 
proved by Bolza in a different manner; see Vorlesungen iiher Variationsrechnung , p. 165. 
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This classification has been given by Hausdorff*, who applies the term 
Borel-set to denote any member of any of these classes, all of which he 
shews exist. He has proved that every Borel-set is either enumerable 
(or finite), or has the cardinal number of the continuum. This is a general¬ 
ization of the theorem that every closed set, which is an () (d) , has one of 
the cardinal numbers a and c. For the whole class 0 W) , the theorem was 
established by W. H. Young (see § 1)7), in connection with the theory of 
ordinary inner limiting sets. The result obtained by Hausdorff settles the 
question as to the possible cardinal numbers of sets of points, in the case 
of sets belonging to a very extensive class. 

A detailed treatment of the possible peculiarities of structure of sets 
of points in plane space, or in space of higher dimensions, is of the highest 
importance in relation to the application of the principles of the theory of 
sets of points to abstract geometry. As a development of this character 
is beyond the scope of the present work, reference is here made to the 
account given by Schoenfliesf, where references to the literature of the 
subject will be found. A memoirf by W. H. and G. 0. Young, “On the 
internal structure of a set of points in space of any number of dimensions,” 
may also be here referred to. Information on the subject will also be 
found in W. H. Young's treatise “On the theory of sets of points.'" Re¬ 
ference may also be made to G. N. Watson’s tract-, “Complex integration 
and Cauchy’s theorem,” where a proof will be found of Jordan’s funda¬ 
mental theorem, that a plane is divided into two distinct open eonnex 
domains, by means of a closed Jordan curve. Another proof § has been 
given by Brouwer. 


SETS OF SEQUENCES OF INTEGERS 


115. A theory of sets of sequences of integers, of which the formal 
character is similar to the theory of sets of points in any number of 
dimensions, has been developed by Baire||, with a view to application to 
the Theory of Functions. 

A group of integers (tq, a 2 , ... a v ), of order p, consists of a system of 
p positive integers arranged in a definite order. 

The group (cq, cq, ... a p ), of order p, is said to be contained in each of 
the groups (cq), (cq, cq), (cq, cq, a z ), ... (cq, cq, ... a v _ x ) of orders 1, 2, 3, 
... p — 1, respectively. 

* Math. Annalen, vol. lxxvii (1916), p. 430. 

f See his Entwickdung der Lehre von den Punktmannigfaltigkeiten , vol. II, chap. iv. 

X Proc. Lond. Math. Hoc. (2), vol. xvr (1917), p. 337. 

§ Math. Annalen , vol. lxix (1910), p. 169, and vol. lxxi (1912), p. 314. 

!| Comptes Rendus , vol. cxxix (1899), p. 946. 
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A sequence of integers (a 1 , a 2 , ... a P , ...) consists of an infinite number 
of integers, defined in any manner, and arranged in an order similar to 
the sequence 1 , 2, 3, .... This sequence is said to be contained in each of 
the groups (cq), (a lf a 2 ), ... (a lf a 2 , ... a P ), _ Let P be a set of such se¬ 

quences of integers, and let A be any other sequence of integers; then if, 
for every n , there are sequences in P, other than A itself, which are con¬ 
tained in the same group, of order n , as A itself, that is sequences having 
their first n integers the same as the first n integers in A, then the sequence 
A is said to be a limit of the set of sequences P. The sequence A may, 
or may not, itself belong to P. 

The set P is said to be closed , in case all its limits belong to it. The 
set is said to be perfect when it is closed, and also every sequence in the 
set is a limit of the set. 

A set E, of groups of integers, is said to be complete if, when g is any 
group of order p, belonging to P, the groups of orders 1, 2, 3, ... p — 1, 
which contain g , also belong to E. 

A complete set E, of groups of integers, is said to be closed , if every 
group g, belonging to E, contains at least one group of higher order than 
itself, which is also contained in E. 

Having given a complete set of groups E, a sequence A may exist 
such that all the groups containing A belong to E. The set P, of all 
sequences such as A , is said to be determined by the set of groups E . The 
set F , if it exists, is closed. 

Every closed set of groups E determines a (dosed set of sequences F, 
and conversely, every closed set of sequences F is determined by a unique 
dosed set of groups E. In case F is perfect, E is also said to be perfect. 
In order that E may be perfect, it is necessary and sufficient that every 
group belonging to E should contain at least two groups of one and the 
same order superior to its own order, and belonging to E. 

If P is a set of sequences, then the set P' of those sequences which are 
limits of the set P is said to be the derived set of P, and may be denoted 
by P'. The derived set P' is closed. 

The successive derivatives P", P"', ... P (w) , ... P (a) , of finite or irans- 
finite orders, are then defined as in the theory of sets of points. If P is 
a closed set of sequences, there exists a number a of the first or the second 
class, such that P (a) = P (a+1> . Unless P is an enumerable set, it can be 
resolved into the sum of an enumerable set and a perfect set. 

Let us consider a perfect set of groups E determining a perfect set of 
sequences F. A set P, of sequences all belonging to P, is said to be non- 
dense in P, or in P, provided that every group of E contains at least one 
group of E which contains no sequence of P. 
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A set of sequences P, all belonging to F, is said to be of the first 
category, relative to F, if there exists an enumerable sequence of its 

1, 21 ■ each of which is non-dense in F, and such that each 

sequence of P is part of one at least of the sets I\. P p The 

set obtamed by removing the set P from F is said to ’be of "the’ 'second 
category relative to F. The same generic distinction between sets of the 
hrst and of the second categories holds, as in the theory of sets of 



CHAPTER III 


THE METRIC PROPERTIES OF SETS OF POINTS 

116. In Chapter n an account has been given of those properties of 
sets of points which are called descriptive, in order to distinguish them 
from the metric properties which we now proceed to discuss. In the in¬ 
vestigation of descriptive properties the notion of the distance between a 

(p ) i 

pair of points x , x', defined arithmetically by £ (x r — x r ') 2 f\ has been 

| i ) 

employed, but, although the descriptive properties of a set are invariant 
for a large class of transformations, the distance between a pair of points 
x, x' is not invariant in such transformations. 

The earliest theory of the metric properties of sets was originated, for 
the case of linear sets, by Hankel*, and was further developed by Harnackt, 
StolzJ, and by Cantor§, who extended the conception of the content of 
a set to the case of sets of points in a domain of any number of dimensions. 
Although this theory of content has now been almost entirely superseded 
by the later theory of measure, developed by Borel|| and Lebesgue*|J, we 
proceed to give an account of it, not only for historical reasons, but also 
in order to point out the respects in which it has defects which are 
remedied in the later theory of the measure of sets of points. A very 
general theory of measure has been given by Carat heodory**. 

THE CONTENT OF A SET OF POINTS 

117. Suppose a linear set of points G to be contained in the finite 
interval (a, b). Let a system of nets be fitted on to the interval (a, b). 
In this case it will be convenient to take the meshes of the nets to be 
closed at both ends. In the net of order r, let there be n r meshes, and of 
these suppose v r contain points of G within them, or at their ends. Let 
$n r , „ y b — a) denote the sum of the breadths of the v r meshes. It is clear 
that 8 nr%Vf £ £ nr+1 ., rf1 ; and thus the sequence of numbers 

8 7n, v x y 8 ns , v 2 y ■ • • 8 nj . t p r ... , 

* Math. Annalen, vol. xx (1882), p. 86. f Math. Annalen, vol. xxv (1885), p. 241. 

J Math. Annalen , vol. xxiii (1884), p. 152; see also Pasch, Math. Annalen , vol. xxx (1887), 
p. 142. § Math. Annalen , vol. xxiii (1884), p. 473. 

|| See Lemons sur la ihlorie des fauctions (1898). 

Tf See the memoir “Integrate, longueur, aire,” Annali di Mat. (3), vol. vii (1902), p. 231; 
also Lemons sur Vintegration (1904). 

** See Gotlinger Nachrichten, 1914, p. 404. 
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which are all > 0, has a definite limit Z, which is such that 0 i Z -- b — a. 
The number $ n — Z may be made as small as we please, by taking r 
sufficiently large. 

It will be shewn that the number Z is the same whatever closed 
system of nets is employed. Suppose two systems of nets to give the 
values Z t , Z 2 , of Z, respectively. We may denote by tf n w yV p) the 

values of the sum S nr , Vr for the two systems of nets, {i> r / n }, {l ) n {2) ). 

If a net l> th {l) of one system be superimposed on a net /) w , (2) of the 
other system, we obtain a net which may be denoted by 

If a denote the sum of the breadths of those meshes of LL r (,) . that 
contain points of 6 Y , it is clear that a # Wr <>t 

Let r be so large that # Wr cn tl>r <n Z A < 6, an arbitrarily chosen positive 
number; and let s be so large that the greatest of the breadths of the 
meshes in the nets I) s iZ) is < y, an arbitrarily chosen positive number. 
Of the r s (2) meshes in the sum S n ^> , v p> the number that are not also meshes 
in the sum o is at most n r — 1; hence < a I n r V • Therefore 

2* < r‘" I V rV < S, I' € + n r T). 

Now e and yj are arbitrarily small, and it therefore follows that Z 2 Z x . 
It can similarly be proved that Z-, - Z 2 ; and from the two relations it is 
inferred that Z t — li 2 . Therefore Z is a definite number, independent of 
the particular system of nets employed in defining it. 

We have now established the following theorem: 

If G be any given set of points in the interval ( a , b), there corresfxmds to G 
a definite number Z, which is such that all the points of G are interior points 
of a definite number of intervals whose sum exceeds Z by less than an arbi¬ 
trarily chosen positive number e, the number of the intervals depending on e. 

The number Z is called the content of the set G, and the content may 
have any value in the closed interval (0, b — a). 

Those sets of points for which the content is zero are of special im¬ 
portance in the Theory of Functions. A set of zero content is said to be 
an unextended , or a discrete , or an integrable set of points. 

It is clear that the definition, and the proof of existence given above, 
are applicable in the case of a bounded set of points in space of any number 
of dimensions, provided intervals are replaced by cells, and the breadth 
of an interval is replaced by the product of the lengths of the sides of a 
cell, which is regarded as the content of the cell. A point of the set that 
is on the boundary of a cell is also in one adjacent cell at least. 

118. The content of a set of points is the same as the content of its de¬ 
rivative. 

Let Z' be the content of G' , the derivative of a set G; then the points 
of O' may be included in a finite set of intervals, or cells, the sum of 
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whose contents is less than E' + 8, where 8 is an arbitrarily chosen positive 
number. There can be only a finite number of points of G which do not 
fall within the intervals, or within the cells, that include the points of G '; 
and this finite number of points may be included in intervals, or cells, 
the sum of the contents of which is arbitrarily small, say e. All the points 
of G are now included in a finite number of intervals, or cells, of total 
content less than £' -f 8 | e; and a series of diminishing values may be 
assigned to 8 and €; each sequence having the limit zero. Therefore both 
S' 4 8 + e and S' I 8 converge to the value S'; which proves the theorem. 

It follows, from this theorem, that the content of any set is the same 
as that of any of its successive derivatives. In the case of a set which is 
of the first species, one of the derivatives contains only a finite number 
of points, and consequently the set must be of zero content. 

119. A definition of the content of a linear set of points has been 
given by Cantor* which, though differing in form from that of Hankel 
and Harnack, is in reality equivalent to it. Instead of enclosing the points 
of the set G in a finite number of intervals, Cantor encloses each point 
of G in an interval 2 p, of which the point is the middle point, the number p 
being the same for each point of the set, those parts of intervals 2p which 
do not lie within (a, b) being disregarded. We have in this manner 
obtained an infinite number of overlapping intervals which contain all 
the points of G, and, as is clear, all the points of G', which is a closed set. 
If we replace this set of intervals by the set of non-overlapping intervals 
with the same interior points, each interval of this latter set is > p. The 
set, which is non-overlapping, and equivalent to the infinite set, is conse¬ 
quently a finite set, the sum of whose lengths may be denoted by II (p, G). 
When p is diminished indefinitely, the number 11 (p, G) y which cannot 
increase as p is diminished, must have a definite lower limit, which defines 
the content of either of the sets G and G' . Since the infinite set of intervals 
which has been employed only covers a finite number of detached lengths, 
this definition is equivalent to that of Hankel and Harnack. Cantor 
applied this definition to the case of a set of points in a p-dimensional 
continuum, by enclosing each poipt in a “sphere” of radius p, with its 
centre at the point; the content is then the lower limit of the volume made 
up of the points contained within the spheres. 

The essential point in the above definition of the content of a set of 
points is that all the points are contained in a finite number of intervals, 
or cells, which therefore contain all the limiting points!; and the lower 
limit of the sum of the contents of these intervals, or cells, is taken as 
defining the content of the set. If the points are contained, from the 

* Math. Annalen, vol. xxiii (1884), p. 473. 

| See Harnack, Math. Amialen, vol. xxiii (1884), p. 241. 
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commencement, in an infinite number of intervals, or cells, which are of 
unequal span, in accordance with some prescribed law, and the spans of 
these intervals, or cells, are then diminished, each one in a prescribed 
manner tending to the limit zero, then the limit of the sum of those parts 
of the fundamental interval, or cell, which are included In the infinite set 
of intervals, or cells, is not necessarily equal to the content, as above 
defined. For example, let us consider the set of rational points in the 
interval (0,1). These points can be arranged in enumerable order P l9 P 2 , 
P 3 , ...: now enclose P 1 in an interval of length e/2, P 2 in an interval e/2 2 , 
&c., P n in an interval of length e/2 n , and so on; the total content of these 
intervals cannot exceed e£l/2 n , or e; and this has the limit zero, as e is 
diminished towards zero. On the other hand, the content of the set of 
rational points is the same as that of the derived set; but this consists of 
all the points of the interval (0, 1), and is therefore unity. In general, 
any enumerable set of points can be contained in an infinite number of 
intervals, or cells, which have a total content that is arbitrarily small, 
and has the limit zero; whereas the content of the set is not in general zero. 

A completely satisfactory definition of the content of a set of points 
of the most general character should satisfy the condition of affording a 
consistent generalization of the notion of the length of a continuous linear 
set of points, or of the notions of area and volume, in the case of sets of 
points in two, or in three, dimensions. In the case of closed sets, the 
definition given above leaves nothing to be desired in this respect; but in 
the case of unclosed sets, the definition leads to consequences which are 
at variance with the fundamental properties of lengths, areas, and volumes, 
as understood for the case of continuous domains. If G 1 , G 2 are two 
complementary sets of points in the continuous interval (0, 1), then, in 
order that the contents of the sets G 1 , G 2 may accord with a generalization 
of the notion of length, their sum should be unity; however, when G 1 
and G 2 are unclosed, this condition is in general not satisfied by the de¬ 
finition given above. For example, if G x consists of the rational points, 
and G 2 of the irrational points, each of the two sets G lf G 2 has its content 
unity, the same as that of the interval (0, 1) itself. Again, let us consider 
an everywhere dense set of non-overlapping intervals contained in (0, 1); 
then the internal points of these intervals form an open set G X) of which 
the derivative consists of all the points of the interval (0, 1); the external 
and the end-points of the intervals forming a non-dense closed set G 2 . It 
will be shewn subsequently that the everywhere dense set of non-over¬ 
lapping intervals can be so chosen that the limit of the sum of their lengths 
is an arbitrary number l , where l is subject to the condition 0 < l £ 1; 
whereas the content of the set G x is, in accordance with the definition 
given above, always unity, and therefore may differ from the sum of the 
contents of the sets of points contained in the separate intervals. To 
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obtain the content of the closed set G 2 , cut off, from each of the intervals 
which define G ly the l/2wth part of its length at each end; the limiting 
sum of the intervals, so restricted, is l (1 — 1/n). Of these restricted 
intervals, a finite number can be so taken that their sum is > l (1 — lIn) — e, 
and < l (1 — Ijri), where e is an arbitrarily chosen positive number. All 
the points of G 2 are now enclosed in the finite set of intervals which is 
complementary to the finite set of restricted intervals. The sum of these 
complementary intervals < 1 — l (1 — 1/n) 4- c, and > 1 — l (1 — 1 jn)\ the 
sum has for its lower limit the number 1 — Z, which is therefore the content 
of G 2 . The sum of the contents of G x , G 2 is therefore not equal to unity, 
which is nevertheless the content of 

G x + G 2 ~ (0, 1). 


THE PROBLEM OF MEASURE 

120. The problem of assigning definite numerical measures to sets of 
points defined in linear, plane, solid, or p-dimensional, space is taken to 
require that the measure of a set shall satisfy the following conditions: 

(1) . The measure of a set shall be in accord with the usual notions of 
length, area, volume, of an interval, a rectangle, or a cell in three or more 
dimensions. 

Accordingly there must exist separate systems of measures for linear, 
plane, or p-dimensional, sets of points, corresponding to generalizations of 
the measures of length, area, or volume. 

(2) . The linear measure of the set of points in a linear interval (a, b) 
is taken to be b — a, whether the set includes neither, one, or both, of 
the end-points a and b, of the interval. 

The p-dimensional measure of the set of points in the cell 
(a* 1 *, a< 2) , ... a <*>; b <», 6< 2) , ... b<*>) 
is taken to be the product 

(&<i) - a {1) ) (ib < 2 > - a < 2 >) ... (ib <*> - a (J »), 
whether the set of points includes none, some, or all, of the points on the 
boundary of the cell. 

Thus, when p = 2, the measure of the set of points in the rectangle is 
in agreement with the ordinary measure of its area; when p — 3, the 
measure of the set of points in the rectangular parallelepiped is in agree¬ 
ment with the ordinary measure of its volume. 

It will be observed that, in accordance with this postulation, the 
p- dimensional measure of a ^-dimensional cell, when q < p, is zero. Thus 
a linear interval has the plane measure zero; and a plane rectangle has 
the three-dimensional measure zero. 
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(3) . The measure of a set of points is to be a number dependent on 
the set, such that the measure of the sum of two sets, which have no 
point in common, is the sum of the measures of the two sets. Jt then 
follows that the measure of the sum of any finite number of sets, no two 
of which have a point in common, is the sum of the measures of the sets. 
The measure of a set being regarded as a function of the set, is thus 
required to be an additive function, i.e. a function such that its value for 
the set E x -f E 2 is the sum of its values for E x and E 2 . 

It follows that, if a set F be a component of a set E , the measure of 
the set E - F is to be the excess of the measure of E over that of F. 
In case E and F have the same measures, the set E F is to have the 
measure zero. 

In particular, the p-dimensional measure of a (p - 1 )-dimensional set 
is zero. For such a set may be taken to be on the boundary of a />"dimen¬ 
sional cell; and, in accordance with (1), the measure of the set of points 
interior to the /^-dimensional cell is unaltered by addition of the (p 1)- 
dimensional set on its boundary; consequently the p-dimensional measure 
of this last set must be zero. 

(4) . The measure of the sum of an enurnerably infinite sequence of 
sets, no two of which have a point in common, is to be the limiting sum 
of the measures of the sets, whenever that limiting sum exists. This may 
be expressed as the postulation that the measure of a set shall be a 
completely additive function of the set. 

(5) . The measures of two sets such as are obtained from one another 
by a congruent transformation are to be identical. 

Congruent sets are such that, to each pair of points P t Q, of one of 
the sets there corresponds a unique pair of points P', Q\ of the other set, 
such that the distance of P' from Q' is the same as that of P from Q. 

In order that a system of measurement of sets may be set up, for each 
number of dimensions, which shall satisfy the above postulates, defini¬ 
tions will be introduced which will apply to a certain category of sets, 
called measurable sets, and which, for sets of this category, will be shewn 
to provide a function of the set that satisfies the above postulates. It 
will appear that the class of measurable sets, in linear, plane, or higher 
dimensional, space includes all the sets which are defined by certain 
prescribed methods. The possibility of extending the theory to cases of 
non-measurable sets will be left out of account*. 

* A definition of a non measurable set has been given by Van Vleck, Trans. Amer. Math. 
Soc. vol. ix (1908), p. 237, but thin definition requires an infinite number of arbitrary actR of 
choice. Such sets have also been defined by Lebesgue, Bull, da la sor. math, da France, vol. XXxv 
(1907), p. 209. 
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THE MEASURES OF OPEN AND CLOSED SETS 

121. A linear set of points O, open relatively to a linear interval (a, b) 
which contains the set, is the sum of a unique enumerable set of open 
intervals. 

In accordance with (3), the measure of O must be taken to be the 
limiting sum of the measures of the intervals of t his set. That this limiting 
sum is finite, and cannot exceed l , the measure of (a, b ), is seen by con¬ 
sidering the first n of the intervals, as arranged in order; the sum of 
these n intervals is less than l by the sum of the lengths of the finite set 
of intervals complementary to them; and this holds good however large 
n may be. 

Every open interval may be regarded as consisting of the points of an 
enumerable set of closed intervals, no two of which overlap one another, 
although they may have an end-point in common ; therefore any open set 
may be regarded as consisting of the points of an enumerable set of non¬ 
overlapping closed intervals. The measure of the open set can then be 
regarded as the limiting sum of the measures of such a system of non¬ 
overlapping closed intervals. 

In accordance with (1), it makes no difference that pairs of the intervals 
may have an end-point in common. 

If G be any closed linear set in (a, b ), the complementary set C (G) is 
open relatively to (a, b ), and in accordance with (2), the measure of G 
must be given by m (G) l m {(7 (G)\ ; where the measures of G and 
C (G) are denoted by m (G) and m [0 (G)\ respectively. 

122. It has been shewn in § 109 that, in space of any number of 
dimensions, an open set 0, in a cell (a, />), is constituted by the points of 
an enumerable set of closed cells. In accordance with (3), the measure 
of O must be the limiting sum of the measures of these closed cells. That 
this limiting sum is finite, and does not exceed the measure /, of the cell 
(a, b), is proved as in the case of a linear set. 

If G be a (dosed set, in the cell (a, b ), the measure of G must, in 
accordance with (2), be given by rn (G) -- l ~ m {C (G)}; the set C (G) 
being open relatively to (a, b). 

Since the set of closed cells, or intervals, which constitutes a given 
open set O is not unique, it must be shewn that the measure of O, defined 
as the limiting sum of the measures of the closed cells, or intervals, is 
independent of the particular sets of such cells, or intervals. In order to 
shew this, the following theorem, somewhat more general than is required 
for this particular purpose, will be proved: 

If an open set O consist of all the points of an enumerable set {A w }, of 
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non-overlapping closed cells , and if {A n '} be any other set of non-overlapping 
closed cells such that every jx)int of 0 belongs to at least one of the closed cells 
{A n '}, the limiting sum of the measures of {A/} cannot be less than that of 
the measures of {A n }. 

Assume, if possible, that the limiting sum of the measures of the cells 
of {A w } exceeds that of the measures of {A/}- A finite set A lf A 2 , ... A m , 

r “tit 

of the cells {A„}, may then be so determined that 2 m (A r ) is greater 

r 1 

than the limiting sum of the measures of the cells {A n '}. 

A cell A n ' containing A„' in its interior, and with the same centre, can 
be defined such that m (A n ') — m (A n ') — e/2 n ; where e is any chosen 
positive number; and this can be done for each of the cells {A n '}. The 
limiting sum of the measures of the cells {A n '} exceeds that for the cells 

r --- vi 

{A n '} by e. If e be sufficiently small, 2 m (A r ) is greater than the limiting 

r 1 

sum of the measures of the cells {A n '}. The set of points in A 1? A 2 , ... A m 
is a closed set G m \ and since each point of G m is interior to a cell of {A n '}, 
from the Heine-Borel theorem (§ 74) it follows that a finite set of the cells 
{A„'J can be determined so as to contain all the points of G m . The total 

r --- rn 

measure of the cells of this finite set cannot be less than 2 m (A r ); and 

r -1 

this is contrary to the supposition made above. 

It follows from this theorem that, if {A n }, {A,/} be two sets of closed 
non-overlapping cells, each of which constitutes a given open set 0, the 
limiting sums of the measures of the cells of each set must be the same; 
for, by the theorem, it is impossible that either of these should be less 
than the other. 

The unique yneasure of an open bounded, set 0 is thus defined * as the 
limiting sum of the measures of the closed cells , or intervals , of a set , all the 
jmnts of which constitute the set 0. 

The measure of a bounded closed set G is then defined to be the excess of 
the measure of the fundamental cell , or interval , in which G is contained , 
over the measure of the open set C(G), which is the complement of G with 
regard to the fundamental cell, or interval. 

All the points of the closed set G are in the finite set of cells, or intervals, 
complementary to the cells, or intervals, Aj, A 2 , ... A n . Thus m (G) is 
the lower limit of the sum of the measures of this finite set, as n is inde¬ 
finitely increased. It is thus seen that the measure of a closed set as here 
defined is identical with the content of such a set as defined in § 117. 
11 has been seen that any open set 0 contains a closed set whose measure 
is arbitrarily little less than m ( 0 ). 

* de la Vallet* Poussin, Integrates de Lebesgue, p. 22. 



122 - 124 ] Measures of Open and Closed Sets 169 

123. It follows, as a particular case of the theorem proved in § 122, 
that, if 0 2 be an open set which contains another open set O x> 

m (0 2 ) ^ m (Of). 

For, if O t is constituted by a set of closed cells {A„}, and 0 2 by a set of 
closed cells {A n '}, the limiting sum of the measures of the cells A n ' cannot 
be less than that of the measures of the cells A n ; and thus m (0 2 ) cannot 
be less than m (Of. 

// #i, 0 2 , ... 0 n , ... be a sequence of open sets, all contained in a finite 
cell, or interval , then 

m (Of) + m (Of) + ... + m (O n ) 4... ^ m [M (0,,0 2 , ... O n , ...)]. 

It has been shewn in § 56 that M (0 Y ,0 2 , ... 0 n , ...) is an open set; 
let {A m } be a sequence of closed cells which constitute it. Let O n be con¬ 
stituted by a set of closed cells {A nm }, where m ~ 1, 2, 3, .... 

The set of cells {A nm }, where n and m have all integral values, is an 
enumerable set of cells, the limiting sum of whose measures is 
m (Of) + m (Of) -f ... -b m (O n ) + .... 

By the theorem proved in § 122, this cannot be less than the limiting 
sum of the measures of A x , A 2 , ... A m , ...; or than the measure of 

M(0 x ,0 2 ,...0 n ,..f. 

It has here been assumed that if {c nm } is a double sequence of positive 

00 0O 

numbers, 2 2 c nm is unaltered by changing the order of the terms. 

n -1 m 1 

If 0 l ,0 2i ... 0 n , ... be a sequence of open sets , all contained in a finite 
cell , or interval, and such that no two of them, have a point in common, then 
m (Of) + m (Of) -b ... + m (Of) -b ... = m [M (0 l9 0 2 , ... 0 n , ...)]. 

In this case, no two of the cells A nm overlap one another, and the set 
M (0 X , 0 2 , ... 0 n , ...) 

is constituted by the enumerable set of cells A nm , where 
n — 1, 2, 3, ...; m — 1, 2, 3,_ 

00 

Both 2 m (Of and m [M (0 1} 0 2 , ...)] are equal to the limiting sum of 

n -1 

the measures of the cells A nw . 

It has thus been shewn that, for open sets, the measure is a completely 
additive function; thus satisfying the postulates (3) and (4), of § 120. 

THE CONTENT OR MEASURE OF A CLOSED SET 

124. In a linear interval, of length l, there can be [defined a set 
of non-overlapping intervals, everywhere dense in the interval, and such 
that the sum of the lengths of the intervals is less than an arbitrarily 
chosen number e. To establish this apparently paradoxical theorem, let us 
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consider an enumerable set of points x l9 x 2 , ... x n , ... everywhere dense in 
the interval; for example, the rational points of the interval. With each 
point x n as centre, an interval of length c/ 2 ” may be defined; any part 
of the interval which is not in the fundamental interval may be omitted. 
The overlapping set of open intervals obtained by taking all the values 
of n is equivalent to a non-overlapping set of open intervals A, of total 
measure less than c, which is everywhere dense in the fundamental interval. 
The points of that interval complementary to the set A form a non-dense 
closed set of content, or measure, greater than l - e. This can be made 
arbitrarily near to l, by taking e small enough. 

The following general theorem will be established: 

The content of a non-dense linear closed set is zero , in case the set is 
enumerable ; and , in case the set is unenumerable, its content may be zero , or 
may have any value less than the length of the interval in which the set is 
contained . 

The content, or measure, of a closed set is the sum of the measures of 
its perfect component and of its enumerable component; and this last is 
zero. If the set is enumerable, it has no perfect component, and therefore 
its content is zero. 

Let Sj, 8 2 , 8 3 , ... denote the lengths of the intervals contiguous to a 
non-dense closed set G, in a fundamental interval of length 1. 

Let 8 X — A*Z, 8 2 ^ A 2 (l ~~ 8d, 83 — A 3 (l — 8 X -- 8 2 ), ... 
and generally, 8 „ - A n (Z - 8 , - - ... - S n _d; 

where the numbers X l9 A 2 , ... A n , ... are all between 0 and 1 . 

We find at once 

and hence ^0 ^ 1 ) 0 ^ 2 ) ••• 0 ^n-i) ^ 

l - (8, } 8 2 i ... f 8 J (1 - A,) (1 - A 2 ) ... (1 - AJ l 
The content of the set G is therefore* l multiplied by the limit of the 
product (1 - AJ (1 A 2 ) ... (1 - AJ. 

The values of A,, A 2 , ... A„, ...-can be so chosen that the content of 
the set is zero; for example if \ -- A 2 A a _ 

If X 1 = 0 2 , A 2 0 2 / 2 2 , ... A n -= 6 2 jn 2 , ...; where 0 < 9 < 1 , the content of 
the set is l sin (tt6)/7t6. By choosing a sufficiently small value of 6, this 
may be made as near as we please to l. By proper choice of 6, the content 
may have any value less than l. 

125. Any linear closed set G that has interior points contains a non-dense 
closed set H, such that m (G) — rn (H) is less than an arbitrarily chosen 
number e. 

The set G may contain a finite, or an enumerable, set of closed intervals, 
* See fiftrnack. Math. Annalen , vol. xix (1882), p. 239. 
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Sj, 8 2 , ... _ Remove from G all the interior points of S n , except those 

belonging to a non-dense closed set g n , of content greater than m (8 n ) — e/2 w . 
The set which consists of G, with the interior points of all the intervals {8 W } 
removed, except the points of g 1 , g 2 , ... g n , ..., is a non-dense closed set //, 
such that m (G) - m (H) < e. 

Similar results hold for plane closed sets, or for closed sets in space of 
any number of dimensions. It will be sufficient to consider the case of 
plane closed sets. 

Tn a cell (a (1) , a {2) ; b {l \ b (2) ) an everywhere dense set of cells can be 
defined, of which the total measure is either that of the given cell, or has 
any less value > 0. 

In the linear interval (a (1) , b (1) ) an everywhere dense set of intervals 
{8 r ,} can be defined, as in § J24, of which t he total measure has a prescribed 
value ;> 0, and not exceeding b {l) — a {1) . Similarly a set of intervals {S n '} 
may be defined in the linear interval (a (2) , b {2) ), of total measure > 0, and 
not exceeding b {2) — a {2) . 

Consider a cell for which the projections of the sides on the two axes 
of coordinates are 8 n , S' n ,; where n and n are any pair of integers. The set 
of all such cells is everywhere dense in the fundamental cell, and the sum 
of their measures has a value > 0, and not exceeding the measure of the 
fundamental cell. The set of cells may clearly be chosen so that the total 
measure has any such prescribed value. The set of points which is com¬ 
plementary to the set of open cells so defined is a closed non-dense set, of 
which the measure may be zero, or may have any prescribed value less 
than the measure of the fundamental cell. 

A closed plane set G, that has interior points , contains a non-dense closed 
component //, such that m (G) — m (//) is less than an arbitrarily prescribed 
number e. 

It has been shewn, in § 106, that such a set as G contains a finite, or 
an enumerable, set of closed connex domains {/)„}. The interior points 
of D n are constituted by an enumerable set of closed cells. Hence the 
interior points of all the domains {/> n } together consist of an enumerable 
set of closed cells {A„}. We can remove from A n all its points except those 
of a non-dense closed set of measure > m (A rl ) — c/2 n . When this has been 
done for all the cells {A n }, we have remaining a part H , of G , such that 
m (G) — m (II) is less than e; for the sets of points removed have a total 
measure < Se/2 n , or e. The set H is non-dense in the fundamental cell. 

EXAMPLES 

1. The perfect set of points defined by x - ^ + ••• * where the numbers 

O U O n 

Cj, c 2 , ... have each one of the values 0, 2 (see Ex. 1, § 83), has the content zero. For the 
limit of the sum of the lengths of the complementary intervals is unity. 
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2. The non-dense closed set considered in Ex. 3, § 83, has the content zero. For, after 
k operations, the sum of the exempted segments is 

1 to -1 ( to - 1) 2 - l \* 


to-1 (to - 

r--=- 

m 


(to - l )* -1 

+ ... +- J —, or 1 - 


e-T 


When k is increased indefinitely, the limit of the sum of the free intervals is 1. 


3. The non-dense closed set considered in Ex. 4, § 83, has a content between 0 and 1. 
After k operations, the sum of the exempted segments is 

1 to — 1 (to - 1) (to 2 - 1) (to - 1) (to 2 - 1) ... (to*" 1 - 1) 

TO + TO 3 + TO 6 +•••+ m £fe(fe+l) * 


° r 

The limit of the sum of the exempted intervals is l - n (] - , and therefore the content 

Ar~l\ TO*/ 

of the set of points is n ^1 - , whioh is between 0 and 1, depending upon the value 

of ?n. By taking to sufficiently great, the content of the set may be made arbitrarily near 
to unity. 


THE EXTERIOR AND INTERIOR MEASURES OF A SET 

126 . Let G be any linear set of points in a given interval (a, b), and 
let a finite, or an infinite, set of non-overlapping open intervals be defined, 
such that every point of G is in one of these intervals. The point a is 
regarded as an interior point of any interval of which it is an end-point; 
and a similar remark applies to b. The set of intervals constitutes an 
open set which contains G. The sum, or limiting sum, of the lengths of 
the intervals has a definite value, not greater than 1. The lower boundary 
of this sum or limiting sum, for all possible such sets of intervals, is a 
number which is called t he exterior measure of the set G ; and this may be 
denoted by m e (G). 

Alternatively, the exterior measure m e (G), of the set G , may be defined 
as the lower boundary of the sum, or limiting sum, of the lengths of the 
intervals of a set of non-overlapping closed intervals such that every point 
of P is in a closed interval of the set, being either an interior point, or an 
end-point, when all such systems of closed intervals are taken into account. 

To shew the equivalence of the two definitions, let L x , L 2 denote the 
lower boundaries defined in accordance with the first, and the second, 
definition respectively. A set of non-overlapping open intervals con¬ 
taining G can be defined, such that the sum, or limiting sum, of their 
lengths is < -f e; where c is an arbitrarily chosen positive number. 
Since each open interval is constituted by an enumerable set of non¬ 
overlapping closed intervals, the set of open intervals can be replaced by 
a set of closed intervals which contain G ; and the limiting sum of the 
lengths of the intervals is the same in the two cases. This shews that 
L 2 ^ L lf since we have a set of closed intervals satisfying the requisite 




178 


125 - 127 ] Exterior and Interior Measures of a Set 

condition, the limiting sum of whose lengths is < L x + e. Again, it is 
impossible that L 2 < L x \ for if a set of non-overlapping closed intervals 
containing all the points of G existed, the limiting sum of whose lengths 
is < L X) we could enclose each of these intervals in an open interval, of 
length exceeding that of the closed interval by less than an arbitrarily 
chosen number, and in this way we could obtain an overlapping set of 
open intervals containing G , and such that the limiting sum of their 
lengths would be < L 1 . This overlapping set might be replaced by the 
equivalent set of non-overlapping intervals, and the limiting sum of the 
latter intervals cannot exceed that of the former, as has been shewn in 
the first theorem of § 123. We should therefore have a set of non-over¬ 
lapping opert intervals, enclosing 6r, the limiting sum of whose lengths 
would be < L l9 contrary to the fact that L x is the lower boundary of all 
such limiting sums. 

Let C (G) be the complementary set of G, relatively to the interval 
(a, b) of length Z, in which G is contained. If m e {C ((7)} denote the exterior 
measure of G (G), the number l — m e {C (G)} is taken to define the interior 
measure of the set G \ and this interior measure may be denoted by m t (G). 

127. In the case of a set G contained in a finite cell of any number 
of dimensions, a connex open domain takes the place of an open interval; 
and thus the exterior measure m e (G) is defined as the lower boundary of 
the limiting sums of the measures of connex open domains which form a 
finite, or an infinite, sequence containing all the points of 6 Y , when all 
such sets of connex open domains are taken into account. Again, since 
any open set is constituted by a sequence of closed cells, the exterior 
measure m e (G) may also be defined as the lower boundary of the limiting 
sums of the measures of sequences of closed cells, each sequence con¬ 
taining all the points of G in the interior and on the boundaries of the 
cells, when all such sequences are taken into account. That this second 
form of the definition is equivalent to the first may be shewn in exactly 
the same manner as in the case of a linear set; for every sequence of closed 
cells forms part of a sequence of open cells, such that the difference of the 
sums of the measures of the cells in the latter sequence and in the former 
is arbitrarily small. 

As before, the interior measure m, (G) of G is defined by 
m f (G) ee l - m e {C (G )}; 

where l is the measure of the cell in which G is contained. 

The above definitions of the exterior and interior measures of a 
bounded set of points in any number of dimensions are equivalent to the 
following statements: 

The exterior * measure m t (G) of a bounded set G is the lower boundary 
* This form of the definition is due to de la Vallee Poussin, see Integrates de Lebesgue, p. 22. 
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of the measures of open sets which contain G. The interior * measure ra* ( G) 
of a bounded set G is the upper boundary of the measures of closed sets con¬ 
tained in G. 

For, as the exterior measure m e {G (G)} is the lower boundary of open 
sets which contain G ( G ), it follows that l — m e {6 Y (6 y )} is the upper 
boundary of the measures of the closed sets complementary to these 
open sets. 

If a set (7j contains a set G 2i it is clear from the definition that 

m e (G,) w, (G 2 ). 


MEASURABLE SETS OF POINTS 

128. It appears from the above definitions that every bounded set of 
points has definite exterior and interior measures. 

When the exterior and interior measures of a set G, of points in p dimen¬ 
sions , are equal to one another , the set G is said to be measurable , and the 
number m e (G) = m t (G) is defined to be the measure of G. When G is 
measurable its measure is denoted by m (G). 

It is clear from this definition that, if G is measurable, so also is C (G). 

It will be shewn that this definition satisfies the conditions stated in 
§ 120, whenever it is applicable. If a is a set of non-overlapping cells, or 
intervals, enclosing a set of points G, measurable, or not, and p is a 
similar set for G ( G ), then all the points of the fundamental cell, or interval, 
are enclosed in the set made up of a and p. It follows that m (a) f m (P) & l ; 
and since m e (G), rn e {G (G)} are the lower boundaries of m (a), m (p), re¬ 
spectively, we have m e (G) -b m e {C ((*)} s l , and therefore m e (G) ;■ m t (G). 

The condition that a set G is measurable, in the sense above defined, 
may be stated in either of the following forms: 

A set of points G is measurable if an open set O, containing G, and a 
closed set H , contained in G , can be so determined that m ( O) — m (H) is 
less than an arbitrarily prescribed positive number c. 

A set of points G is measurable if its points can be enclosed in a finite , 
or an infinite , set a, of open intervals (in the case of linear sets), or of open 
connex domains (in the case of sets in space of higher dimensions ), and if 
C (G) can be similarly contained in a set such that the sum , or limiting 
sum , of the measures of the open intervals , or connex domains , which contain 
all those points which are common to a and ft is arbitrarily small. 

The set of points D (a, p) forms an open set (§ 56) which, if the con¬ 
dition be satisfied, may be taken to have its measure < e. As is easily 
seen by considering the three sets a, p, D (a, p), we have 
m (a) + m (£) = m {D (a , p)} + l ; 

* See W. H. Young, Proc. Lond. Math. Soc. (2), vol. n (1905), p. 28. 
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where Z is the measure of the cell, or interval, in which the given set is 
contained. It then follows that 

{m (a) ~ m t (G)} 4- {m (p) - m e \G (0)J) < c. 

Since m (a) m t (G), m (/?) — m P \ G (G) ] are not negative, it follows that 
each of them is < e. ; also m (a) - tn ( . (G) m, (G), hence m e ( G ) — ni t (G) < e. 
Since this holds for every value of €, we have m e (G) — m, ( G ); and thus 
the condition stated is sufficient that the set G may he measurable. If G 
be assumed to be measurable, a and can be so chosen that 

m (a) m (G) < 

and 7n (ft) 7ri [0 (G )] < \e\ 

we then have m (a) 4- m (fi) < e 4 Z, which is equivalent to the condition 
that m {D (a, /3)} < e; and thus the condition is necessary. 


129. It must be shewn that the definitions, given in §§ 120 and 122, 
of the measures of open, closed, or partly closed, cells or intervals, and of 
open and closed sets, are consistent with the definition of a measurable 
set given in § 12S, that is, that these sets have their exterior and interior 
measures equal to one another. 

The cell (a (1) , a (2) , ... a (p) ; b ll) , 6 <2) , ... b (v) ) is contained in the cell 


a 


(i) 


1 

271 




- a 11 '), ««-» - J a‘ s >), 

/,(i> f 1 (//i) _ a <D) 1 

2n 2 n 


a (2> ), 


and contains the c?ell obtained by writing — n instead of n . 


The measures of these cells are ^1 4- Z, ^1 — Z, respectively, and 


thus differ from one another by < e, if n be chosen sufficiently large. 
Therefore the exterior and interior measures of a cell are identical with Z, 
the measure of the cell, as defined in § 120. 

It is clear that the exterior measure of an open set O is identical with 
the measure of the set, as defined in § 121, for it has been shewn that 
no open set of measure less than that of the set can contain O. Also O 
contains the closed set formed by a finite number of the closed cells, or 
intervals, which constitute O, and the sum of the measures of these cells, 
or intervals, converges to the measure of O ; thus O is contained in an 
open set, consisting of 0 itself, and contains a closed set, such that the 
difference of the measures of the open and closed sets is arbitrarily small; 
therefore both the exterior and interior measures of O are identical with 
the measure of 0 , as defined in § 121. Since a closed set is the complement 
of an open set 0, it follows that the closed set is measurable. It has thus 
been proved that: 

Every bounded open set , and every bounded closed set , is measurable . 
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It will now be shewn that: 

Every enumerable set of points is measurable , and its measure is zero. 
Let Pj, P 2 , ... P ry ... denote the points of the set. Each point P T can 
be enclosed in an open cell, or interval, with P r as centre, and of measure 
c/2 r . In accordance with a theorem proved in § 123, the open set of points 
consisting of all the points belonging to one or more of these cells, or 

CO 

intervals, has its measure 2 e/2 r , or < e. It follows that the exterior 

r- 1 

measure of the set is zero, and thus that the interior measure is also zero, 
and that the set is measurable, with measure equal to zero. 

i3o. if a 19 G 2 , ... G n , ... is an enumerable (or a finite) sequence of 
measurable sets, all contained in a finite cell , or interval, the set 

M (G l9 G 2y ...G ny ...) y 

which consists of all points that belong to one or more of the sets , is measurable. 

Let G 1 and C (Gf) be contained in open connex domains, or open 
intervals, of sets a l9 ft, of such domains, or intervals, so defined that the 
measure of D (a x , ft) is < Let a 2 , ft be similar sets of connex open 
domains, or intervals, which contain G 2 and C (G 2 ) respectively, and are 
such that m [D ( a 2 , ft)] < e/2 2 . Let a 2 = D (a 2 , ft), and ft' =•= D (ft, ft). 
For G z and C (Gf) we similarly consider sets cc 3 , ft, of open domains, or 
intervals, such that m \D (a 3 , ft)J < e/2 3 ; and let 

<h'~D(a 3 , ft'), ft'(ft, ft'); 

and so on, for all the sets 6r 4 , (? 5 , — The points of M (G l y G 2i G 2i ...) are 
all contained in the sets of domains, or intervals, a ly a 2i af y and 
C {M (G ly G 2f ...)} is contained in ft', whatever value i may have. 

The two sets of domains, or intervals, which enclose M (G ly G 2 , ...) 
and its complement, have in common a set of points of which the measure 
is less than 

e/2 4* e/2 2 4- ... 4 - c/2* 4 - m (a i+1 ) 4~ m (a ,- +2 ) + .... 

The series 2m (a) is convergent, since each term is positive, and the 
sum of any number of terms is less than a fixed finite number. Therefore 
the number i can be so chosen that m (a i+1 ) -f m ( a ' i+2 ) -f ... is less than e. 

Now M (G x , G 2 , ...) and its complement have been enclosed in sets of 
intervals, or cells, such that the set common to both sets has measure < 2e. 
Since e is arbitrary, it has been proved that M (G lf G 2 , ...) satisfies the 
criterion that it is measurable. 

If G x ,G 2 , ... G n , ... be measurable sets in linear or higher dimensional 
space , all contained in a bounded domain , and no two of the sets have a point 
in common , the measure of G x + G 2 + ... is the limiting sum of the measures 
of the sets. 
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That G x ~b G 2 -I- ... is measurable is a particular case of the last 
theorem. 

In accordance with what has been shewn above m (G x f G 2 + ...) 
differs from the measure of the set consisting of all the sets a lf « 2 ', af, ... 
by less than e . Also G 2 , G 3 , ... are all parts of G (G x ); so that G x is enclosed 
in a l9 G 2 in a 2 ', G 3 in af, ...; lienee m(G 1 ) differs from m(a l ) by less than le , 

rn (G 2 ) differs from m ( a 2 ) by less than * e, and in general m ( G t ) differs 

eL ** 

OD 

from m («,') by less than c/2*. Therefore S in (G t ) differs from 

i- 1 

m (%) -h m (a 2 ') -I- ... h m (a/) + ... 

00 

by less than e. It follows that S m (6rJ and m (G x + G 2 [ ...) differ from 

i 1 

one another by less than 2e; and, since e is arbitrarily small, the theorem 
is established. 

It should be observed that although, for each set (7<, a pair of sets of 
intervals a t , ^ can be so determined that the measure of D (a lt P t ) is less 
than the prescribed number e/2*, there are an infinite number of such 
pairs a iy f$ t which satisfy this condition, and one such pair is selected out 
of this infinite number. It is however assumed, in the above proof, that 
an indefinite number of such selections is made, namely one for each 
value of i. It may not be possible to give a law which determines how 
this selection is to be made for every value of i , unless the given sets are 
of some particular nature such that we are able to assign such a law. 
Consequently, the existence has been assumed of the sets a i9 (3 ti for 
i -= 1, 2, 3, ..., independently of whether we are able to define these sets 
or not. This assumption is a particular case of an axiom which will be 
discussed in Chapter iv, and which is known as Zermelo s axiom, or as 
the general principle of selection. A corresponding remark is applicable 
to another proof of the above theorem which has been given* by de la 
Vallee Poussin. This proof depends upon the fact that a measurable set G n 
is the sum of a closed set F n and of a set e n , of which the exterior measure 
is less than an arbitrarily small pumber e n . For an enumerable sequence 
of sets, the choice of the sets F n requires the use of Zermelo’s axiom 
(see § 197). 

131. If a measurable set G x contains another measurable set G 2 , the set 
G 1 — G 2 is measurable , and its measure is m (G L ) m (G 2 ). 

The complement of G 1 — G 2 consists of G 2 together with C ( G j), hence 
G x — G 2 is measurable. Further, since G 1 ==■ (G x — G 2 ) t G 2> we have 
m (G x ) =m{G 1 - G t ) + m (G 2 ). 

* See Integrates de Lebesgue , pp. 22-25. 
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If G lt G 2 , ... G n , ... are all measurable sets , the set D (G x , G 2 , ... G ni ...) 
0 / points common to all the given sets is also measurable. 

For the complement of D (G 1 , G 2) ...) is M {C (G x ), C (G 2 ), ...}; and 
since C (G x ), G (G 2 ), ... are all measurable, it follows that the complement 
of D (G t , G 2 , ...) is measurable, and therefore that the set itself is 
measurable. 

If * H is the set of points each of which belongs to an infinite number of 
the measurable sets G x , G 2 , , ..., the set H is measurable. 

For the set C (H) consists of those points which belong to none, or 
only to a finite number, of the sets G 1 ,G 2 , ...; and hence C ( H) consists 
of the points belonging to one or more of the sets L ly L 2 , ... L n , ..., where 
L n denotes the set D {C (G n ), C (6r n+1 ), ...}. The sets L n are all measurable, 
and hence G ( H) is measurable; and therefore II is measurable. 

If * K is the set of points each of which belongs to all the measurable sets 

G n , G n + lt ..., where n has a definite value for each point of K, the set K is 

measurable. 

For the set C (K) is the set of points each of which belongs to an infinite 
number of the measurable sets G (G x ), G (G 2 ), ..., and hence, by the last 
theorem, C (K) is measurable. Therefore K is a measurable set. 

If GL is the inner limiting set of a sequence {6r n } of measurable sets G n , 
each of which contains the next , m {GJ) = lim m ( G n ). 

?l~GO 

For (G y - G t ) + (G\ - G,) + ... = Gy - G m 

and m (G n - G n+l ) = m (G n ) - m (G n+1 ), 

hence m (G^) — lim m (G n ) = m (Gy - GJ) — m (Gy) — m (GJ), 

n~co 

and therefore m (GJ = lim m ( G n ). 

n~oo 

If G w is the outer limiting set of a sequence {G n } of measurable sets G n , 
each of which is contained in the next , and if is a bounded set f 

m (G,.,) = lim m (G n ). 

n~oo 

For G x 4- (G 2 — G x ) + (G z — G 2 ) + ... = G^ 

and therefore m (G u ) = lim m (G n ). 

n*~ oo 

If t G lt G 2i ... G ni ... be a sequence of sets such that 

lim M (G n , G n+1 , ...) = lim D (G n , G n+1 ,...), 

n—co n~oo 

then, if either of these limits be denoted by G»,m (G w ) = lim m (G n ). 

n~oo 

* See Borel’s Lemons sur les fonctions de variables rdelles, p. 18. The set H is named by Borel 
the “ensemble limite complet,” and the set K the “ensemble limite restreint” of the given 
sequence of sets. 

f See de la Vallde Poussin, Trans. Amer. Math. Soc. vol. xvi (1915), p. 437. 
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The sets M ( G n , # n+1 , ...) are such that each contains the next, and 
therefore m ( 0J) = lim m [if (6r n , 6? n+1 , ...)J ^ lim m (G n ); and the sets 

n-~co n~ oo 

D(G n ,G n+1 ,...) 

are such that each is contained in the next, and therefore 

m (6r w ) — lim m [D (G ni G n+lf ...)] ^ lim m (G n )\ 

oo jA-*-oo 

we thus conclude that m (GJj = lim m (G n ). 

n —qo 

In general lim if (6? n , <2 n41 , ...) is called the upper limit of 6? n , and 

7l~00 

may be denoted by lim6 y n ; and lim D (G n , G n+1 , ...) is called the lower 

n~QO 7l~00 

limit of G n , and may be denoted by lim G n . In the case contemplated 

W~ao 

in the theorem, 6r w has a unique limit denoted by lim (^ n , or 6r w . 

oo 

SETS THAT ARE MEASURABLE (B) 

132. All the sets which have been shewn to be measurable, in accor¬ 
dance with the definition of a measurable set given in § 128, are obtained 
from the single point, the single interval, or cell, open or closed, by taking 
a finite, or enumerably infinite, set of such fundamental sets, and by 
taking the set common to a finite, or an enumerably infinite, number of 
the sets so obtained, or by taking the complements of the sets so ob¬ 
tained. All sets defined in this manner are said to be measurable (B), 
since they were the only kind of measurable sets contemplated by Borel 
in his original treatment of metric properties. Thus the complement of 
any set that is measurable (B) is so also; the set which is common to any 
finite, or enumerably infinite, number of sets all measurable (B) is also 
measurable (B); also closed and completely open sets are all measurable 
(B). It can be shewn however that measurable sets may exist which are 
not measurable (B). For example, any component whatever of a perfect 
set, of measure zero, has its external measure zero, and is therefore 
measurable. The question whether a valid definition can be given of a 
set which is not measurable will not be here discussed, as the question of 
the validity of such definitions depends upon debatable questions in the 
abstract theory of aggregates. The relation of measurable sets in general 
to those which are measurable (B) is contained in the following theorem: 

Any measurable set G is contained in a set G x which is measurable (B), 
and such that m (GJ == m (G); and G contains a set G t which is measurable 
(B), and such that m (G 2 ) = m (G). 

If {e n } denote a sequence of decreasing positive numbers converging 
to zero, the set G can be enclosed in a set of non-overlapping intervals, or 
open domains, {a n } of total measure < m (G) + € n . 
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The inner limiting set G x of the sequence of sets of intervals, or domains, 
is measurable (B), and its measure is m (G), as has been shewn in § 131; 
also G 1 contains G as a component. The set G x — G has measure zero, and 
its points can be enclosed in a set of intervals, or domains, /3 n , contained 
in a n , and therefore of measure < e„. The inner limiting set 77, of all the 
sets is measurable (JB), and its measure is zero; and the set G — II = G 2 
is measurable (B), and has m (G) for its measure. Moreover G 2 is contained 
in G. 

A classification of sets of measure zero has, on account of the funda¬ 
mental importance of such sets in the theory of functions, been under¬ 
taken* by Borel. 


CONOR (TENT SETS 

133. The definition of the measure of a set, given in §128, having 
been shewn to apply to the class of measurable sets, it has been shewn 
that, so far as such measurable sets are concerned, we have a metric 
system which satisfies the postulates (1), (2), (3), (4), of § 120, namely 
that the measure of such a set is a completely additive system, and that 
it is in agreement with the elementary theory of the measure of lengths, 
areas, and volumes, of intervals and cells. It only remains to be shewn 
that (5), of § 120, is satisfied by the definition which has been introduced 
and developed. 

A congruent transformation of a set of points is, expressed in geo¬ 
metrical language, equivalent to a translation together with a rotation. 
That a translation does not affect the metric properties is obvious from 
the fact that it only consists of an addition of fixed numbers /q, h 2 , ... h p 
to the numbers z Jf x 2 , ... x p which represent a point; and in all the defini¬ 
tions and proofs connected with the metric properties of a set, only the 
differences x/ — x ly x 2 — x 2 , ... for pairs of points are involved. So far 
as a transformation by rotation is concerned, it is only necessary to shew 
that a cell when rotated has a measure which is equal to that of the original 
cell. It will be observed that all the cells employed in the descriptive 
properties of sets, and so far in the metric theory, are orientated alike. 
We cannot therefore, ab initio , consider the set of points of a cell, as having 
its measure unaltered when the cell is rotated, and when it is therefore 
no longer a cell in the original sense of the term. For simplicity the case 
of the rectangle, or plane cell, will here be considered. 

The original cell may be taken to be the set of points (x, y) for which 
— a < x g a, — b s y £ b. 

Corresponding to {x, y) we have in the congruent set, obtained by rota¬ 
tion through an angle a, x' = x cos a + y sin a, y' = — x sin a -f y cos a. 

* See BuU. de la soc. math, de France, vol. xli (1913), p. 1. 
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Thus the points of the cell correspond to the points of the three sets 
defined by 

- sec a (a -f y sin a) x <• cosec a (b -f y cos a) 

— a sin a — b cos a y s — a sin a + b cos a, 
cosec a {y cos a — b) % x < cosec a (y cos a f b) 

— a sin a 4- b cos a ^ y ^ a sin a — b cos a , 
b cos a — a sin a ^ y s a sin a -f b cos a 

— cosec a (b — y cos a) < x ^ sec a (a — y sin a), 

provided tan a > bja. 

In the first of the above sets, divide the interval of y into n equal parts; 
and consider the part for which y is in one of these parts (— c r , — c r+1 ), 
where c r < c r )1 . The corresponding part of the set includes the rectangle 
of which the measure is 

(c rU — c r ) {a sec a -f b cosec a |- c r (tan a -h cot a)}, 
and is included in the rectangle of which the area is 

(c r+1 — c r ) {a sec a + b cosec a + c r+1 (tan a -f cot a)}. 

The difference of these is (c r . n — c r ) 2 (cot a {- tan a). 

The sum of all such differences is less than n~ 2 (cot a + tan a) 4b 2 cos 2 a , 
which can be made arbitrarily small by sufficiently increasing n. The 
measure of the set is then easily shewn to be 2 b 2 cot a. Similarly it can 
be shewn that the measures of the other two sets are 

(a sin a — b cos a) 4b cosec a, and 2 b 2 cot a ; 
thus the sum of the measures of the three parts is 4 ah, which is the 
measure of the original cell. 

The case in which tan a has a value < bja may be considered in a 
si milar manner. The method here adopted is clearly capable of extension 
to the case of a cell in any number of dimensions. 

THE MEASURE OF UNBOUNTDED SETS 

134. Let 0 be an unbounded set of jp dimensions, and let it be such 
that the component of G which is in any finite ^-dimensional cell is 
measurable. Consider a sequence of such cells, each of which is contained 
in the next, and such that if (a w (1) , a. n {2 \ ... a n (v) \ b n a) ,b n {2) , ... b n ip) ) is 
the nth cell of the sequence, a n a) , a n {2) , ... a n (v) all have the limit — oo, 
and b n {1) ,b n {2) , ... b n {p) all have the limit 4- oo, as n is indefinitely in¬ 
creased. If G n be the component of G in this cell, the sequence of numbers 
m ( G n ) is non-diminishing. Consequently m ( G n ) either converges to a 
finite limit, or becomes indefinitely great, as n is indefinitely increased. 
If lim m (G n ) is finite, the set G is regarded as measurable, and m (G) is 

n~oo 

defined to be equal to the limit. If the limit is indefinitely great, G is 
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measurable, but of infinite measure. If H be any bounded measurable 
set of points, it is contained in the nth. cell, if n be sufficiently great. The 
set D (G, H) is the same as D ( G n , H), and this set is measurable. Also 
m [D (G, H)] <. m (G n ). If H be identified successively with the sets H r 
of a sequence, such that H r is contained in H r+1 , we have 
m \D (i G , II r )] £ m (G nr ) s m (G), 

where n T is an integer corresponding to r. The sequence of numbers 
m [D (G, H r )] is monotone, non-decreasing, and has a limit ^ m (G). This 
holds good, even if lim m (H r ) is indefinitely great. Thus the part of G 

r~oo 

that is contained in any measurable set H is measurable, if G be so, and 
its measure does not exceed m (G). 

The theorem m (6^) + m (G 2 ) -f ... =- m (G x 4- G 2 4- ...), where G lf G 2 , ... 
are measurable sets, no two of which have a point in common, and such 
that G x + G 2 -f ... has a finite measure, holds good when G lt G 2 , ... are 
unbounded measurable sets, of finite measure. For, if A n denote the cell 
employed above, we have 

m [D (A n , GJ] + m [D (A n , <?,)] + ... - m [D (A,, G x -f G 2 + ...)]. 

If r be so large that m (G r . n ) 4- m (G r+2 ) ... < e, we have also 

m [D (A n , G r+1 )] + m [D (A n , G r+2 )] + ...<€, 
for every value of n. 

Hence, we have 

m [B (A n , GJ] + m [D (A n , G t )] + ... -I- m [D (A n , G r )] -i- 6 n e 

= m[D (A n , G, + G 2 + ...)], 

where 9 n is such that 0 < 6 n < 1. 

It follows, by letting n increase indefinitely, that 

m (GJ + m (G 2 ) + ... + m (G r ) -f- Oe = m (G x + G 2 + ...), 
where 0 is such that Qg0 & 1. Since e is arbitrary, it follows that 
m ((rj) 4- m (G 2 ) 4- ... converges to m (G 1 4- G 2 4- ...). 

It is easily seen that, if the unbounded measurable set G is the outer 
limiting set of a sequence {< 0 „} of bounded measurable sets, each of which is 
contained in the next , then m{G) = lim m (G n ). 

n~co 

THE MEASURE OF SETS RELATED TO A SYSTEM OF SETS 

135. It has been shewn in § 131 that, for a sequence {6r n } of measurable 
sets, each of which contains the next, if m (G n ) is, for every value of n , 
greater than a fixed positive number C , the measure of the inner limiting 
set is £ C . 

In case the sets G t , G 2 , ..., each of which contains the next, are not 
assumed to be measurable, the corresponding result holds as regards the 
interior measures of the sets. We can obtain the following theorem: 
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If * Oi, G 2 , ... be a sequence of sets of points, each of which contains the 
next , and if the interior measure of each set is greater than a fixed positive 
number G , then the interior measure of the inner limiting set is £ C. 

Closed components, P l9 Q 2 , Q 2 , ... of the sets G 1 ,G 2 ,G 3 , ... can be so 
determined that 

m (Pi) > to, (ft) - | C, TO (ft) > TO, (ft) - 2 ’ 2 €, TO (ft) > TO, (ft) - 2 3 e,...; 

where € is an arbitrarily chosen positive number. The set Q 2 has a closed 
component P 2 = D (P l7 Q 2 ), of measure 

m (P 2 ) - m (Pj) + m (Q 2 ) - m [M (P 1 , Q 2 )]. 

Since M (P l7 Q 2 ) is a closed set contained in G l7 its measure does not 
exceed rn i (6^); hence we have 

TO (P 2 ) 5= TO (P x ) + TO (ft) - TO, (ft) £ TO, (ft) - (| + ^ e. 

Next, by considering P 3 = D (P 2 , Q 3 ), it can be shewn as before that 
TO (P 3 ) > TO, (ft) ~ (g + 2^ + 2 3 ) e ’ 

and so on. We have now a sequence of closed sets P lt P 2 , P 3 , ..., each 
of which contains the next, and such that the measure of each of them 
is > C — e. Therefore the measure of P w , their inner limiting set, is 
^ G — e; and P w is a component of 6r w . It follows, since e is arbitrarily 
small, that ra t (6^) r C. 

136. We are now in a position to establish the following general 
theorem: 

If * G 1} G 2 , ... G n ,... be a sequence of sets of points , each of which sets 
is a component of a closed set, of finite content l , and if the interior measure 
of each of the sets G ly G 2 , ... G n , ... is greater than a fixed number G, then 
there exists a set of points, of interior measure ^ G, and of the power of the 
continuum , such that each point of the set belongs to an infinite number of 
the given sets. 

Choose a closed component of each of the given sets, of content > C; 
let these components be Q x , Q 2 , ... Q n , — Choose an integer m such that 
mC ^ l < (m + 1) G, and let us consider the first n (> m + 1) of the sets 
Qi, Q 2 , .... The points common to any pair of these closed sets form a 
closed set, and the set which contains all the points which belong to at 
least two of the n closed sets is also a closed set Q h n of content I ln . Those 
points of Q hn which belong to Q 1 form a closed set of content ^ I ln ; hence 
there is a set of points of Q x , of measure ^ 1 (Qj) — I ln , which do not 
belong to any of the sets Q 2 ,Q Z , ... Q n \ and the measure of this set is 
> G — I ln . Similarly, each of the sets Q 2 , Q 3 , ... Q n has a component of 

* W. H. Young, Proc. Lond. Math. 80 c. (2), vol. n (1905), p. 25. 



184 


Metric Properties of Sets of Points [oil in 


measure > G — I ln , consisting of points which do not belong to any of 
the other sets, or to Q 1 . The measure of all these sets added together is 
> n (C — I ln ) ; and must be less than l , since the sets do not contain 
any points common to two of them, and they are all enclosed in a set 
of measure l. Hence 

n ((' - » < (m 4 1) C, or I ln > (l - ™ ^ ^ C. 

It has thus been shewn that the closed set Q lt n has the power of the 
continuum, since its content is proved to be positive; and this holds for 
every value of n which is > m -( 1. Considering now the next n sets 

1 — j C, con- 

71 

sisting of points each of which belongs to two at least of the sets; and a 
similar result holds for each system of n sets Q rrH1 , Q rn < 2 > ••• Qir+i)n- 


We have now an infinite sequence of closed sets 
Ql, ti > Q‘2,ny $ 3 , Qr,n' ••• 


each of which has content ^ 1 - m J ^ 6\ and the points of each of 

them belong to two at least of the given sets. By applying similar 
reasoning, and taking ri sets at a time, we see that there are an infinite 

number of sets each of content > (l — 


w 4 1 
n 


m + 1 


C t and such 


that each point of any one of them belongs to four at least of the given 
sets. Proceeding in this manner, we obtain sets of points, each of content 


and such that each point of each set belongs to at least 2 a+1 of the given 
sets. Now let w, n', ... ?/ (s) be so chosen, that 

7n -f 1 1 m 111 m t 3 1 

n < 2 £ ’ n’ < 4 e ’ n< s > < 2-' 1 e; 


then the content of each of the sets which contains points belonging to 
2 <Hl at least of the given sets is 


> C 


(‘ - 2 ‘)(' - 1 ') - (' 2».') >C ! 1 - '(2 


2 ^ 4 


4- 


±)\ 

2*+V) 


> 0(1 — e). 


The process can be carried on without limit; and we see that the set 
which consists of all points belonging to 2* +1 at least of the given sets 
contains closed components of content > C (1 — e). Considering the 
sequence P x , P 2 , ... of sets such that P x contains all points that belong 
to two at least of the given sets, P 2 contains all points that belong to 2 a 
at least of the given sets, and so on, it is clear that contains P 2 , and 
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P 2 contains P 3 , etc. But the interior measure of each set is > C (1 — e); 
hence, in accordance with the theorem of § 135, there exists a set of points 
common to all the sets J \, P 2 , of interior measure ^ C (1 — e). This 

set consists of points which belong each to an infinite number of the given 
sets; and its interior measure is ^ C , since e is arbitrarily small. The set 
has the power of the continuum, since it contains closed components of 
content greater than zero. It should be observed that the employment 
of the sets {Q n } involves an infinite number of acts of choice. 

The theorem that has been now established is of considerable im¬ 
portance on account of the applications of it which can be made in various 
parts of the theory of functions; it is due* to W. H. Young. That par¬ 
ticular case of the theorem in which the sets are all measurable was first 
statedf, without proof, by Borel. 

An important case of the theorem arises if we suppose each of the 
sets to consist of a finite, or an enumerably infinite, set of closed intervals; 
in which case the sets are all measurable. The theorem may then be stated 
as follows: 

If there he given an infinite number of sets of intervals, in a finite segment , 
each set consisting of a finite , or an enumerably infinite , number of non- 
overlapping intervals , and if the measure of each set of intervals is greater 
than some fired positive number (J, them there exists a set of points , having 
the, power of the continuum , and of measure > C, such that each point of the 
set belongs to an infinite number of the given sets of intervals. 

This theorem contains the completion, and generalization, of a theorem 
due to Arzelaf which is stated by him as follows: 

Let y 0 be a limiting point of any set of numbers (y), and let 

(h - (y\<y 2 < •••) 

be a sequence of numbers of (y) which converges to the limit y 0 . Assuming 
tlie variables to be orthogonal coordinates of a point in a plane, let the 
set of straight lines y - y x , y — y 2 , y — y 3 , ..., be drawn, and let a set of 
intervals be taken on the portion of each of these straight lines which is 
in the interval (a, b ), of x. Suppose that each set of intervals is finite in 
number, and that this number is variable from one straight line to another, 
but increases indefinitely with the index s , of y 8 . Let the sum of the 
intervals 8 ltS , 8 2tS , ... 8 n , s on the line y y s , be d 8 . If for every value 
of s, d s is greater than C, a determinate positive number, there necessarily 
exists at least one point x 0 in the interval (a, b), such that the straight 
line x — x 0 intersects an infinite number of the intervals 8. 

* Proc. Lond. Math. Soc. (2), vol. n (1904), p. 26. 

t Compte.s Ilendus , vol. cxxxvn (1903), p. 966. 

t Rend. delV Ac. dei Lincei, (4), vol. x (1885), p. 637; a second proof, which is however not 
rigorous, has been given by Arzelk in the Memorie della ; R. Acc. d. Sc. di Bologna , (5), vol. vm (1899). 
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Arzela subsequently removed the condition that each set of intervals 
is a finite one. 

VITALl’S THEOREM 

136 1 . An important theorem has been given* by Vitali which may be 
regarded as an extension, in relation to metric properties, of the descriptive 
theorem of Heine-Borel and its extensions, discussed in §§ 73-77. 

The following preliminary theorem will be first established: 

Let C denote a set of cells in p dimensions , not necessarily enumerable , 
such that each cell has its sides of ecjual length, and such that the measures 
of the cells have a finite upper boundary k x . Let it be assumed that each cell 
of C has at least one point in common with each cell of some non-finite set 
of cells , all belonging to C. Let H be the set of points equivalent to C; and 
let it be assumed that H has its measure finite. Then a finite set of cells 
exists , all belonging to G , such that no two cells of the finite set have a point 
in common , and such that the sum of their measures is greater than 

; * p m (H) - e, 

where e is a prescribed positive number. 

A cell Cj , belonging to C , exists such that m (cf) > l\ — e; let C 1 

denote the part of C such that each cell of C x has at least one point in 
common with c x ; such part exists by hypothesis. The points of the set L t , 
equivalent to G x , that do not belong to c x have a measure ^ (3 P — 1 )k 1 . 
Let k 2 be the upper boundary of the measures of the cells C — C x ; then 
there exists in C — C x a cell c 2 , which necessarily has no point in common 

with c x , and of which the measure is > k 2 — e. Let C 2 denote the part 

of G — G x of which each cell has at least one point in common with c 2 . 
The set of points L 2 equivalent to G 2 that do not belong to c 2 has measure 
^ (3 P — 1) k 2 . Proceeding in this manner, we form a sequence c x , c 2 , c 3 , ... 
of cells such that their total measure is not greater than m (H); and 
therefore « f N 

00 

Since S k n is accordingly convergent, we have lim k n — 0; it then follows 

n - 1 n ~oo 

CO 

that G = S G n , for if a cell c existed in G which for no value of n belongs 

n ~ 1 

to G n , every value of k n would be £ m (c), and thus k n would not converge 
to zero. Let C denote the sum of c 1} c 2 , ... c n , ..., and let Z denote the 

* Atti d. Jt. Accad. d. Torino, vol. xliii (1908), p. 229. A somewhat different, and very general, 
statement of the theorem is given by Carathcodory, Vorlesungen fiber reelle Funktionen (1918), 
p. 299. 




136 1 ] Vital?s Theorem 187 

set of points equivalent to the totality of the sets L x , L 2 , .... Then 
H = (J -{- L \ also m (C) < 2 k n) m (L) £ (3* — 1 ) 2 k n , and therefore 

n ■ 1 n - 1 

m (//) < 3 P 2 & n . 

n =1 

_ 00 

Also we have m (C) > 2 k n - Jc, and therefore 

n “1 

»» (<?) > p ™ (#) - if- 

An integer % can be so chosen that m (c x -f e 2 ... f c nj ) > m (0) — \e\ 
then m (c x ) -h m ( c 2 ) + ... -I- m (c nj ) > m ( H) — c; thus the required cells 
c x ,c 2 , ... c ni exist. 

The main theorem given by Vitali is the following : 

Let C denote a set of cells in p dimensions , each of which has sides of equal 
length. Let C {n) denote that part of 0 which consists of cells each of which has 
its measure < r/, such part being assumed to exist for each positive value of 77 ; 
and let G be the inner limiting set of the sets of points equivalent to 
... C^n\ ..., where {g n } is a diminishing sequence of numbers 
converging to zero; and let G be assumed to have finite measure. Then 
an enumerable, or a finite, set of cells exists, all belonging to C , such that no 
two of them have a point in common, and such tiuit their total measure 
is s m (G). 

Since G has finite measure, the set of points equivalent to C {ri) must 
have finite measure, provided 77 is sufficiently small. There exists therefore 
a part C 19 of C , consisting of a part C iri) , for some value of 77 , such that, 
if H x is the set of points equivalent to G x , m (H x ) has finite measure. In 
accordance with the last theorem a set c x , c 2 , ... c ni , of cells belonging to C x 
exists such that no two of them have a point in common, and such that 

^ m (c r ) > m (tf,) - h > Jp ™ ( G ) ~ h- 

First, let it be assumed that every cell of C x has a point in common 
with one at least of the cells c x , c 2 , ... c ni . If, for some sufficiently small 
value of £, CM contains no cells partly outside the cells c x , c 2 , ... c ni , all 
the points of G must be in one of the cells of this finite set; and in that case 

r -=7i, 

2 m ( c r ) > m (G), 

r ■= 1 

and the set required has been obtained. If, for every value of £, however 
small, there are cells of CM partly outside the cells c r , all these cells must 
be contained in neighbourhoods of the cells c r whose measures diminish 
indefinitely with therefore there can exist no points of G exterior to 



188 Metric Properties of Sets of Points [ch. in 

r - Mi 

all the cells c r . Hence, as before 2 m (c r ) > m (G), and c,, c 2 , ... c n . is 

r -1 

the required set. 

Next, let it be assumed that a set G 2 , a part of C l9 exists such that 
no cell of C 2 has a point in common with any of the cells c I? c 2 , c n . 
Let the set G 2 be the inner limiting set of the sequence where 

( V consists of those cells of C 2 whose measure is <rj n ; and 1 et *// 2 be 
the set of points equivalent to C 2 . We have then the relation 

r n, 

m (G 2 ) £ m (G) — 2 m (c r ). 

r 1 

In 62 a finite set of intervals c nj+1 , c nj h2 , exists, no two of which 

have a point in common, and such that 

r T i' i m {Cr) > £ m {1Li) ~£ e> l m ^ -1 e - 

In case every cell of C 2 has a point in common with one at least of the 
cells c Wl41 , c„ l+2 , , c Wa , it can be shewn as before that 

2 m (c r ) > m (G 2 ); 
r nrll 

then we have from the relation given above 

2 m ( c r ) > m (G), 

r~ 1 

and thus the required set of cells exists. If however, there exists a set C 3 
which is a part of C 2 such that no cell of 6’ 3 has a point in common with 
any of the cells c„ lfl , c nj)2 , ... c„ 2 , and G 3 be defined as in the preceding 
definition of G 2 , we have 

r --n t 

m (0 3 ) > m «j 2 ) — 2 m (c r ). 

, r«nH 1 

We can proceed indefinitely in this manner, or until the process ceases 
at some stage. We obtain then, either a finite set of cells having the 

required property, or else a sequence of cells {c r }. The series 2 m (c r ) is 

r-1 

r ~tip 

convergent, since 2 m ( c r ) < m (//x); consequently the series 

r -1 



is convergent, where G X ^G\ and therefore, the series 2 m (G r ) being 
convergent, we have lim m (G r ) = 0. Since 

oo 

S - Mr 

m (G r ) zm(G) — S m (c s ), 

8 - 1 

00 

we now have, as r ~ oo, 2 m (cj ^ m ( G ). The set {cj, of cells, accordingly 

» 5 = 1 

satisfies the conditions of the theorem. 
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It is clear that, in the general case, the solution of the set {c w }, in the 
foregoing theorem, requires an application of the multiplicative axiom, 
but in particular cases, in which the definition of the set C , of cells, is of 
a special character, the use of the axiom may be dispensed with, by the 
assignment of a law of selection. 

Important applications of Vitali’s theorem can be made in the case 
when, starting with a given measurable set of points K , with each point P, 
of K , a set of cells, with P as the centre of each cell (with equal sides), 
is defined, and so that the lower limit of the measures cf the cells is zero: 
the set may consist either of an enumerable sequence A w (P), such that 

lim m [A n (P)J - 0, 

co 

or of a continuous set consisting of all such cells of which the measure 
has a finite upper limit. The set C, in the theorem, will then be taken to 
be the totality of the sets of cells corresponding to all the points P, of the 
given set K. The inner limiting set G will then consist of the set K, 
possibly increased by the addition of a set, all the points of which are 
limiting points of K. This has been shewn to be the case for linear sets 
in § 98, and the reasoning there given can be at once adapted to the case 
of sets in any number of dimensions. In case K is a closed set, we have 
G ee K. If K is an open set, and all the cells C are interior to K, the 
set G can contain no points that do not belong to K ; and since Hc r cannot 
have its measure greater than that of K , it follows from the theorem that 
the sum of the measures of all the cells c r is equal to m ( K ). Hence the 
set of cells {c r } contains every point of K , with the exception of a set of 
which the measure is zero. We have accordingly the following theorem: 

If, at each point P of an open set K , of finite measure, a set of cells, each 
with equal sides, is defined, all the cells being interior to K, and the lower 
limit of their measures being zero, an enumerable set of cells exists, all belonging 
to the complete set so defined, and no two of which have a point in common, 
which contains all the points of the open set K, except those belonging to a 
set of measure zero . The set with centre P may form a sequence, or be un- 
enumerable . 

The following corollary from Vitali’s theorem is useful in applications: 

If E be a measurable set of points , of finite measure, a finite set of cells 
can be determined, no two of which have a point in common , which includes 
all the points of a part of E of measure > m (E) — rj, where 77 is an assigned, 
positive number, and such that the sum of the measures of the cells is < m (E). 

The set E contains a closed set H, of measure > m (E) — \rj and < m (E). 
Taking the set C of cells to be interior to an open set K of measure <m(E) 
containing H (see § 127), C is equivalent to a set of points that has measure 
< m (E), and the inner limiting set G consists, as above, of almost all the 
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points of K. The enumerable set {c r } contains almost all the points of K, and 
therefore almost all the points of H ; and its total measure is m(K) <m(E). 
Hence a finite set c ly c 2 , ... c s of these cells exists the sum of whose measures 
is < m ( E ) and it contains a part of E of measure > m ( E) — 77 . 

THE METRIC DENSITY OF A SET OF POINTS 

137 . Let P be any point in linear, plane, or ^-dimensional space, and 
let G be a measurable set of points in that space. The point P may, or 
may not, belong to G. Suppose the interval, or cell, A h to have the point P 
as its centre, and to have all its sides of equal length h; let G Ah be the 
part of G contained in A a . 

If a definite number p P (G) exists such that 

where e is an arbitrarily chosen positive number, for all such cells A a for 
which h < h„ a number dependent on e, then the number p P (G) is said 
to be the metric density of the set G at the point P. It is clear that, unless 
P is a limiting point of G, whether it belongs to G, or not, the metric 
density at P is zero. The definition is equivalent to 

Pp (G) = lim 

A~0 m (A*) 

whenever the limit exists as a unique number; the cell A a having equal 
sides, and the point P being at its centre. 

It may however happen that such unique limit does not exist; there 
may be different sequences of values of h, all converging to zero, for which 
the above ratio converges to different values. There is then an upper, 
and a lower, limit of the values to which m (G Ah )/m (A a ) may converge. 
These two numbers are both in the closed interval ( 0 , 1 ), and the greater 
of them is called the upper metric density of G at P, and may be denoted 
by pp (G); the smaller of the two numbers may be called the lower metric 
density of G at P, and may be denoted by £p (G). Thus we have 

pp (G) = US , pp (G) = lim m %4. 

H h~orn(A h )’ jrr 0 ™(A h ) 

At every point P, of G, or G', we have p P (G) £ (G); when the sign of 

equality holds, there is a metric density pp (G) at P. 

If C (G) be the complement of G relatively to a cell, or interval, in 
which G is contained, we see that 


™ (G*) rn{C ( G)} Ah 
m (A*) m (A*) 


1 , 


and thence it follows that 

pp (G) -f pp {C (G)} = 1, £p (G) 4- pp {C (#)} = 1; 




191 


136 1 —138] Metric Density of a Set of Points 

and in case the metric density of 0 , at P, exists, that of C ( G ), at P, also 
exists, and the sum of the two is unity. It is clear that, if pp (G) — 0, 
then also pp (G) exists, and has the value zero. 

More general definitions of the upper and lower metric densities at a 
point may be obtained by employing, instead of the special set of cells 
A h) other more general sets of points which satisfy certain conditions ; 
this was done* by Lebesgue. We shall however make no use of this more 
general definition. 

It is however necessary to employ the conception, duef to de la Vall6e 
Poussin, of the upper and lower metric densities at a point relatively to a 
system of nets. 

If a system of nets {D n } be fitted on to the indefinitely great linear, or 
-dimensional, space in which the set G is defined, a point P is defined 
uniquely by the set d lt d 2 , ... d n , ... of meshes of D 19 D 2 , ... D n , ... which 

contain it (see $ 51). The upper and the lower limits of m ^ T t dn ), as 

r m(d n ) 

n ~ oo, are defined to be the upper and lower metric densities of G, 

at P, relative to the system {D n } of nets. When the two have the same 

value, they define the metric density of G , at P, relative to the system 

of nets. 

138. A set G is said to be metrically dense at a point P, whether or not 
P belongs to G, if, in every cell, or interval, according to the dimensions 
of the space in which the set is defined which contains P in its interior, 
there is a set of points of G of measure > 0. A point at which G is metrically 
dense must belong to G'. 

The metrical density of G may be zero at a point at which G is 
metrically dense; for m (G A ) may be > 0 for every cell A; and yet the 
limit m (G A )/m (A) may exist, and have the value zero. 

The terms metrical density , metrically dense are employed because the 
terms dense and non-dense have already been used in a descriptive sense. 
A set may be everywhere dense, and yet nowhere metrically dense, as is 
the case, for example, for the set. of rational points in the linear interval 
(0, 1). 

The set of all points at which a set G is metrically dense is dosed . 

For if, in every neighbourhood of a point P, there are points at which 
G is metrically dense, it is clear that, at the point P, G is metrically dense. 

Those points of a measurable set G at which the set is not metrically dense 
form a component of G , of measure zero , provided that component exists. 

* Annales sc. de Vtcole normals, (3), vol. xxvn (1910), p. 387. 

t Integrates de Lebesgue , pp. 63, 71; also Trans. Amer. Math. Soc. vol. xvi (1915), p. 488. 
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Apply a system of nets to the cell, or interval, in which 0 is contained. 
Let D ni be the first of the nets which has one or more meshes each of which 
contains only a part of G of measure zero; let D n f denote those meshes 
of D Wl for which this is the case. Let D ni (n 2 > nf) he the first net which 
has meshes, not contained in D ni f , in each of which the component of G 
has measure zero; let D r ,f denote these meshes. Proceeding in this manner, 
we define a sequence 2> ni ', D n f, ... each of which consists of a set of meshes 
belonging to D Ul , Z)„ 2 , ... respectively, and such that D n f does not belong 
to D nr _f for any value of r. Every point P, belonging to G, at which G 
is not metrically dense belongs to one of the meshes of the sets /> ', D n f, ..., 
each of which contains a set of points of G of measure zero. Since the set 
of all points of G contained in all these sets of meshes is of measure zero, 
it follows that the component of G , at each point of which G is not metri¬ 
cally dense, has measure zero. 

A set of points is said to be metrically dense in itself when every point 
of the set is a point at which the set is metrically dense. 

If, from any set G , such that m (G) > 0 , those points at which the set 
is not metrically dense be removed, there remains a set H such that 
rn (II) - m (G), which is metrically dense in itself. For, in the arbitrarily 
small neighbourhood of any point of H , there is a set of points of G of 
measure > 0 , and therefore a set of points of II of measure _> 0 . The 
following theorem has therefore been established: 

Any measurable set , of measure > 0 , is the sum of a set of measure zero , 
which may be non-existent , and of a set that is metrically dense in itself. 

In case the set G is closed, we may add to H its limiting points, all 
of which belong to G; we thus obtafin a closed component H of G i where 
m (H) = m (G). The set U has the same property as II ; for, in the neigh¬ 
bourhood of a limiting point of H , there must be a set of points of H of 
measure > 0 . It is clear that ti has no isolated points, hence it is a perfect 
set. It has thus been proved that: 

A closed, set G, of measure > 0 , is the sum of a perfect set H, of measure 
equal to that of G, and metrically dense in itself , and, of a set of measure zero. 

Any measurable set G contains a non-dense closed component, of 
measure arbitrarily less than that of G (§ 125). This non-dense closed 
component contains a perfect set, also non-dense, which is metrically 
dense in itself, and of measure equal to that of the component. We have 
thus proved the following theorem, due to Lusin : 

Any measurable set , of measure > 0 , contains a non-dense perfect set , 
metrically dense in itself , and of measure differing by an arbitrarily small 
amount from that of the given set . 
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139. With a view to the establishment of the fundamental theorem 
in the theory of metrical density it is convenient to introduce the con¬ 
ception, due* to de la Vallee Poussin, of conjugate systems of nets. 

Let us first consider a net /) x , all the meshes of which are of equal 
length, d , fixed on to the indefinite interval (-oo, oo) on the a: (1) -axis. 
Let i> 2 be the net obtained by dividing each mesh of D l into three equal 
meshes of breadth we proceed to obtain I , by dividing each mesh 
of 1) 2 into three equal parts, and so on indefinitely. We have thus a 
symmetrical system of nets, such that the breadth of each mesh of D n is 
d/ 3 n 1 . Next consider the net of which the end points of the meshes 
are the middle points of the successive meshes of D x \ from I ) l '. we form a 
symmetrical system of nets {/>„'} in exactly the same manner as {D n } 
was formed from D x . it is then easily seen that the end-points of the 
successive meshes of D n r are the middle points of the successive meshes 
of D„. The two systems of nets {/)„},{/)„'} are then said to be conjugate 
to one another; they have the property that any interval of length < \d 
is contained within a mesh of one at least of the two nets D n , D n \ pro¬ 
vided its length is < ld/ll n 3 . 

Next, let systems of linear nets, precisely similar to {/>„}, {!)„'}, be 
fitted on to the a; (2, -axis; and let, straight lines of indefinite length, be 
drawn parallel to the x (1) -axis and the x (2) -axis, intersecting those axes 
at the end-points of the meshes of the two pairs of conjugate linear 
systems of nets. In this manner wo obtain four systems of symmetrical 
nets fitted on to the unbounded plane of (x {1) , x {2) ) \ they are said to form 
four systems of nets, any pair of which are conjugate to one another. 
Let us consider any square, with its sides parallel to the axis, of side «, 
< | d, in the plane; the projections of two sides on the a* (1, -axis fall within 
a mesh of at least one of the two nets l ) 7l , J) n \ provided a < bd/W 1 1 ; 
also the projections of the other sides fall within a mesh of at least one 
of the two corresponding nets on the x (2) -axis. Consequently the square 
is entirely contained in one at least of the four plane nets of order n y 
belonging to the four systems of nets which have been fitted on to the 
plane. The same holds for any cell, provided its greatest side is < \dj 3* -1 . 

Proceeding in a similar manner, 2 P conjugate systems of nets can be 
fitted on to the p-dimensional space, and a cell, whose sides are a (< \d) 
is contained within a mesh of at least one of the nets of order n , belonging 
to the 2 V sets, provided the greatest side of the cell is < |rf/3 nl . We 
may suppose, as in § 51, that all the meshes of all the nets are semi-closed, 
so that any point is defined uniquely by the set of meshes of any one of 
the systems in which it is contained. 


Integrates de Lebesgue, p. 64. 
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The following theorem will be established : 

If E and F are two measurable sets of points , in any number of dimen¬ 
sions, and those points of E at which the upper metric density of F is greater 
than som,e positive number c,form a component E 1 , of E, such that m (E x ) > 0, 
then a system, of nets can be so determined, that those points of E at which the 
upper metric density of F relatively to the system of nets is > c/A farm a 
set E 2 such that m (E 2 ) > 0 , where X is a fixed number properly chosen. 

Suppose that, at a point P, of E x , the upper metric density of F is 

> c, then a sequence of cells exists, each one of which has its sides 
equal to h, and with P as its centre, and each cell of the set containing 
the next, which converges to the point P, and is such that, in each cell, 
the ratio of the measure of the part of F in the cell to the measure of the 
cell is > c. Consider a set of 2 P conjugate systems of cells, as defined 
above. For any value of h (< \d), there exists an integer n , such that 

d . h> d 
2 ^ ^ • 

The cell A a is then interior to a mesh of one of the 2 P conjugate systems, 
the length of a side of this mesh being . The ratio of the measure 
of the part of F contained in this mesh to the measure of the mesh is 

> . Since this is the case for all the values of A in a certain sequence, the 

upper metric density of F at P, relatively to one at least of the 2 P systems 
£ 

of nets, must be > c, where A is a number > 6 P . This is the case for each 

A 

point P belonging to the set E x , of which the measure is > 0 ; there must 
therefore be a part E 2 , of E x , of measure > 0 , such that the upper metric 
density of P, at every point of P 2 , relatively to one of the 2 P systems of 
nets is > c/A. That one of the conjugate systems of nets is a system of 
nets such as is required. 

140. The following fundamental theorem will now be established: 

The metric density of a measurable set E exists , and is equal to unity , at 
all points of E , with the possible exception of the points of a component of 
which the measure is zero; and the metric density of E exists , and is equal 
to zero , at all points of G ( E ), with the possible exception of points of a set 
of measure zero. 

We shall first suppose that E is a bounded set. It will be shewn that 
the points of E at which the upper metric density of C (E) is > 0 form a 
set of measure zero. 

Let us suppose, if possible, that the measure of the component of E, 
at each point of which the upper metric density of C (E) is greater than 
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a fixed positive number a , is > 0. A system of nets can then be so de¬ 
termined that the upper metric density of G ( P ), relatively to the system 
of nets, is > a/X, at ail points of a component P a , of E y such that m ( E a ) > 0. 
Let Q be an open set of points which contains P a , and is such that 
m (Q) — m (P a ) is less than an arbitrarily chosen positive number rj. 

Any point P, of E a , is contained in a unique sequence of meshes of 

the nets, one in each of the nets D 1} D 2t ... D ny _ From and after some 

value s , of n y all these meshes are contained in O. Of these meshes 
d s , d 8+l , ..., there must be one of lowest rank n P such that the measure 

of the part of G (E) that it contains is > y m (d n ). When d n has been 

determined, we may suppose it to correspond to each of the points P, 
of P a , which it contains. To any other point P', of E a , not contained in 
d np , there corresponds a similar mesh d np , . An enumerable set of such 
meshes will contain all the points of E a , and it is contained in LI. The 
measure of the part of C (E) contained in this enumerable set of meshes 

is > ^ m (P a ). But the measure of the set of meshes is < m ( E a ) + rj } and 

therefore the measure of the part of G (E) contained in the set of meshes 

is < tj. Now 7] can be so chosen as to be < ~ w (P»); hence the assumption 

that m (E a ) > 0 leads to contradiction. It follows that m (E a ) = 0. Taking 
for a the values in a decreasing sequence of numbers that converges to 
zero, the set of points of E at which the upper metric density of G (E) is 
> 0 is the outer limiting set of the sets E a , all of which have measure zero. 
Therefore those points of E at which the upper metric density of G (E) 
is > 0 form a set of measure zero. With the exception of this set, at every 
point of E the metric density of G (E) is zero. 

It then follows that, at every point of E not belonging to a component 
of measure zero, the metric density of E is unity. Since E and C (E) can 
be interchanged, we see that the metric density of E at all points of G (E) 
is zero, except at points of a component of G (E) of measure zero. 

In case the set E is unbounded, we may consider the parts of it in cells 
of a sequence each of which contains the preceding one, and of which the 
measures increase indefinitely. Each of the cells may contain an excep¬ 
tional set of measure zero, at which the metric density of the component 
of E in the cell either does not exist, or is not equal to unity. The outer 
limiting set of these exceptional sets has also the measure zero; hence 
the theorem holds for the unbounded set P. 

The above theorem was first established, for the case of linear sets, by 
Lebesgue, who employed the theory of integration. Other proofs, inde¬ 
pendent of the theory of integration, have been given, for linear sets, by 


13-2 
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Penjoy*, and by Lusin and Sierpinskif. The above proof is founded on 
the treatment of the subject by de la Vallee Poussin, who established the 
theorem for the ease of sets of points in any number of dimensions. 


THE RESOLUTION OF SETS OF POINTS IN ACCORDANCE WITH 
METRICAL PROPERTIES 

141 . It was shewn in § 91 that every set of points can be expressed 
as the sum of an enumerable (or finite) series of sets which have certain 
descriptive properties. We proceed to analyse a given measurable set into 
the sum of parts which have certain metrical properties. 

It will be shewn that: 

Any bounded measurable set can be expressed as the sum of an enumerable 
sequence of perfect sets, together with a set of which the measure is zero. 
Moreover, the perfect sets can be so clwsen as to be all non-dense, and, such 
that each of them is metrically dense in itself . 

If S be a measurable set, it has been shewn in § J25 that it contains 
a perfect set H l such that m (S) — m (H x ) < e/4. If H 1 is not non-dense 
it has a non-dense perfect component G l9 such that m (// 1 ) - m (G\) < e/4; 
therefore rn (S) — m (Of) < e/2. The perfect set G 1 has a perfect component 
(§ 138) L l , of the same measure as G x , and metrically dense in itself. We 
have now 8 — L x 4- M x \ where m ( 8) — m (L x ) < e/2; or m {M x ) < e/2. 
The set M x may be similarly resolved into 4 + M 2 ; where L 2 is perfect, 
non-dense, and metrically dense in itself, and where m (M 2 ) < e/2 2 . This 
procedure may be carried on successively, and terminates only if one of 
the sets M n is either absent or has the measure zero. In case M n exists 
for every value of n, its inner limiting set is Mu, which has the measure 
zero, since lim m ( M n ) = 0. We have therefore, in this case, 

S — JL X -f- L 2 -f- ... + L n 4- ... 4* M^, 

where the series L x , L 2 , ... of perfect sets, all non-dense and metrically 
dense in themselves, is either finite, or forms an unending series. 

It will be observed that the set Mu is a residual set, relatively to S, 
although it has the measure zero. In particular, the set of all the points 
of a cell can be resolved in this manner into a sum of sets; the residual S 
being everywhere dense, and of cardinal number c, although its measure 
is zero. 

* Journal de Maih . (7), vol. i (1915), p. 132. 

t Rend. d. Circ. Mod. di Palermo , vol. xlii (1917), p. 167. See also Sierpinski, Fundamenla 
Mat. vol. tv (1923), p. 167. 
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Jordan’s measure of a set of points 

142. A definition has been employed by Jordan*, and by Peanof, of 
the measure of a set of points, which differs from that, developed by 
Bore I and Lebesgue, which has been employed in the present Chapter. 
It is applicable to sets of points in space of any number of dimensions, 
and is of utility in the theory of quadratures. 

Let a set G be in the interior of a closed cell (or interval) A; and let a 
system of nets with closed meshes be fitted on to A. Lei S„ (1) be the sum 
of the measures of those meshes of l) n which are such that every point 
of each of them is an interior point of G, and Jet X ? , <2) be the sum of those 
meshes of D n which are such that each of them contains either an interior 
point of G or a point on the frontier of G and C (G). It can be shewn that 
S r; (1) converges, as n ~ go, to a number , and that Z„ (2) converges to 
a number S 2 ; where aS, and S 2 are independent of the particular system 
of nets fitted on to A. 

The number Sj is called the interior extent of the set G, and S 2 is called 
the exterior extent of G. When — $ 2 , the set is said to be measurable 
(./), and the number S 2 is said to be its measure (J). The exterior 
extent of a set is identical with its content, as defined by Harnack and 
Cantor (§118). 

In accordance with this definition, any set which contains no interior 
points has its interior extent zero. For example, the interior extent of the 
irrational points of the linear interval (0, 1) is zero, and its exterior 
extent is 1. 

A set G consists, in general, of interior points forming a set /, and of 
points F 1 , all of which belong to the frontier F. The interior extent of G 
is identical with the measure of the open set /, and the exterior extent of 
G is identical with the measure of the closed set 1 Hr F; the measures of 
these sets being defined as in § 122. It is then clear that the necessary 
and sufficient condition that a set G should be measurable (J) is that its 
frontier, which is a closed set, should have the measure zero. Accordingly, 
a set that is measurable (J) is also measurable in accordance with the 
definition here adopted (§ 128), but the converse does not hold. 

THE SECTIONS OF A CLOSED SET 

143 . If G be any closed set of points in a rectangle A BCD, and through 
the points P, of AB f straight lines PP r are drawn perpendicular to AB f 
and if / (P) denote the linear content of the linear component of G which 
is on the straight line PP', and which may be termed the section of G 

* Journal de Math. (4), vol. vm (1892), p. 78; also Cours d'Analyse, vol. i, p. 28. 
f Applicazioni geom. del. calc, infinit. (1887), p. 153. 
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by PP', then the set of points P, on AB, which is such that f (P) £ cr, is a 
closed set; cr denoting any positive number. 

Let Pj be a limiting point of the set; and if possible, let the linear 
content of that component of G which is on PjP/ be < a; we can then 
determine a finite number of intervals 8 l5 8 2 , ... 8 r , on P 1 P 1 \ whose sum 



Fig. 2 


is > AD ~~ <7, and which are free in their interiors and at their ends from 
points of G. On each of these intervals 8 we can describe a rectangle which 
contains no points of G within it or on its boundaries: this may be done 
on either side of PjP/; for each point of 8 can be enclosed in a rectangle 
free from points of G; and by the Heine-Borel theorem, a finite number 
of these rectangles, enclosing all the points of 8 , exists. Take a point Q 
belonging to the set of points for which / ($) ^ cr, and let P X Q be less than 
the breadth of all the rectangles described on the intervals { 8 } on one 
side of PjPj'. On QQ' there is a finite number of intervals free from 
points of G , whose sum is > AD — cr, by the assumption as to P^Pf\ 
hence the linear content of the component of G which is on QQ' must be 
< a, which is contrary to the hypothesis. It follows that/ (P 2 ) £ a; hence 
the set of points on AB is closed. 

It will now be shewn that, for a closed set of points 6 r, if for every position 
of P , on AB, the linear content of the section of G by PP' is < a, then the 
content of G is < a .AB. 



A 
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P 
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Taking any point P, of AB ; on PP', a finite number of intervals, whose 
sum is > AD — a, can be found which are free from points of G; and on 
each of these intervals a rectangle can be drawn on each side of PP', 
containing no points of G in its interior or on its boundary. We can now 
draw two straight lines pp', qq', one on each side of P, so that each of 
them passes through the interiors of all the rectangles so described. We 
have now found an interval pq containing P, such that in pqq'p' there is 
an area > pq (AD — a) free from points of G. Corresponding to each 
point P, of AB, such an interval pq can be found; and, in accordance with 
the Heine-Borel theorem, a finite number of these intervals can be selected, 
such that every point of AB is in the interior of one at least of them. 
The end-points of these intervals divide AB into a finite number of parts, 
such that, above any one part, of length a, there is an area > a (AD ™ a) 
free from points of G; and hence there is altogether an area > AB (AD ~ a) 
free from points of G. It follows therefore that the content of G is < AB. a. 

We shall now establish the following theorem, which is of importance 
in the theory of double integration: 

If G be a closed set , and if the linear content of the set of points P, on AB, 
for which the linear content of the section of G by PP' is £ cr, have the value 
zero, for every positive value of cr, then the set G is of zero content. 

The points on AB, for which / (P) > cr, can be enclosed in a finite 
number of intervals whose sum is < e, where e is an arbitrarily small 
number; and in each of the remaining parts of A B, the value of / (P) 
is < cr; hence by the foregoing theorem the content of G is 

< a (AB — c) 4* €.AD; 

and since this holds for arbitrarily small values of a and e, it follows that 
the content of G must be zero. 

Conversely, it may be shewn that: 

If G be a closed set, of plane content zero , the set of points P, on AB, for 
which the linear content of the section of G by PP' is £ cr, has, for every 
positive value of a, content zero. The linear measure of the set of points P 
for which the linear content of the section of G by PP' is greater than zero 
is zero. 

Let I denote the linear content of the set (P) for which f (P) £ a ; 
divide AB into n equal parts, and A D also into n equal parts, and through 
the end-points of these parts draw straight lines dividing the rectangle 

into equal parts each of area ^.AB .AD. Then the sum of those parts 

of AB which contain points of the set (P) is always greater than /; and 
in each such part there is at least one point P, such that the sum of the 
parts of PP' which contain points of G is > cr. It follows that the sum of 
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those rectangular portions which contain points of G is > al y however 
great n may be; and hence that the content of G is > ol. Therefore it 
follows that G cannot have zero content unless / is zero. 

The second part of the theorem is proved by giving to a the values in 
a sequence {cr w } which converges steadily to zero. We have then only to 
consider the outer limiting set of the sequence of sets which correspond 
to the values of cr in the sequence {a w }. 


EXAMPLES 


I. Let a set of points ( x , y) in the rectangle for which 0 * x • 1,0:?/* 1, be defined 
as follows*: —The numbers x, y are expressed in the dyad scale, and only those values of x 
and y are taken which are expressed by terminating radix-fractions, the number of digits 
being the same for x as for y. If x' denotes a terminating radix-fraction, there is only a 
finite number of points (x\ y') of the set on the straight line x—x'; similarly if y' denotes 
a terminating radix-fraction, there is only a finite number of points of the set on the 
straight line y — ?/. The two-dimensional set is however everywhere dense: for, considering 
a straight line y x i a 9 where a is a positive, or a negative, radix-fraction with a finite 
number of digits, we see that, corresponding to any number x expressed by a finite number 
of digits greater than the number of digits by which a is expressed, there is a point (jr, y) 
on the straight line belonging to the set. The component, of the set on the straight line 
y ~x -} n, being everywhere dense, and the values of a being everywhere dense in the interval 
(- 1, 1), it follows that the set is everywhere dense in the rectangle. 

This example shews that an everywhere dense two-dimensional set may he linearly 
non-dense on each straight line belonging to two parallel sets. It also shews that a two- 
dimensional set may exist which is extended, but is unextended on straight lines belonging 
to either of two parallel sets. 


2. Let a erossf formed by two pairs of straight lines parallel to the pairs of sides of a 
square be constructed, and so that the remainder of the square consists of four equal squares 
at the corners. Let the interior points of the cross be removed from the square, and then 
let a similar cross bo removed from each of the remaining four squares. Proceeding in this 

manner, let the crosses ho so chosen that the area of each square after the rath stage of the 
p 

process is ab 2 W times the area of each square after the preceding stage. The sum of the 
areas of the squares which remain after the rath stage is 




'T 


p 0 i”) 


Q. 


(4 a) m b z “ 7 Q~{4a) rn b 

where Q is the area of the original square. A non-dense closed set of points is defined as 
the points which remain when this process is carried on indefinitely. The limit of the sum 
of the crosses is that of 


? '( l 2m )]l 


-(4 a) m b 

and this is Q, or < Q , according as a % ]; it follows that the closed set has content zero, 
if a < J; but if a - J, the content is b-vQ. 


* Pringsheim, Sitzungsbcrichte d. Munch. A had. vol. xxrx (1899), p. 48, 
f Veltmann, ScMomileh's Zeitsch. vol. xxvn (1882), pp. 178, 314. 



CHAPTER IV 


TRANSFINITE NUMBERS AND ORDER-TYPES 

144 . A preliminary account has been given, in Chapter n, of the 
theory of transfinite ordinal arid cardinal numbers; it was shewn that the 
introduction of such numbers was suggested by the exigencies of the 
theory of linear sets of points, and that, in particular, the necessity for 
the use of transfinite ordinal numbers arises whenever a convergent se¬ 
quence of points is transcended by adjoining to the points of the sequence 
their limiting point and any further points which it may be desirable to 
regard as belonging to the same set as the points of the sequence. The 
fundamental discovery of G. Cantor, that the rational points of an interval 
form an enumerable set, whereas the set of points of the continuum is 
unenumerable, by establishing the existence of a distinction between the 
characters of two infinite sets, suggests the development of a general 
theory of cardinal numbers of infinite aggregates. The procedure we 
adopted, of introducing the fundamental notions of transfinite ordinal 
and cardinal numbers in connection with the theory of sets of points, is 
in accord with the historical order in which the whole theory of transfinite 
numbers and order-types was developed. The account of the theory of 
transfinite numbers given in Chapter II is in general agreement with 
Cantor’s earlier presentation* of his ideas; his laterf, and more abstract, 
treatment of the subject is the one upon which the account given in the 
present Chapter is founded. 

In order that the reader may be put into a position to form his own 
conclusions as to the validity of a scheme which must be regarded as still, 
to some extent at least, in the controversial stage, it has been thought 
best to postpone any discussion of the difficulties of the theory, until after 
the conclusion of the detailed account of the theory in its constructive 
aspect. 

In the latter part of the Chapter, some critical remarks upon the 
logical basis of the theory will be made; these must necessarily be of an 
incomplete character, partly from considerations of space, and also because 
any complete criticism of such a scheme as Cantor’s theory of transfinite 

* See his “Grundlagen einer allgemeinen Mannigfaltigkeitslehre,” Leipzig, 1883, or Math. 
Annalen, vol. xxi (1883), p. 545; see also Zcitschrift f ur Phil, und phil. Kritik, vols. Lxxxvm, 
xci and xcu. Cantor’s ideas were foreshadowed in a paradoxical form by Bolzano in his Para- 
doxien des Unendlichen, Leipzig, 1851; and although infinite numbers had been discussed by 
earlier writers, Bolzano is the only real predecessor of Cantor in this department of thought. 

t This is contained in the two articles “Beitriige zur Begriindung der transfiniten Mengen- 
lehre,” in the Math. Annalen , vol. xlvi (1895), and vol. xlix (1897). 
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numbers would involve the consideration of questions of an epistemological 
character, which for obvious reasons cannot be adequately dealt with in 
a work of a professedly mathematical complexion. Objections which may 
be urged against some parts of the theory will however be fully stated. 
Some consideration will also be given to the question, whether, and how 
far, the theory is indispensable as a logical basis of continuous Analysis. 

THE CARDINAL NUMBER OF AN AGGREGATE 

145. A collection* of definite distinct objects which is regarded as a single 
whole is called an aggregate . 

An aggregate may be denoted symbolically by a large letter M , the 
elements of the aggregate by small letters m ; and the constitution of the 
aggregate may be denoted by the equation M — {m}. 

The consideration of questions which arise in connection with this 
definition, as to the mode in which the objects of the aggregate must be 
specified in order that the aggregate may be adequately defined, and as 
regards the conditions, if any, which must be satisfied in order that a 
collection may be regarded as a whole, or aggregate, of such a character 
that it can be an object of mathematical thought, will be postponed. For 
the present, it is sufficient to remark that an adequate definition of any 
particular aggregate, which is not necessarily finite, must contain, as a 
minimum, a set of rules or specifications by means of which it is theoretically 
determinate, in respect of any object whatever, whether such object does, 
or does not, belong to the aggregate. The set of prime numbers, for 
example, is regarded as an aggregate although, when a particular number 
is presented to us, we may be practically unable to decide whether that 
number is prime or not. In this case, however, a finite number of processes 
will suffice to decide the question. If however, we take the case of the 
algebraical numbers, the state of things is different; for we are not in 
possession of any general method which enables us to decide whether a 
given number is algebraic or not. Nevertheless, the question being re¬ 
garded as having a logically determinate answer, the algebraical numbers 
are regarded as forming an aggregate, in the sense here employed. 

An aggregate does not depend, for its validity as a mathematical 
entity, upon the possibility of producing all its members, successively or 
otherwise, but upon the sufficiency of the rules by which its elements are 
to be distinguished, as belonging to it, in that particular kind of objects 
to which they belong; that is, upon the sufficiency, in this direction, of 
its definition of membership. 

* This definition is given by Cantor, Math. Annalen, vol. xlvi (1895), p. 481, as follows:— 
“Unter einer ‘Menge’ verstehen wir jede Zusammenfassung M von bestixnmten wohl unter- 
schiedenen Objecten m unserer Anschauung oder unseres Denkens (welche die ‘ Element® * von M 
genannt werden) zu einem Ganzen.” 
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Two aggregates M, N are said to be equivalent to one another when 
they are such that a law of correspondence can be established between 
the elements of one aggregate, and those of the other, so that, to each 
element of one of the aggregates, there corresponds one, and only one, 
element of the other aggregate. 

This relation of equivalence between two aggregates M and N may be 
expressed symbolically by M ~ N, or N ~ M . 

It is clear that, if each of two aggregates is equivalent to a third, the 
two aggregates are equivalent to one another. 

Aggregates which are equivalent to one another are said to have the same 
power , or cardinal number. 

A cardinal number is accordingly characteristic of a class of equivalent 
aggregates. 

The question whether two defined aggregates have, or have not, the 
same cardinal number, is thus equivalent to the question whether it is, 
or is not, possible to establish a systematic (1, 1) correspondence between 
the elements of the two aggregates, in accordance with the above definition 
of equivalence. 

A particular aggregate can ordinarily be shewn to be equivalent to 
itself. The law of correspondence between an element and another element 
which can be set up is in general of a character which admits of a certain 
arbitrariness. The cardinal number is accordingly regarded as independent 
of the notion of order in the aggregate. 

The power, or cardinal number, of an aggregate M has been defined 
by Cantor as the concept which is obtained by abstraction when the nature 
of the elements of M , and the order in which they are given, are entirely 
disregarded. 

Cantor regards the fact that equivalent aggregates have the same 
cardinal number as a deduction from this definition. 

The cardinal number of M is a characteristic of M which may be 
denoted by M , to indicate that both the order of the elements, and their 
precise individual nature, are irrelevant as regards the cardinal number. 

The relation of equivalence M ~ N, between two aggregates, implies 
the equality M =•- N ; and this equation expresses the necessary and suffi¬ 
cient condition for the equivalence of M and N. 

Since Cantor regards the cardinal number of M as independent of the 
precise nature of the elements of M , we may, in accordance with this view, 
substitute for each element the number unity. We have thus a new 
aggregate which is a collection of elements each of which is the number 1, 
and is equivalent to M ; and this new aggregate is regarded by Cantor as 
a symbolical representation of the cardinal number M . 
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THE RELATIVE ORDER OF CARDINAL NUMBERS 

146 . Every aggregate M x , which is such that all its elements are also 
elements of M, is called a part, or sub-aggregate, of M . 

If M 2 is a part of M x , and M x is a part of M, then M 2 is a part of M. 

A finite aggregate cannot be equivalent to any of its sub-aggregates; 
but as will be seen in detail further on, an infinite aggregate always possesses 
sub-aggregates which an 1 equivalent to itself. This is the characteristic 
distinction between finite and infinite aggregates, and has in fact been 
employed by Dedekind and others to define an infinite aggregate as one 
which is equivalent to one of its parts. 

If two aggregates M, N , with the cardinal numbers a == M, ft -s~. A, 
are such that, (1), there exists no part of M which is equivalent to A, and 
(2), there exists a part N x , of A, which is equivalent to M, it is clear that 
the corresponding conditions are satisfied for any two aggregates which 
are equivalent to M, A respectively; and thus the two conditions char¬ 
acterize a relation between the cardinal numbers a, ft of the two aggregates. 
When the above conditions are satisfied we say that a is less than ft, and 
that j8 is greater than a; which is expressed symbolically by a < /?, ft > a. 
This is the definition of inequality for two cardinal numbers, and of the 
relations greater and less, in the purely ordinal sense in which they are 
here used. 

The condition contained in the definition is inconsistent with the re¬ 
lation of equality between a and being satisfied. For if a z= /3, then 
M — A; hence since A 1 ~ M , we have N x ~ N : therefore, since M ~ A, 
there must be a part of M , say M x , such that M x ~ M , which would in¬ 
volve M x ~ N ; but this is contrary to one of the conditions contained in 
the definition of inequality. 

It is easily seen that if a < fi, and fi < y, then a < y. 

147 . It has been seen that the three relations a - f3, a < ft, a > arc 
mutually exclusive; but the question arises whether any two cardinal 
numbers a, p whatever must satisfy one of these relations. An affirmative 
answer to this question would be required before it could be maintained 
that all cardinal numbers can be regarded as being alike capable of having 
relative rank assigned to them, in a single ordered aggregate. 

Two aggregates M, A, of which we may denote parts by M x , N x , 
must satisfy one, and ordy one, of the following four conditions: 

(1) . M, N have parts M x , N x , such that M x — A, and N x — M. 

(2) . M has a part M x , such that M x ~ A; but no part of A exists 
which is equivalent to M. 
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(3) . There is no M x which is equivalent to N ; but there is an N 1 which 
is equivalent to M . 

(4) . There exists no M x equivalent to N ; and also no N x equivalent to M. 

It will be proved that, if the condition (1) is satisfied, then M — N. 

The condition (2) expresses the relation defined as M > N. The condition 
(3) expresses the relation defined as M < N. 

In case M is a part of N, M is itself equivalent to a part of N, and 
thus either (1) or (2) is satisfied; and therefore M =- iV. 

It has not been proved that the relation (4) is an impossible one; 
except that, in the case of finite aggregates, it may be easily seen that it 
involves M - N. Until this point is cleared up, it cannot be maintained 
as an established fact that the cardinal numbers a , ft of any two aggre¬ 
gates whatever satisfy one of the three relations a — ft, a > ft, a < ft. 

Two aggregates which are such that their cardinal numbers a, ft stand 
to one another in one of the relations a --- ft , a > ft , or a < ft, may be 
said to be comparable with one another. Otherwise they are incomparable 
with one another. 

THE ADDITION AND MULTIPLICATION OE CARDINAL NUMBERS 

148. If M, N are two aggregates which have no element in common, 
then the aggregate which has for its elements all those of M and all 
those of A is called the sum of the two aggregates M, N , and may be 
denoted by ( M , N). A similar definition applies to the case of the sum 
of any number of aggregates, no two of which have an element in common. 

If M ', N' are two other aggregates with no element in common, such 
that M ~ M', N — N', it is clear that (M , A 7 ) ~ (M' , N'); and thus the 
cardinal number of (M, N) depends only on those of M and N. 

If M = a, N = ft, we define the result of the operation of addition of a 
and ft to be (M, N). 

From the independence of cardinal numbers of the order of elememts,- 
we deduce „ + f} = 0 + a, a + (/? + y) = (« + ft) + y; 

thus the operation of addition of cardinal numbers obeys the commutative 
and associative laws. 

149. If an element m, of M, be associated with an element n, of N, 
so as to form a new element (m, n), the aggregate of all possible elements 
which can be formed in this way is called the product of M and N, and 
may be denoted by (M .N). 

If M ~ M ', N ~ N', it is clear that, to each element (m, n), of (M .N), 
there is a corresponding element of (M'.N'), hence (M .N) ~ (M *. A^'); 

and thus (M .N) depends only on M and N. 
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The cardinal number of the product-aggregate (M .N) defines the product 
of the cardinal numbers of M and N. 

The product of M and N may also be defined as the cardinal number of 
the aggregate which is obtained by substituting for each element of N an 
aggregate which is equivalent to M. 

It is seen on reflection that this definition is equivalent to the first one. 

Since, as can be shewn from the definition, 

(M.N) ~ (N.M), (M.(N.E)) ~ ((M.N).B) 
and (M . (N, B)) ~ ((M . N), (M . B)), 

we see that cardinal numbers satisfy the relations 

a.fi = /La; a .(/? .y) -= (a, fi) y\ a (/? + y) = aft -f ay. 

It has thus been shewn that the multiplication of cardinal numbers 
obeys the commutative, associative, and distributive laws. 

The definition of multiplication may be extended* to the case in which 
the number of factors is not necessarily finite. Let us consider a class of 
aggregates M , where the class contains either a finite, or an infinite, number 
of aggregates, and suppose no two of the aggregates have an element in 
common. Let a new object consist of an association of elements of the 
aggregates in the given class, one element belonging to each of those 
aggregates. All such objects may be regarded as the elements of a new 
aggregate. This new aggregate is said to be the product-aggregate of the 
given class of aggregates, and its cardinal number defines the product of 
the cardinal numbers of all the aggregates of the given class. 

CARDINAL NUMBERS AS EXPONENTS 

150. If we have two finite aggregates M, N , containing x and y ele¬ 
ments respectively, we may suppose that, to each of the y elements of N , 
one element of M is made to correspond, so that the same element of M 
may be used any number of times; any such particular correspondence 
we call a distribution of N upon M. The total number of ways o£ dis¬ 
tributing N upon M is x v . To put the matter in a concrete form, the 
total number of ways of distributing y things among x persons, where 
any number of the y things may be given to one person, is x v ; any par¬ 
ticular mode of distribution is what we have called a mode of distributing 
the aggregate of y things upon the aggregate of x persons. 

The definition of distributing an aggregate N upon an aggregate M is 
immediately extensible to the case of infinite aggregates. As before, the 
distribution denotes any system by which, to each element of N, is made 
to correspond a particular element of M , the same element of M being 

* See Whitehead, American Journal of Math. vol. xxiv (1902), p. 367, where the theory of 
cardinal numbers is treated by the Peano-Rusaell symbolical method. 
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employed any number of times, or not at all. Denoting by N/M each 
particular mode of distributing N upon M , we thus form the new aggregate 
(N/M) which contains as its elements all such distributions. 

It is seen at once that, ii M ~ M\ N ~ N', then (N/M) ~ (N'/M r ). 
Thus the cardinal number of (N/M) depends only on the cardinal numbers 
of M and N. 

The cardinal number of the aggregate ( N/M ), each element of which 
is a distribution of N on M , and in which every possible mode of such 
distribution occurs as an element, is denoted by the symbol a*, where 

a ~M, p =N; thus a 8 = (WfM). 

It is easy to shew that 

((N/M).(R/M))~((N, R)/M) 

({R/M).(R/N))~{RHM.N)) 

(R/(N/M))~((R.N)/M). 

Hence if M — a, N — jS, R — y, we see that, in accordance with the 
above definition of exponentials, 

a? .a? — a 8+y , .fir = (a.p)v, (a*)v = a?'*; 
and thus the same laws hold as for exponents in which only finite cardinal 
numbers are involved. 

THE SMALLEST TRANSFINITE CARDINAL NUMBER 

151. The cardinal number of the aggregate of all the finite integers 

1, 2, 3, ... ft,.is called Alef-zero, and is denoted by thus Kq = {n}. 

The number K,, is identical with the number which has previously been 
denoted by a. 

If we add to {ft} a new element e, we obtain the sum-aggregate ({ft}, e), 
and this is equivalent to {ft}, for we may make e in the first of these aggre¬ 
gates correspond to 1 in the second, and in general ft to ft -f 1; and thus 
({ft}, e) ~ {ft}. From this, we obtain Xo + 1 = a relation which differ¬ 
entiates from all the finite cardinal numbers. 

The cardinal number Nq is greater than all the finite cardinal numbers , 
and no transfinite cardinal number exists that is less than Kq . 

Since the finite aggregate (1, 2, 3,... k) is a part of {ft}, but no part of 
the finite aggregate is equivalent to {ft}, we have, by the definition of 
inequality, Nq > k. 

Let us suppose, if possible, that a = {M } < ; then there is no part 

of {M} equivalent to {ft}, but there is a part of {ft} equivalent to {M}. 
Now every part of {ft} is either finite, or else is equivalent to {ft}. For, 
consider a part N ls of {ft}; this must contain some number n; either n is 
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the number of lowest rank in A r ,, or there are in A', numbers of lower 
rank than n. In either ease N x contains a part N 2 which is a part of the 
finite aggregate 1 , 2, 3, ... n. In accordance with the definition in §2, 
N 2 has a number n x , of lower rank than all the other numbers in 1, 2, 3,... n ; 
this number n x has lower rank than all the other numbers in A T X . Re¬ 
moving ti y from iVj, we now have a part N 2 , of {??}, and it may be proved 
as before that there exists a number n 2 , of lower rank than all the other 
numbers in N 2 . Proceeding in this manner, we obtain a set n x , n 2 , n 3 , ... 
of numbers all belonging to iVj, each one of which is of lower rank than 
the next, if this sequence terminates at some number n v , Aq is a finite 
aggregate; if it does not terminate, it is a sequence equivalent to {n}. 
Since every element of N 1 must occur as one of the numbers {n v \, it thus 
follows that N t is equivalent to {^}. It now follows that, either {M} is a 
finite aggregate, in which case its cardinal number is < X 0 , or else {M} 
is equivalent to {n}, in which ease its cardinal number is K 0 , contrary to 
the hypothesis that it is < N 0 . It has thus been shewn that the cardinal 
number of any infinite aggregate is either ; X 0 , or else that it is incom¬ 
parable with fc$ 0 . 

The following more stringent theorem was proved by Cantor: 

The cardinal number Ho is less than any other transfinite cardinal number. 

The proof of this theorem, unlike that of the foregoing one, requires 
the employment of an infinite process of choosing particular elements 
from an aggregate. If is equivalent to the theorem that every non-finite 
aggregate contains a part which is equivalent to {/?>}, the aggregate of finite 
numbers , that is, a part whose cardinal number is X () . 

If M be any non-finite aggregate, the process of choosing a first, 
second, ... wth element from the aggregate can be continued indefinitely 
without limit. Thus M contains a part that is equivalent to {n), and 
therefore (§ 147) its cardinal number is £ X 0 . Thus every cardinal number 
that is not finite, and is different from is > Xq. 

152. It has been shewn that N 0 + 1 = X () : a similar proof would shew 
that Kq f n = K, where n is any finite integer. 

In accordance with the definition of addition, N 0 -f Xq is the cardinal 
number of the aggregate (1, 3, 5, ... 2, 4, 6, ...), for is the cardinal 
number of each of the aggregates (1, 3, 5, ...) (2, 4, 6, ...); hence, since 
the cardinal number of (1, 3, 5, ... 2, 4, 6, ...) is the same as that of {n }, 
we have ^0 + K) “ ^05 which we may write as 

*,.2 = 2*, = *,. 

From this relation, by repeated addition of Kq to both sides of the identity, 

we ^ No .n = n.X 0 -K- 




209 


151-153] Smallest Transfinite Cardinal Number 

In order to express the product Ko.X 0 > we f° rrn the aggregate {(w, n')} 
of which the elements (n, n') consist of every pair of finite cardinal numbers. 
Let n \ n' — ft, then s has the values 2, 3, 4, ; and for any fixed value 

of $ the numbers n, n f have a definite number of sets of values. Let 6’ — 2 ; 
we then have one element (1,1): let s —• 3, we then have two elements 
(1,2), (2,1): for ,v 4, we have (1,3), (2,2), (3,1), and so on. The 
elements of {(n, n')} may thus be arranged in order so that the element 
(n, 7i ) is at the pth place, where 

p - ?i | l (n I it - 1) (n f n! - 2); 

thus the aggregate {( n , n')} is equivalent to {j?}, which has the cardinal 
number X 0 . 

It has now been proved that K 0 .N 0 X 0 , or -K; and from this 

the theorem — N () follows, by repeated multiplication by 

The theorems n . K - Xo, *,* - No express in a symbolical form the 
results which have been proved in § 58, that a finite, or an enumerahly 
infinite, set of enumerable aggregates makes an enumerable aggregate. 

TJ1E EQUIVALENCE THEOREM 

153. The proof referred to in § 147 will now be given, that, if M , N 
are any two aggregates such that M contains a part M l which is equi¬ 
valent to TV, and N contains a part N x equivalent to M , then M » JV. 
This theorem, which may be called the equivalence theorem, was first 
proved by Schroder* and independently by Bernstein*)'; but the form in 
which the proof is here given is due to Zermelof. 

Lemma 1. If a cardinal number a remains unaltered by the addition 
of any one of the enumerable set of cardinal numbers Pi, p 2 , - -pm •••> 
it remains unaltered if all these cardinal numbers p are added to it at once. 

Let M, P 1 , P 2 , ... P n , ... be aggregates of which the cardinal numbers 
arc <z, p x , p 2 , ... p n , . ■.; and such that P lt P 2 , ... P n , ... are all parts of M. 
We have then, 

M - (P l9 M t ) - (P 2 , M 2 ) - ... « (P n , M n ) = ...; 
where M l9 M 2 , ... are all parts of M , and in virtue of the hypothesis made 
in the statement of the theorem, 

M - E x = Jf 2 = ... - = 

for p n -f M = M - p n + M n . 

* See Jahrwbericht d. deutsch. math. Vereinigung, vol. v (1896), p. 81; also Nova Acta Leap, 
vol. lx xi (1898), p. 303. 

| See Borel’s Lemons sur la theorit des fonclions, p. 103. 

% Oottinger Nachrichlen , 1901, p. 34, “Ueber die Addition transfiniter Cardinalzahlen.” For 
remarks upon these proofs see Korselt, Math. Annalen , vol. lxx (1911), p. 294. 
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We may denote the (l, 1) correspondence which can be set up (see 
§ J 45) between M and M n , by M n ■■= 4> n M ; and this for every n. Now it 
is clear that this relation of correspondence is such that 
<f>M (<f>P l9 4>M 1 ) - 4>M t ) 

Hence M = (P t , M x ) y 

M 1 — </>j M — (4>iP 2> 4 * 1 ^ 2 ) ~ ), 

where P 2 ', M 2 are those aggregates which correspond to P 2 , M z , respec¬ 
tively, in the correspondence denoted by <f >\. 

Also, with a similar notation, 

^2 ~ 4*i 4*2 ~ (4*14*2^3’ 4*14*2 2 ) ~ (^3 > ^3 )> 

.. . * .. ? 

M r ~l = ^1^2 4*r-\M — (4>i4*2 ••• 4*t-1^*t'> 4*1 4*2 ••• 4*r~ l^r) ~ (^r > )• 

From these results we deduce 


M^(P l ,P t ',P 9 \...P r ',M r '); 

and no two of the parts JP,, jP 2 ',/y, ... P/, of if, have elements in 
common. This process of division of AT can be continued indefinitely; 


and we then have 


m ^(iyp^/y,... mj), 


where P r ', for every r, is included, and iW w ' consists of those elements 
which belong to M r ' for every value of r. From this we see that 
a = p ± + p 2 -f p 3 -f ... +■ 
where a' is the cardinal number of MJ. 


Let us now consider the special case of the lemma which arises when 
Pi, p 2 , ... are all equal, say to p. In this case, we see that, from the 
hypothesis a — p + a, the result a = Kp -f a follows, where denotes 
the cardinal number of the series of finite integers. 

Now since Nq = 2Ko, we have i ^p + «' = 2XoP -f a = K 0 p + a; it has 
thus been shewn that, if a — a -f p, then a —■ a -f «o7>- 
Returning to the general ease, we have 

a — a + K0P1 = a + No lh = •••; 
it now follows that a = Kq/), f + ... -) a", 

where a” is the value which a' takes when X,,/),, H 0 p 2 , ... are substituted 
for Pi,Pi, ... 

Hence we have 

a = 2Ko (Pi + Pi 4- ...) + a" = a + No (ft + p 2 + ...) 

= (No + 1) (Pi +Pi+ ■■■) + a" = a + ft + ft + ...: 

and therefore the lemma has been established. 


Lemma II. If the sum of two cardinal numbers p and q , when added 
to a, leaves a unaltered, then a is unaltered by the addition of either p or q . 





153,154] Equivalence Theorem 211 

For if a = a -f p + q, 

we have seen that a — a + N 0 (p -f- q ); 

hence a = a -f (Hi + 1) 7? + 

and also a — a -f (H) + 1) i 

from these equalities we have 

a — a -f p, and a ~ a + q. 

We are now in a position to prove the equivalence theorem. If 
a = P + p, and ft = a + q, 
we have a = a p -f q, 

and hence, by Lemma II, 

a = a + p a + q = ft; 

therefore, if M has a part equivalent to N, and N has a part equivalent 
to M, it follows that M = a \~ p = a f q = N\ where a is the cardinal 
number of the part of M that is equivalent to N, and is that of the 
part of N that is equivalent to M. 

In case M is itself equivalent to N, which will in particular happen if 
M is a part of N, we have p = 0, and the theorem holds good. 

In case the condition a = + p holds, but there is no corresponding 

condition ft = a + q, we have in accordance with the definition in § 147, 
a > p. It follows that the sum of two or more cardinal numbers is greater 
than , or equal to , any one of the cardinal numbers. 

The following theorem may be established: 

If the cardinal number a is unaltered by the addition of p, and if /3 ^ a, 
the cardinal number p is unaltered by the addition of q , where q £ p. 

For let ft~a + y, p = q + r; then, from a = a+ p = a-fgq-r, we 
deduce that a = a -f q. It then follows that 

P = a + y = a + q + y = a + r + y^P + q = P + r. 

154. A proof has been given by Cantor* that, if an aggregate exists of 
which the cardinal number is a , then an aggregate exists of which the cardinal 
number is greater than a. 

The proof is a generalization of the second proof, given in § 60, that 
the cardinal number c, of the continuum, is greater than that of the rational 
numbers. The proof may be put into the following form: 

Suppose M = {m} to be an aggregate, of cardinal number a; this 
aggregate M may be supposed to be simply ordered (see § 157) in any 
manner. In M let each element m be replaced either by A or by B, where 
Ay B are two given objects; then M is replaced by a similar aggregate 
(see § 157), in which each element is either A or B. An infinity of such 

♦ See JahresbericM d. deutsch. maih. Vereinigung , 1897. 
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aggregates will be obtained, differing from one another in respect of 
whether A or B has been put in the place of each element of M ; denoting 
the aggregate of all such possible aggregates M A1U by {M AIt \, it will be 
shewn that the cardinal number of {M AIi \ is greater than that of M. In 
the first place, it can be seen that the cardinal number of {M A h\ is equal 
to, or greater than, that of {in} ; for, taking any one- element w 0 , of {>//}, 
replace it by A, and all the other elements by B ; we have then an element 
of {AI ab}* and there is such an element- corresponding to each element w () , 
of {m }; thus those elements of { Mah} in which tliere is only one A, form 
an aggregate of cardinal number equal to that of [///). Next, let us assume 
that, if possible, all the elements of \Mab) are placed in (1,1) corre¬ 
spondence with those of {///}: it will then be shewn that an M A b can 
always be defined which is not included in the correspondence. Each 
M°ab, in {Ma /;}, now corresponds to a definite m {)% in {///.}; form a new 
aggregate M'ah in the following manner:—For each element M { *ab< in 
{MaiA, in which A takes the place of m 0 , in {rn}, substitute B; and for 
each element in {M AB ], in which B takes the place of m 0 , in {m} s 

substitute A; in this manner we form an aggregate M' A b in which each 
element is either A or />, which is similar to {w}, and which is not identical 
with any M A u that occurs in the correspondence between { M A //} and {///]. 
It- has thus been shewn that the cardinal number of the aggregate of all 
t he M AB is greater than that of M . If M is the aggregate a l , a 2 , u, ... a n ,... 
which is similar to the aggregate of integral numbers, and if, for A and B , 
we take 0 and 1 respectively, then the aggregate {M Ql } may be interpreted 
as the aggregate of all the rational and irrational numbers between 0 and 1, 
in the binary scale; and this aggregate is thus shewn to be unenunierable. 

Instead of replacing the elements of {m} by two letters A, B, we might 
have taken any finite number of letters, without altering the principle of 
the proof. In § 60, the ten digits 0, 1, ... 9 were taken instead of A and B. 
It will be observed that, even if {m} is normally ordered (see § 16. r >), the 
new aggregate {M} is not given as a normally ordered aggregate; and in 
default of proof it cannot be assumed that it is capable of being arranged 
in normal order. 

To replace all the elements of an aggregate either by A, or by B , is 
equivalent to taking a part* of the given aggregate. The theorem has 
thus been established that, the cardinal number of the aggregate , each 
element of which is a part of a given aggregate , is greater than the cardinal 
number of the given aggregate , all possible parts being contained in the new 
aggregate. 

155. An important question relating to cardinal numbers in general 
arises as regards the sum, a 4- jS, of two transfinite cardinal numbers. 

* See Borel, Lemons sur la thdorie des fonclions , p. 108. 
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If a -f ft y, it may be the ease that y is greater than both the 
numbers a, ft, or that it is equal to one of them; it being assumed that 
a, ft are comparable cardinal numbers. Whether the first case can arise, 
or not, has not been definitely settled*, even in the case in which a ft; 
thus it is not known whether 2a is necessarily equal to a, as is the ease 
when a - c, or when a — X () . 

In case a 2a — a | a, it then follows, by applying Lemma I of 
§ 153, that a --- a f X ( /r (X 0 -f 1 ) a; and thus that a *** X 0 rc. 

The special case in which a is the cardinal number of a set of points 
in space of any number of dimensions has been referred to in § 90, where 
the hypot hesis has been alluded to that, if x be the cardinal number of such 
set, then ax =■- x . That this holds good, involves the assumption that 
2 x — x; which is equivalent to the assertion that, if there exists a set, of 
cardinal number x , in each of two non-overlapping cells, the cardinal 
number of the combined set is also x. 

DIVISION OF CARDINAL NUMBERS BY FINITE NUMBERS 

156. If two aggregates have the same cardinal number, and if each 
of the two Aggregates be divided into the same finite number n, of parts, 
such that the n parts of the first aggregate all have the same cardinal 
number, and also the n parts of the second all have the same cardinal 
number, then it can be proved that the cardinal number of one of the 
parts of the first aggregate is the same as that of one of the parts of the 
second aggregate. Symbolically, the theorem may be stated in the form :— 
if a, ft are cardinal numbers such that na — lift, then a — ft. 

This theorem has been proved by Bern stein f. It will be sufficient to 
give the detailed proof in t he case n 2, as the proof in the general case 
is obtained by generalization of that employed in the particular case. 

Since an aggregate is equivalent to itself, any special mode of exhibiting 
such equivalence, by which each element is made to correspond to a 
definite other element, may be called a transformation of the system into 
itself. As regards all such possible transformations the following pro¬ 
positions may be seen to hold: 

(1) . The transformations of ail aggregate M into itself form a group <f > M . 

(2) . Let l,yi,y 2 ,y 3 , ... denote a sequence of transformations of M 
into itself, 1 denoting the identical transformation, and let this sequence 
form a group which is necessarily a sub-group of 4>m ; then the condition 
that the sequence forms a group is that, corresponding to any two integers 

* See Schoenflies, Die Entwickelung ..., vol. n, p. 9; also Jourdain, Phil. Mag. (6), vol. vi 
(1903), p. 323. 

f Inaugural Dissertation, “ Untersuchungen aus dor Mengenlehre,” Halle, 1901. This is 
reproduced in Math. Annalen , vol. lxi (190«5), p. 117. 
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m, n f there is a third r, such that XmXn ~ Xr • Further, let us suppose that, 
to every y n there corresponds a definite such that XnXn ^ 1* If wi be 
an element of M , such that m ± (m), for w = 1, 2, 3, ..., then 

Xn (™) + Xn' M’ 

where n and n’ are any unequal integers. 

(3) . If m and m ' are any two distinct elements of M , and if m ^ (m'), 

for n — 1,2, 3, ..., then (m) ^ y n , (ra'). For if y n (m) - y n , (m'}, we 
should deduce that m ■= (ra) =- y'y n , (m') = (ra'), which is contrary 

to the hypothesis made. 

(4) . If Tj, T 2 ,... are parts of M, such that each T has no element in 
common with another T , we may say that the 7 7 \s form a system of 
separate parts of M. 

If T — {£} is a part of iff, and if t ± y n (£')> for n — 1, 2, 3, ..., then the 
equivalent aggregates T, (T 7 ), y 2 (7 T ), ... form a system of separate parts 
ofJf. 

(5) . If T is a part of M which satisfies the condition stated in (4), 
then M — M f- T. 

For T,Xi(T), x 2 (T 7 ),... are all parts of M having the cardinal number 7 7 ; 

and if R is the part of M which remains when all these separate parts are 

removed, we have r= == 

M = R + Kq.7 7 ; 

hence M + 7 = I + (K + 1) 5* 

= S + Ko.f - Jr. 

After these remarks we can proceed to the proof of the theorem:—Let 
{&)> -M — JTj *f- oc 2 ~ or 3 -■}- ,7\j, 

( 6 ) , = 

( c )> ^3 = ^ 4 ; 

then it is required to shew that = f 3 ; which involves x 2 = f 4 . 


The three equations (a), (6), (c) may be regarded as denoting that there 
are three reversible transformations of the aggregate M into itself, which 
may be denoted by (f> a , <f> b , <f> c respectively; the reversibility of these trans¬ 
formations is expressed by <f> a 2 = <f > b 2 — <^ c 2 = 1. 

The transformation <f> a involves Xj = (x n , x 14 ), where x 13 are those 
elements of x 1 which are transformed into elements of x 3 , and x 14 those 
which are transformed into elements of x 4 ; on the whole we have 


( 6 ). 


X 1 ~ (^13 > ^14 )> 

x 2 — (a?23 > ^ 24 ) > 

#3 — (# 31 , x 32 ), 

*4 = (*41>*4 2 )> 


where x ik = x ki . 


If 7\ is any part of x x , and T 2 an equivalent part of x 2i we may denote 


by x x *, x 2 *, the aggregates obtained by interchanging those elements of x t 
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which belong to 1\ with those of x 2 which belong to T 2 ; we have then a 
similar set of equations to (6) for the new starred aggregates, and Ij* = , 

I 2 * — f 2 , J 3 * = I 3 , i 4 * ~ # 4 . Thus if the theorem be proved for the 
starred aggregates, it holds for the original ones. 

We have to shew that, after suitable transformations, a system of 
division of the aggregates into parts, of the form in (6), can be found, 
such that I 13 -f x l4 — x Uy and ir 31 -f- x 32 — x^. For, from these equations, 
we deduce x x = x 2 = t 14 , x 3 — x 4 = x^, and then the aggregates x 2 , x 4 are 
such that each has a part which is equivalent to the other; and conse¬ 
quently, in accordance with the equivalence theorem, x 2 , x 4 are equivalent 
to one another; or x 2 = x 4 . It has in fact to be shewn that x 13 can be so 
chosen that it is negligible with respect to cardinal number, in comparison 
both with x 14 and with . 

We form the systems of transformations 

06 X 2 > 06 0c = X 4 > 06 0c 06 = Xe > • • • > 

0C •= X3 5 0c 06 = X5> 0C060C X?> 

each transformation y in this system has one inverse, given by the scheme 
X 4 «X 4 n+i = 1, X 4 n^ 2 X 4« + 2 X 4 n-f 3 X 4 n +3 = 1 ; thus the transformations y form 

a group of reversible transformations of M (= (x l9 x 2 )) into itself. 

An element of x 13 , is either, (i) transformed into an element of x M 
by a transformation y, with finite index, or else, (ii) e 13 is not transformed 
into an element of x M by any of the transformations y. Suppose then that, 
for every element ej 3 , of x 13 , the second of these cases arises, then y 2r , y 2r41 
transform the elements of x 13 into aggregates which arc respectively in 
and x 14 , and in them these aggregates form an enumerable system of 
separate parts of each. For, in the case contemplated, y 2 transforms x 13 
into a part of x ^; by y 4 , the elements of x 13 become elements of x^ or x M , 
consequently, in accordance with (ii), y 4 (# 13 ) is a part of x^. In this 
manner it is seen that x 13 is transformed, by every y 2n , into a part of x^, 
and by every y 2n41 , into a part of # 14 . It then follows, by proposition (5), 
that x 13 + x 14 = x 14 , x 13 f x 23 — x 23) and the theorem is thus completely 
established. The remainder of the proof consists in shewing that, by an 
exchange of elements of x 13 with elements of %, it is possible to arrange 
so that the case just considered always arises. 

Suppose x 13 are those elements of x 13 which are transformed by y 2 into 
elements of x u ; let x 13 " denote those elements, different from x 13 , which 
are transformed, by y 3 , into elements of x 24 which were not affected by y 2 , 
and so on; we thus obtain the scheme 


X 13 

X 2 (^lS ) 

in 

X 13 0 X 13 

X2 (^13 ) 0 X3 (^13 ) 

in x Ut 

^13 0 ^13 0 X 13 

X2 (^13 ) 0 X3 ( X 13 ') 0 X4 ( X 13 ' ) 

in x u , 

*XS + ••• + *^13^ W> 

X 2 (^13 ) 0 Xs ( x is ) 0 ••• 0 X«+i 

in # 24 . 
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We take now the equivalent sums 

[x 13 ] = £ .V n) , and [x M ] - £ x«u (*i 3 (n> ), 

n 1 71 ‘ 1 

and we carry out an exchange of [a? 13 ] with | ;r <24 ]; we then have 

t r(^ 3 )J, *^24 “ L^lJ t [(^ 24 )]* 

When the exchange has been made, of the elements of [x 13 ] with those 
of [^ 34 ], we denote the new aggregates by starring the original ones; we 
have then, in accordance with the formulae ((>), expressions for #,*, x 2 * y 
x :i *, :r 4 *, and we can, as has been shewn above, attend to these, instead 
of to the original x l ,x 2 , :r 3 , r 4 . Now no element of x 13 * is transformed 
into an element of .r 24 * by any of the transformations y, it being understood 
that the transformations y are not to affect the substituted elements; 
and thus, by the reasoning which has been given above, for the case in 
which no element of x 13 is transformed into an element of the theorem 
is established. Bernstein has also proved that, if 2a — a j - p, where a, fi 
are cardinal numbers, then a ^ ft. 

T1IE ORDER-TYPE OK SIMPLY ORDERED AGGREGATES 

157. A n aggregate M is said to be a simply ordered aggregate when each 
element m has a definite rank relatively to the other elements of M , so that , of 
any two elements w, m' whatever , it is known which has the higher and which 
has the lower rank. 

If m has a lower rank than m\ the fact is denoted symbolically by 
m < m'; and if a higher rank, by m > m r . 

If an aggregate is given, at first unordered, it may be possible to order 
the aggregate in a variety of essentially distinct ways. If the aggregate 
is finite, the ordering of it may be accomplished by arbitrarily assigning 
to each element its rank relatively to the others. In case the aggregate is 
an infinite one, the ordering of it consists in the setting up of some general 
rule which suffices logically to assign the relative order of any two elements. 

Besides simply ordered aggregates there exist also doubly, or trebly, 
ordered aggregates, or also aggregates with higher degrees of multiplicityf 
of order. Each element of such an aggregate possesses two, three, or more 
distinct characteristics of an ordinal character. Only simply ordered 
aggregates will be considered here. 

Two simply ordered aggregates M, N are said to be similar when a (1, 1) 
correspondence can be established , in accordance with some law , such that , 
to any two definite elements m, m\ of M, there correspond two definite ele¬ 
ments n, n\ of N, in such a manner that the relative order of m, ra', in M, 
is the same as that of the corresponding elements n, n , in N. 

t Multiple order-types have been considered by E. Riesz, Math. Annalen, vol. lxi (1905), 
p. 40b. 
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This relation of similarity may be represented symbolically by M — N. 

Every simply ordered aggregate is similar to itself. 

Two simply ordered aggregates which are similar to a third are similar 
to one another. 

All simply ordered aggregates which are similar to one another are said 
to have the same order-type. 

An order-type is accordingly characteristic of a class of similar aggre¬ 
gates. 

The order-type of a simply ordered aggregate M is defined by Cantor 
as the concept which is obtained by abstraction when the nature of the 
elements of M is disregarded, their order being alone retained. The order- 
type of M is then denoted by M. This definition will be further discussed 
in § 191. That similar aggregates have the same order-type is regarded 
by Cantor as a deduction from this definition. 

if, in Af, we further disregard the order of the elements, we obtain M , 
the cardinal number of M . 

The order-type of M is, from Cantor’s point of view, regarded as a 
simply ordered aggregate similar to M , such that each element is the 
number 1 . If any order-type be denoted by a , the corresponding cardinal 
number is denoted by a. 

Corresponding to any given transfinite cardinal number, there is a 
multiplicity of order-types which form a class; each such class of order- 
types is characterized by the common cardinal number of all the order- 
types of the class. 

The order-types which belong to the class corresponding to a cardinal 
number a form an aggregate which has a cardinal number a. It will 
appear that a is always greater than a. 

If the order of every pair of elements in a'simply ordered aggregate M 
be reversed, the aggregate in the new order is denoted by *M. 

If M = a , then the order-type *M is denoted by *«. 

The order-type of the aggregate of all the finite integers, in their 
natural order (1, 2, 3, ...), is denoted by at. This is therefore the order-type 
of every aggregate (a ls a 2 , ... a n , ...) which is similar to (1, 2, 3, ...). 

The aggregate (... a n ... a 2 , a x ) has the order-type *o>. 

TIIE ADDITION AND MULTIPLICATION OF ORDER-TYPES 

158. If M , N denote two simply ordered aggregates, and if the aggre¬ 
gate (A/, N) be formed, in which all the elements of both M and N occur, 
and which is such that any two elements of M have the same relative 
order as in M, and that any two elements of N have the same relative 
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order as in N, and further that each element of M has a lower rank than 
all the elements of N, then the new simply ordered aggregate (M, N) is 
said to be the sum of the two simply ordered aggregates M and N. it is 
clear that if M - M\ N - N', then ( M , N) - (M\ N'), and thus that 
the order-type of (M, N) depends only on the order-types of M and N. 

If M - a, N ----- ft, the sum a f ft is defined to be the order type of the 
sum ( M , N) of the two simply ordered aggregates , as defined above. 

This defines the operation of addition of order-types. It will be seen 
that the addition of order-types does not obey the commutative law. For 
if a = M, ft := A\ then a -f ft ----- (M, N): but ft 4- a -= (N, M); and the two 
order-types (M , A T ), (N, M) are in general different from one another. 

If n denotes a finite integer, w 4- n is the order-type of the ordered 
aggregate , e z , e 3 , ... f\, f 2 , .../ n ), whereas n }~ to is the order-type of 
(/i,/ 2 , .../„, e,, e 2 , e 3 , ...). it is clear that the first of these aggregates is 
not similar to {g 1 , g 2 , g 3 , ...), but if we make / 2 ,/ 2 , .../„ correspond to 
9 i, 92 , 9 i, ••• 9n, then e, to g„ H , e 2 to ;/ n+2 , ... and in general e m to g n+m , 
it is seen that the second of the above order-types is similar to (g 1 , g 2 , g 3 ,...). 
It thus appears that n | to — to, but to + ni fi to. 

159. In the simply ordered aggregate N, let us suppose that, in the 
place of each element, there is substituted a simply ordered aggregate 
similar to M, whereby a new simply ordered aggregate is formed; this 
may be denoted by M.N. It is clear that if M — M', N — N', then 
M.N — M' .N'; thus the order-type of M.N depends only on the order- 
types oiM and N. 

If a ~ M, ft — N, the product a. ft is defined to be M .A 7 , the order-type 
of M.N, as just defined. 

It will be seen that the product a.ft is in general different from ft.a, 
and thus that the multiplication of order-types does not obey the com¬ 
mutative law. For example, to. 2 is the order-type of the aggregate formed 
by substituting in (a 1 , a 2 ) for each of the two elements an aggregate of 
type to; to. 2 is therefore the order-type of (b lt b 2 , 6 3 , ... c lf Cg, Cg, ...), in 
which there is no last element, and no element immediately preceding c x . 
On the other hand, 2. to is the order-type obtained by substituting for 
each element in (a,, a 2 , a 3 , ...)> an aggregate consisting of two elements; 
and 2. to is thus the order-type of the enumerable aggregate 

(an, tt 12 > a 21 , &22 , a 31 f a 32 > •••)» 

which is similar to (b ly b 2 , b 3 , ...), as may be seen by making a nl correspond 
to 6^.! and a n2 to b 2n . It has thus been shewn that 2. co = co, but co . 2 ^ co. 
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THE STRUCTURE OF SIMPLY ORDERED AGGREGATES 

160 . An examination of the structure of a simply ordered aggregate M 
can, in general, only be attempted by considering the nature of those 
aggregates which are its parts, and in each of which parts the order of 
the elements is the same as that of the same elements in the whole aggre¬ 
gate. The simplest transfinite part of an ordered aggregate is that which 
has one of the types co, *o>. Such parts we speak of as ascending sequences , 
and descending sequencer, respectively, contained in M. 

Two ascending sequences {a n }, {a n '} 9 contained in M , are said to be 
related to one another , provided that, corresponding to any element a ny of 
the first, there are elements a' n *, of the second, such that a n < a ' n ,; and 
provided also that, corresponding to any element a n ', of the second, there 
are elements a n . y of the first sequence, such that a n ' < a n >. 

Two descending sequences {b n }, {£>„'}, contained in M , are said to be 
related to one another, provided that, corresponding to any element b n 
of the first sequence, there are elements b f n > 9 of the second, such that 
b n > />V; and provided also that, corresponding to any element b n ' y of 
the second, there are elements b n , y of the first sequence, such that b„' > b n >. 

An ascending sequence {a„} and a descending sequence {b n }, contained 
in M , are said to be related to one another, if a n < b n ,, for every n and n '; 
and further, provided there exists in M no element, or only one element m, 
which is such that a n < rn < b „, for every n. 

Two sequences contained in an ordered aggregate, which are both 
related to a third sequence, are related to one another. 

Two sequences in an ordered aggregate, which are both ascending, or 
both descending, and of which one is a part of the other, are related to 
one another. 

161 . Suppose that, in an ordered aggregate M, there is an element m 0 
which satisfies the following conditions, with respect to an ascending se¬ 
quence contained in M : 

(1) , for every n, a n < ra 0 ; 

(2) , for every element w, of M, which is < m 0 , there exists a number n 
such that a n , a njl) a n42 , ••• are all > m; then the element m 0 is said to be 
the limiting element , or limit of {a n } in M ; and m 0 is said to be a principal 
element of M . 

Similarly, if we suppose that, in M y there is an element ra 0 , which 
satisfies with reference to a descending sequence {a n }, contained in M , the 
following conditions: 

(1) , for every n, a n > m Q ; 

(2) , for every element m, of M , which is > , there exists a number n 
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such that a n9 a nil , a 71 f2 , ... are all < m ; then the element m 0 is said to be 
the limiting element , or /rwi/ o/ {aj in M ; and m 0 is said to be a principal 
element of 71/. 

A sequence contained in M can never have more than one limiting 
element in M. 

If a sequence in M has a limiting element m {) , in M, then m 0 is the 
limiting element of every sequence in M which is related to the first one. 

Two sequences which have the same limiting element, in M , must be 
related to one another. 

It is clear that, if 71/, M' are similar ordered aggregates, an ascending, 
or a descending, sequence in M corresponds to a sequence of the same 
kind in M'. To every principal element in M there corresponds a principal 
element in M r . 

An ordered aggregate which- is such that every element is a principal 
demerit is said to be dense in itself. 

If , in an ordered aggregate , every sequence which is contained therein has 
a limiting element in the aggregate , then the ordered aggregate is said to be a 
closed aggregate. 

An ordered aggregate which is dense in itself , and also closed, is said to 
be perfect. 

An ordered aggregate which is such that ., between any two whatever of its 
elements, there are other elements of the aggregate , is said to be everywhere 
dense. 

The properties of an ordered aggregate, thus defined, are also properties 
of any similar aggregate; hence the terms may be applied to the order- 
types which are symbolized by replacing the elements of the ordered 
aggregate's by I ; there can exist therefore an order-type which is dense in 
itself, or closed, or perfect, or everywhere dense. 

The terms which have been here employed for the purpose of describing 
certain peculiarities which may exist in an ordered aggregate, or in the 
corresponding order-type, are identical with those which we have em¬ 
ployed in analogous senses in Chapter n, in the case of sets of points, or 
numbers. There is however a distinction which must be noticed between 
the use of the terms in the two cases. To illustrate this distinction, let 
(P 1 , P 2 , P 3 , ... P n , ...) be a sequence of points on a straight line, which 
sequence has a limiting point P w , on the right of the points P v ; then if Q 
be any point of the straight line on the right of P w , the two ordered aggre¬ 
gates (P lf P 2 , P 3 , ... P n , ... P w ), and (P 3 , P 8 , P 3 , ... P w , ... Q ), are similar, 
and have the same order-type a» + 1. In the first of these aggregates, 
P w is the limiting element of the sequence (P 3 , P 2 , ... P n , ...); and, in the 
second aggregate, Q is the limiting element of the same sequence; and 
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therefore both the ordered aggregates are closed, in the sense explained 
above. The first of these aggregates forms a closed set of points, in the 
sense of the term delined in Chapter 11 ; but llie second does not, since Q 
is not a limiting point of the set of points {P n }. The distinction rests upon 
the different use of the terms limiting element and limiting point, in the 
two eases of an ordered aggregate of elements in general, and that of a 
set of points in the continuum. The question whether an element is a 
limiting element of an aggregate to which it be ongs, or not, in the sense 
defined above, is answered by examining the structure of the ordered 
aggregate itself. In the case of a set of points in the continuum, a par¬ 
ticular point may be a limiting element of the aggregate of points con¬ 
sidered merely as an aggregate of elements with a particular order-type: 
but the question as to whether the same point is a limiting point of tlie 
set of points, considered as chosen out of the continuum, can only be 
answered after an examination of the ordinal relation of the point to other 
points of the continuum which do not belong to the set; in fact, the set 
must be regarded, for this purpose, as an aggregate which is only a part of 
another aggregate, the continuum. It is now clear that a set of points, 
considered solely as an ordered aggregate of elements, without reference 
to the fact that it is essentially a part of the continuum, may be closed, 
or perfect; and yet that the same set of points need be neither closed nor 
perfect, in the sense of the terms employed in the theory of sets of points, 
which has been dealt with in Chapter ti. 

THIS ORDER-TYPES 77 , 0, tt 

162. Cert i*in order-types which are of special importance will be now 
examined. 

The first of these is the order-type* ry, of the set A\ of rational numbers 
between 0 and 1 (both exclusive), in their order, as defined in Chapter X. 

It will be shewn that the order-type 77 is exhaustively characterized 
by the following properties: 

(1) . fj - K„ = a. 

( 2 ) . There is in 77 no lowest, and no highest, element. 

(3) . 77 is everywhere dense. 

In fact, every simply ordered aggregate M , which has these three 
characteristics, is similar to the aggregate R. 

To prove this, we first observe that, on account of the condition ( 1 ), 
the order of the elements in both M and R can be so altered that each of 
them is reduced to the order-type to. Let this he done; and denote by 
M 0 , R {) the new ordered aggregates 

M 0 - (in Xl m 2 , /%, ...), 

-fto = (»1 ,r 2 ,r 3 , ...). 
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We have to shew that M—R\ and to do this we have to shew how to 
establish the requisite correspondence between the elements m, of M , 
and r, of R. Let be made to correspond to r,; then there are an in¬ 
definitely great number of elements of M, which have the same relation, 
as regards order, to m 1 , as r 2 has, in R , relatively to r 1 ; of all these elements 
choose that one which has the smallest index as it appears in M 0 ; 
and let m et be made to correspond to r 2 . Of all the elements of M , which 
are related to m 1 and w <2 , in the same manner, as regards order in M , 
as r 3 is related to r x and r 2 , as regards order in R, choose that one w fJ , 
which has the smallest index as it appears in M 0 ; and make m t3 correspond 
to r 3 . 

Proceeding in this manner, we make the elements r x , r 2 , r 3 , ... r n , of R y 
correspond to the elements m 1% ra <2 , m t3 , ... m« n , of M\ and so far as these 
elements are concerned the relations of rank are preserved in the corre¬ 
spondence: we proceed then to choose, in the same manner as before, the 
element w, R+1 , which is to be made to correspond to r w+1 , and thus we 
obtain, for every r ny the corresponding m , n . It must however be shewn 
that this process exhausts all the elements m, of M , that is to say, that 
in the sequence 1, e 2 , e 3 , every integral number p occurs in some 

definite place. This can be proved by the method of induction. Let us 
assume that the elements m 1 , m 2 , m 3 , ... m n all occur in the correspondence 
that has been set up between the whole of R and at least a part of M\ 
then we shall prove that m n+1 also occurs. Upon this assumption, let A 
be so great that, among the elements m 1 , m ei , m et) ... m, A , all the elements 
Wj, ra 2 , m 3 , ... m n occur. Then, if m n+1 is not also among those elements, 
choose out of r A+1 , r A+2 , r A+3 ,... that element r A+s , with the smallest index, 
which has the same relation to r 2i ... r A , as regards order in R , that 
m n+1 has relatively to %, , m tt , ... m, A , as regards order in M. Then 

the element m n+1 has the same relation to m ly m €l ,-m €| , ... m €K+t _ xi as 
regards order in M y as r A+s has to r ly r 2i ... r A+ ,_j, as regards order in R. 
It thus appears that m n+l is the element with the smallest index as it 
appears in M 0y which has, in M y the same relation as regards order to 
mj, m tl , ... that r A+s has, relatively to r ly r 2 , ... in R; hence 

~ m n+1 ; that is, the element m n+1 occurs in the correspondence which 
has been established between M and R. It has now been shewn that M 
and R are similarly ordered aggregates. 

Examples of the order-type rj are the following : 

(1) . The aggregate of all negative and positive rational numbers, 
including zero, in their natural order. 

(2) . The aggregate of all rational numbers which are greater than a, 
and less than 6, where a, b are two real numbers such that a < b. 
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(3) . The aggregate of all real algebraical numbers in their natural order 
in the continuum, or of all such of these numbers as lie between two real 
numbers a, b. 

(4) . The aggregate of a set of non-abutting linear intervals which are 
such that their end-points and the limiting points of these end-points 
form a non-dense perfect set of points in a linear interval. 

The rational numbers of the interval (0, 1), including 0 and 1, form an 
aggregate of the order-type 1 +ry-{ 1. 

163 . We now proceed to the consideration of the order-type 0, of 
points forming a linear closed continuum. 

It will be shewn that any simply ordered aggregate M is similar to the 
aggregate A of all real numbers of the continuum (0, 1), in their natural 
order, provided (1), M is perfect, and (2), in M, an aggregate 8 , with the 
cardinal number is contained, which is so related to M, that, between 
any two elements m 0 , m ly of M, there are elements of 8 . 

If 8 has a lowest and a highest element, these can be removed without 
affecting its relation to M ; and thus we may suppose 8 to be of the type rj, 
of the aggregate R of rational numbers which lie between 0 and 1, both 
exclusive, in their natural order. 

Since 8 — R, we may suppose the elements of 8 to be made to corre¬ 
spond in order to the elements of R\ and it will be shewn that this 
correspondence enables us to establish a correspondence between the 
elements of M and X. 

We suppose that each element of M , which belongs to 8 , corresponds 
to that element of X which belongs to R, just as in the correspondence 
of 8 with R already established. Any element rn, of M, which does not 
belong to 8 , is the limiting element of a sequence {m n } of elements of 8 . 
To this sequence {m n }, there corresponds a sequence {r n } in A, all the 
elements of which belong to R; and this sequence { r n } has a limiting 
element x , in A, not belonging to R ; we take therefore m, in M, to corre¬ 
spond to x, in A. If we take a different sequence {x n '}, which has the 
same limiting element m as before, in M , then there corresponds to it a 
sequence {r n '} in R , which has the same limiting element x as before, in A. 
It will now be shewn that, in the correspondence so established between 
the elements of M and of A, the relative order of two elements of M is 
the same as that of the corresponding elements of A. This clearly holds 
of any two elements of M which are also elements of 8. Consider next 
two elements m and s, of AT, the first of which does not, and the second 
of which does, belong to 8; and let x 1? r be the corresponding elements 
of A. If r < x x , there exists an ascending sequence in R , of which x Y is 
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the limiting element, such that all its elements are > r; then, to this 
sequence there corresponds an ascending sequence in 8 , all the elements 
of which are > s, and of which m is the limiting element; hence s < m. 
If r > x lf it can be shewn, in a similar manner, that s > m. The proof 
that, corresponding to any two elements m l9 ?n 2 , of M, which do not belong 
to 8 , the elements x l9 x 2 , of X, are such that w.j ^ , according as $ x 2> 

is of a precisely similar character to that just given. It has thus been 
shewn that M and X are similar aggregates, and that the type 0 is charac¬ 
terized by the conditions (1) and (2). 

The above characterizationt of the type 0 contains Cantor’s ordinal 
theory of the constitution of the linear closed continuum. 

A non-dense perfect set of points in a linear interval lias not the 
order-type 6, but the set of complementary intervals together with the 
limiting points of their end-points does form an aggregate of order-type ft, 
when the elements, consisting partly of points, and partly of intervals, are 
taken in the order in which they occur in the continuum. 

164. The order-type *o> + w may be denoted by r r, and is the order- 
type of the negative and positive integeis in their natural order. This 
order-type has properties distinct from those of to. For example, n -f oj 
has been shewn to be identical with a>, where n is a finite integer; but 
n -f 7 T is not identical with tt. From either of the equations n 4- n — m 4- 77, 
or 77 -f n = 7 T 4 - m, there follows m = n , or more generallyJ : 

If n, n' are finite integers, £ and £' other order-types, from the equation 
n 4~ tt 4- £ = n' 4- n 4- £', there follows n — n\ £ — £'. 

To prove this theorem, we observe that, if the two aggregates be 
placed into similar correspondence, the lowest elements correspond to 
one another, then the second, and so on; hence n — n' is proved at once: 
and we now have tt + £ ~ rr 4- £'. 

When two simply ordered aggregates M n 4 - Z, N n 4 - Z', of order-types 
77 4 - £, 77 4 - £', are placed in correspondence in order, either M n corresponds 
to N wt or M n corresponds to a part of N„, or else N n corresponds to a part 
of M n . In the last two cases the order-type 77 must be split up into 
tt = ^ 4- 77 2 , where 77 == tt 1 , and n 2 is some other order-type; but from the 
definition 77 = *co 4 - <0, it is clear that every mode of dividing tt into two 
parts, without altering the relative order of the elements, leaves it in the 
form *a> 4 ~ co ; hence it is impossible that 77 — tt x 4 - n 2 , and 77 = 77 x ; and 
therefore M n corresponds to N n . Hence also Z corresponds to Z f ; or 

c = «'• 

f See Russell, Principles of Mathematics , vol, 1 (1903), p. 303, also Veblen, Trans. Amer. Math . 
Soc . vol. vi (1905), p. 165 and Huntington, Annals of Math. (2), vols. vi (1904), p. 151 and vu 
(1905), p. 15. 

J Bernstein, loc. cit. p. 9. 
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NOHM ALLY ORDERED AGGREGATES 

165. The order-type of a simply ordered aggregate is, as we have 
already seen, such that the structure of the aggregate, as revealed by an 
examination of the sequences contained in it, may be of the most varied 
character; the various sequences may be ascending or descending ones., 
and may, or may not, have a limiting element within the aggregate. 

Of all the possible order-types, those are of especial importance which 
have been defined by Cantor as the order-types of normally ordered 
aggregates (wohlgeordnete Mengen). 

A normally ordered aggregate M is one which satisfies the following 
conditions : 

(1) . M has an element m, of lower rank than all the other elements. 

(2) . // M 1 is any part of M, and if M contains one or more elements 
which are of higher rank than all the elements of 71/-,, then there exists one 
element m', of M, which immediately follows the part-aggregate M 1 , so that 
there are no elements of M which are intermediate in rank between m' and all 
the elements of M x . 

The special case of (2) which arises when M x consists of one element , 
shews that a normally ordered aggregate is such that each element has 
one which immediately follows it, unless the element is the highest element 
of M. It is however not necessarily the case that M has a highest element. 

If e x , e 2 , e 3 , ... e n , ... be an ascending sequence of elements contained 
in M, and such that elements exist, in M , which are of higher rank than 
every e n , then there exists an element e\ of M, which is higher than all 
the e n , and such that every element e", of M, which is lower than e' is 
lower than e n , e n+1 , e n+2 , ..., for some definite value of n. 

Every part of a normally ordered aggregate has a lowest element. 

Let M x be a part of M ; if M , contains m x , the lowest element of M , 
then m x is the lowest element of M x . If M x does not contain m,, consider 
that part of M which contains all those elements every one of which is 
of lower rank than all the elements of M x ; this part of M must have an 
element which immediately follows it; and this element belongs to M x , 
and is its lowest element. 

If a simply ordered aggregate M itself , and also every part of M, has a 
lowest element , then M is normally ordered. 

The condition (1) is fulfilled. Let M x be a part of M such that M 
contains elements which are higher than all those of M x ; let these form the 
aggregate M 2 , and let m be the lowest element of M 2 . Then m is the 
element which immediately follows M x ; and thus the condition (2) is 
satisfied. 
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This property of a normally ordered aggregate, that every part of it 
has a lowest element, might be adopted as the definition of a normally 
ordered aggregate. 

A somewhat simpler property, which might be employed to define a 
normally ordered aggregate, is the following: 

An aggregate M is normally ordered j* if, and only if, it contains no part, 
of which the order-type is *co. 

If M is not normally ordered, at least one part of it must have no 
lowest element, and this part contains a sequence whose order-type is *<o. 
This follows from the theorem of § 151, that every aggregate that is not 
finite contains a part, of cardinal number Such part can be ordered 
according to the type to, when it has no highest element, and according 
to the type *< 0 , when it has no lowest element. An aggregate which has a 
lowest element, and is also such that each element has one that imme¬ 
diately succeeds it, is not necessarily normally ordered, even if each 
element has one immediately preceding it. This can be seen by considering 
an aggregate with the order-type co -f *co. 

166 . The following properties of normally ordered aggregates can be 
proved in a very simple manner: 

Every part-aggregate of a normally ordered aggregate is itself normally 
ordered. 

Every ordered aggregate which is similar to a normally ordered 
aggregate is itself normally ordered. 

If, in a normally ordered aggregate M, there be substituted for the 
elements normally ordered aggregates, in such a manner that, if M m , M m - 
are the aggregates substituted for any two elements m, m', then M m $ M m> , 
according as m $ m r , the resulting new aggregate is normally ordered. 

167 . The part of a normally ordered aggregate M which consists of all 
those elements which are of lower rank than an element m, of M, is called 
the segment of M determined by the element m. 

The aggregate which remains when the segment of M , determined by 
the element m, is removed from M, is called the remainder of M deter¬ 
mined by an element m. The element m is the lowest element of the 
remainder. 

If 8 is the segment of M , determined by m, and R is the remainder, 
then M — (8, R). 

The segment of M determined by the lowest element of M contains 
no element, but it may be regarded as existent, and being the null-seg¬ 
ment. The remainder, determined by the lowest element, is the aggregate 
M itself. 

t See Jourdain, Phil. Mag. (6), vol. vn (1904), p. 65. 
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Of two segments S, S' determined by the elements m, m', of which 
m < m', we say that S is the smaller, and S' the larger segment, or S < S'. 

It can easily be seen that, if M, M x are two similar normally ordered 
aggregates, a segment of M corresponds to a similar segment of M x , the 
element by which the segment of M is determined corresponding to the 
element of M x by which the segment of M x is determined. 

A normally ordered aggregate is not similar to any of its segments. 

Assume that, if possible, S - M, and suppose the elements of S, M are 
put into correspondence. To the segment S, of M, there must correspond 
a segment S x , of S, so that S x - M - S, where S 1 < S. Since S } - M, 
we find in a similar manner a segment S 2 < S x , which is similar to M , 
and so on; and in this way we obtain an unending sequence 
S > S x > S 2 > ... > S n > ... 

of segments of M, which are all similar to M. Let m,m x ,m 2 , ... m n , ... 
be the elements which determine the segments S, S x , S 2 , ... S n , ...; then 
m > m x > m 2 > ... > m n > .... 

The aggregate (. m n , ... m 2 , m Xi m) would be a part of M which 

has no lowest element, and is of type ; this is impossible if M is normally 
ordered. 

If M is an infinite normally ordered aggregate, it always has parts 
which are similar to M, although such a part cannot be a segment. 

A normally ordered aggregate cannot be similar to any part of one of its 
segments. 

Let us assume that, if possible, S' a part of a segment S , of M, is 
similar to M. Since S' — Af, we can place the elements of S', M in corre¬ 
spondence, then, to the segment S, of M , there will correspond a segment 
S x , of S', where S x - S; let then S x be determined by the element e x , of S'. 
Since e x is also an element of M, it determines a segment M x , of M, of 
which S x is a part, and which has a part similar to M. Proceeding in the 
same manner, we determine a segment M 2 , of M x , which has a part that 
is similar to M; and in this way we obtain an unending sequence of 
segments of M , all similar to M , so that M > M x > M 2 > ... > M n > — The 
elements which determine these sequences form a part of M which is of 
type *a>; and this is contrary to the hypothesis that M is normally ordered. 

Two different segments of a normally ordered aggregate cannot be similar . 

For one of these segments is a segment of the other. 

The aggregate of which the dements are the segments of a normally ordered 
aggregate M, can be so ordered that it is similar to M. 

For each element of M corresponds to a single segment of M, and 
thus the aggregate of segments can be so ordered as to be similar to M. 
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There is only one mode of putting the elements of two similar normally 
ordered aggregates into correspondence , so that the relative orders of the 
elements are unaltered in the correspondence. 

For if, in two modes of placing the aggregates in correspondence, two 
elements /, /', of one aggregate M , correspond to one element e of the 
other M\ the segments of M determined by/,/' are each similar to the 
segment of M ' determined by e; but it has been shewn to be impossible 
that M can have two different segments which are similar to one* another. 

A segment of one of two normally ordered aggregates is similar to at 'most 
one segment of the other aggregate. 

If S, S' are similar segments of two normally ordered aggregates 
M. M ', then, to every smaller segment S x < 8. of M, there corresponds a 
similar segment 8 f < S', of M'. 

If , 8 2 are two segments of the normally ordered aggregate M, and 
Sf, Sf are two similar segments of a normally ordered aggregate M\ 
then if Sf < 8 1 , it follows that 8f < S 2 . 

If a segment 8, of M, is not similar to any segment of another normally 
ordered aggregate M', then no segment S' > 8, of M , is similar to any 
segment of M', nor to M' itself; and the same holds of M itself. 

If M, M', two normally ordered aggregates , are so related that, to any 
segment of either , there corresponds a similar segment of the other, then 
M^M\ 

Any element e, of M , determines a segment of M which corresponds 
to a similar segment of M'. Let this latter be determined by an element e', 
of M '; we then take e to correspond to e'. To every element of M we 
therefore find a corresponding element of M', and it is seen by applying 
the foregoing theorems that the relative order of the elements is preserved. 

168. If two normally ordered aggregates M, M' are so related that , (1), to 
every segment S, of M, there corresponds a similar segment S', of M',. and 
(2), at least one segment of M' exists, to which there is no corresponding 
similar segment of M; then there exists a definite segment Sf, of M', such that 
Sf - M. 

Consider all those segments of M' which do not correspond to similar 
segments of M. Among these, there must be one Sf, which is the least of 
all; this follows from the fact that the elements which determine these 
segments of M' form an aggregate which has a lowest element, and this 
lowest element determines the segment Sf . Every segment of M' which 
is greater than Sf is such that there exists no corresponding similar 
segment of M ; but every segment of M f which is less than Sf has a 
corresponding similar segment of M . Since, to every segment of M, there 
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corresponds a similar segment of $/, and, to every segment of $/, there 
corresponds a similar segment of iff, it follows that iff - Sf, 

If the normally ordered aggregate iff' has at least one segment to which 
there corresponds no similar segment of iff , then , to every segment of M y there 
corresponds a, similar segment of iff'. 

Let Sy be the smallest segment of iff' to which there corresponds no 
similar segment of M . If there exist segments of iff to which no corre¬ 
sponding similar segments of iff' exist, let Sy be the smallest of all such 
segments of iff. To every segment of S x there corresponds a similar 
segment of $/, and conversely; hence 8 y - which is contrary to the 
hypothesis that there exists no segment of iff which is similar to Sf. 

If iff, iff' are any two normally ordered aggregates, then either (1 ), iff 
and if/' are similar, or, (2), there exists a segment S', of iff', which is 
similar to M, or, (3), there exists a segment of M , which is similar to iff'; 
and these possibilities are mutually exclusive. 

The following four possibilities may be contemplated, as regards the 
relation of iff to if/': 

(1) . To (‘very segment of eit her iff or iff' there corresponds a similar 
segment of the other aggregate. 

(2) . To every segment of M there exists a corresponding similar seg¬ 
ment of if/'; but there is at least one segment of M' to which no similar 
segment of iff corresponds. 

(3) . To every segment of if/' there corresponds a similar segment of iff; 
but there is at least one segment of M to which no similar segment of M' 
corresponds. 

(4) . There is at least one segment of M to which no similar segment 
of M' corresponds, and also at least one segment of M' to which no similar 
segment of M corresponds. 

It has been shewn that (4) is impossible. In the case (1), it has been 
proved that M - M In the case (2), it has been shewn that a definite 
segment Sy, of M', exists, such that Sf — M ; and in the case (3), that 
there is a definite segment of M, such that ~ M'. 

It is impossible that, at the same time M - iff ', and also iff — Sf ; 
for, in that case, if/' - ; and it lias been shewn to be impossible that 
M r should be similar to one of its own segments. 

It is also impossible that iff - $/, and also iff' - S x ; for there must 
then exist a segment of Sf which is similar to , and therefore to iff'; 
but this is contrary to the theorem that a normally ordered aggregate 
cannot be similar to one of its segments. 

If any part of iff is such that that part is not similar to any segment of iff, 
then that part is similar to iff itself. 
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Any part M x , of if, is normally ordered; if then M x be similar neither 
to M nor to any segment of if, there must exist a segment of if/, of M x , 
which is similar to if; and if/ is a part of that segment of M which is 
determined by the same element that determines the segment if/, of M x . 
Therefore if would be similar to a part of one of its segments, which has 
been shewn to be impossible. 

THE THEORY OF ORDINAL NUMBERS 

169 . The order-type M, of a normally ordered aggregate if, is said to he 
the ordinal number which belongs to if; all similar normally ordered aggre¬ 
gates have consequently the same ordinal number. 

If if, if' are two normally ordered aggregates such that if has a segment 
which is similar to if', whilst AT has no segment which is similar to if, then 
the ordinal number a = M is said to be greater than th£ ordinal number 
p ~ M; and this relation is denoted by a > ft. If if has no segment similar 
to if', but if' has a segment similar to M , the ordinal number a is said to be 
less than p, and the relation is denoted by a < ft. 

It follows from these definitions, in conjunction with the theorem of 
§ 168, that, if a, ft are any two ordinal numbers whatever, they satisfy 
one, and one only, of the relations a^p,a>p y a<P; and that if a > ft, 
then ft < a. 

Further it is seen that, if a < p and p < y, then a < y ; hence the 
aggregate of all ordinal numbers is a simply ordered aggregate, when 
arranged in such a manner that any one a, which has been defined as less 
than another one P, precedes it. 

The sum a + p of two ordinal numbers is , in accordance with the general 
definition of the sum of two order-types , the order-type of the normally ordered 
aggregate (if, N) y where if, N are two normally ordered aggregates such that 
a = M, P ~ if. 

Since if, N both contain no part,of type *a>, the same is true of 
(if, N). Hence the aggregate (if, N) is normally ordered; and thus 
a + P is an ordinal number. 

Since if is a segment of (if, N), we see that a < a ■+* P- 

N is a remainder of (if, N) determined by the lowest element of N , 
hence N may be similar to (if, N) ; or, if not, it is similar to a segment 
of (if, N): thus either p = a + /}, or p < a 4- p. 

The addition of ordinal numbers obeys the associative law, but not 
in general the commutative law; thus (a + P) 4- y = a -f (P -f y), but 
a -f P is in general not the same as P + a. 

170 . The product a.p t of two ordinal numbers, is, in accordance with 
the definition of § 159, the order-type of the aggregate obtained by sub- 



231 


i68-i7i] Theory of Ordinal Numbers 

stituting for each element of an aggregate of order-type ft, an aggregate 
of order-type a. In accordance with the theorem of § 166, the aggregate 
thus obtained is normally ordered, and of type dependent only on a and ft. 

In general a. ft is not equal to ft.a. 

It is easily seen that a.ft> a , provided ft > 1; and that, if aft - ay , 
then ft == y. 

If a, ft are two ordinal numbers such, that a < ft, there exists an ordinal 
number y such that a f y — ft; and this number y is defined, to be ft — a. 

For, if M =- ft, there is a segment of M which may be denoted by M l , 
such that Afj = «; let then M — {M 1 , S), therefore M — M x -f- S, and 
ft — a - aS. 

171. Let , ft 2 , ... ft n , ... denote a simple sequence of ordinal numbers, 
and suppose M k , M 2 , ... M n , ... are aggregates of which the order-types 
are respectively the numbers of the sequence. The aggregate 

(M,, M 2 , ... M n , ...), 

which is obtained by replacing each element of the normally ordered 
aggregate (1, 1, 1, ...), of type a>, by a normally ordered aggregate, is, in 
accordance with the theorem of § 166, itself normally ordered; and its 
type defines the sum ft 1 -f ft 2 + ... + ft n 4- ... — ft. if a n denotes the sum 
A + A -f ... -f ft n , we see that a n — (M 1 , M 2 , ... M n ); and it is clear that 
a n+ 1 > a n : hence 

Pi = «1' Pi ** «2 - «1. ■ •• > Pn «n - “n-1 • 

It will now be shewn (1), that > a n , for every value of w,; and (2), that, 
if ft' is any ordinal number < ft, there is some definite value of n such 
that a n , a w+1 , ... are all > 

(1) follows from the fact that each a n is the ordinal number of a 
segment of (M lt M 2 , ... M n , ...) of which ft is the ordinal number. 

To prove (2), we observe that a segment of (M x , M 2 , ... M n , ...) exists, 
of which ft' is the ordinal number, and therefore the element which de¬ 
termines this segment must belong to one of the aggregates 

j, If 2 ? • • * NL n j ...» 

say M ni . It follows that the segment is also a segment of ( M x , M 2 ,... M ny ); 
and therefore, ft' < a ni , or a n > ft', n being s rq. 

It has thus been proved that ft is the ordinal number which im¬ 
mediately follows all the ordinal numbers cq, cq, ... a n , ...; and it may be 
spoken of as the limit of the sequence a x , aq, ... a n ,_ 

Thus every ascending sequence a x , a 2 , ... a n , ... of ordinal numbers de¬ 
termines a limiting number ft — lim a n , which immediately follows all the 

n~oo 

numbers of the sequence. 
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THE ORDINAL NUMBERS OK THE SECOND CLASS 

172. Every finite ordered aggregate is normally ordered, and its order- 
type is the ordinal number of the aggregate. The finite ordinal numbers 
may be spoken of as the ordinal numbers of the first class; to each such 
ordinal number there corresponds a single cardinal number, and the 
properties of the finite ordinal numbers are identical with those of the 
finite cardinal numbers, the terms ordinal and cardinal simply defining 
the two uses of the same number. In the case of transfinite aggregates 
there is no such identity between ordinal and cardinal numbers; in fact 
the arithmetic of the one kind of numbers is essentially different from 
that of the other kind. 

Corresponding to a single transfinite cardinal number there is an in¬ 
finity of transfinite ordinal numbers; all those transfinite ordinal numbers 
which correspond to aggregates that have one and the same cardinal 
number a are said to form a class Z (a), the class of normal order-types 
which have the cardinal number a. 

The ordinal numbers of all those order-types which have the same 
cardinal number X 0 , as the aggregate of finite numbers, are said to be of 
the second class Z (N 0 )- 

The ordinal number o> — lim n, and is the smallest number of the second 

n —oc 

class. 

If M denotes the aggregate (m lf m 2 , ... m nl ...), then M = o>, and 
w = K 0 . Any number fi, which is < w , must be the order-type of a seg¬ 
ment of M, and M has only segments (m 1 , m 2 , ... m n ) with finite ordinal 
numbers n\ thus must be a finite number; and therefore the only ordinal 
numbers < co are finite ones. 

Every number a, of the second class , has a number a -f 1 immediately 
following it. 

For if a ~ M, a — K 0 , we have a -f 1 = (M , e), where e is a new 
element; and since M is a segment of ( M , e), we have a + 1 > a. Also 

a + 1 — o [■1 = -f 1 ~ Xq. 

It has thus been shewn that a 4- 1 is a number of the second class. Every 
number < a + 1, is the order-type of a segment of (M,e); and such 
segment can only be M , or a segment of M ; hence no number < a + I 
is > a: therefore a -f 1 is the next number greater than a. 

// (* 1 , « 2 , ... a n , ... be any sequence of numbers of the second class , there 
is a number lim a n , also of the second class , which is the smallest number 

oo 

that is greater than every number a n of the sequence. 
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If, as in § 171, we write ft = a,, ft = a 2 - g lt ... ft = a„ - a„ 
then if ft = ft, we have lim a„ ■ (ft, ft, ... ft, ...); and this number 

n~ oo 

lim a n has been shewn to be the smallest number which is > a n for every 

n~ao 

value of n. To shew that this number lim a n is of the second class, we 

_ n~oc 

have, since f3 n s for every value of n, lim a n ^ ^ Hq] and, since 

W~QO 

lim a n is not finite, it must therefore = X 0 . 

n~oo 

Two sequences {«„}, {a'„}, of numbers of the second class , have the same 
limiting number , when, and only when, the sequences are related to one 
another , in accordance with the definition of § J 60. 

Let ft, y be the two limiting numbers, and first assume that the se¬ 
quences are related to one another. If ft < y, then for some value of n, 
a n > ft, a'„ n > ft, ...; and hence, for some value of n' } we must have 
a n , > ft, ft, ..., which is inconsistent with ft being the limit of the 

sequence {a n }. 

If we assume ft = y, then, since a n < y, for some fixed number r we 
must have a r > a n , a' r +i > a n , ...; and similarly, since a n < ft, for some 
fixed number s we must have a s > a ' n , a s+1 > a' n , ...; hence the two 
sequences are related to one another. 

If ft is a finite ordinal number, and a a number of the second class, 
then n + a — a, and hence a — n — a. If a be a non-limiting number, it 
is however convenient to denote the number immediately preceding a 
by [a - 1], 

For n oj ~ oj, 

since n + w - (e ly e 2 , ... e n ;f ly f 2 , .../„, ...) 

~ iO 1 ->9 2 > * • * 9n t 9n-{ 1 > • • • )> 

where g x = e 1 , g 2 = e 2 , ... g n - e n , g n+1 ~f l9 g n+2 =/ 2 , .... 

Further, n -+• a — n -f w -f (a — <*>) = -I- (a — ca) = a. 

If n is a finite number, then no ~ o>. This is seen by taking an aggre¬ 
gate of the type w, and replacing each element by n new elements; then 
it is clear that the new aggregate is also of type w. 

It can be easily proved that (a + n) w — aw, where a is of the second 
class, and n of the first class. 

173. If a is any number of the second class , then those numbers of the first 
and second classes which are less than a form a normally ordered aggregate 
of type a , when they are arranged in order as defined above . 

If M is an aggregate such that M = a, and if a is an ordinal number 
< a, then there is a segment M f , of M, such that M' ~ a'; and, conversely, 
every segment of M determines a number of the first, or the second, class 
which is < a. For, since M = K, any segment M' must have either a 
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finite cardinal number, or else must have N 0 for its cardinal number. If e x 
is the lowest element of M, a segment M' is determined by an element 
e' > ; and every element e ', of M, determines a segment M f . If e', e " 

are two elements of M , both > e lf and M\ M" the segments of M deter¬ 
mined by these elements, and a', a" their order-types, then if e' < e", it 
follows, by § 167, that M' < M" ; and hence a ' < a". 

If then M = (e x , M'), and to the element e\ of M\ we make the 
element a', of {a'j, correspond, the two aggregates M' and {a'} are placed 
in the relation of similarity. It has thus been shewn that {a'} — M' ; now 
M' = a — 1 = a, hence {a} — a. 

Since a = N 0 , we have {«'} — N 0 ; and therefore the following theorem 
is established: 

The aggregate {«'} of all those numbers a\ of the first and second classes , 
which are ordinally smaller than a number a, of the, second, class , has the 
cardinal number K 


174. Every number a y of the second class , is either (1), such that it is 
obtained from a number of the same class immediately preceding it , by the 
addition of unity, or else (2), such that there exists a sequence {a„}, of numbers 
of the first or second class , having a for its limit. 

Let a = M ; then, if M has a highest element e, M = ( M ', e), where 
is the segment of M determined by e ; thus M — J/' + 1 , or 

a = [a—l] + l. 

If has no highest element, then the aggregate {«'}> of all numbers 
< a, which is similar to M, has no greatest number; and this aggregate 
{a}, being of cardinal number Kq, can be re-arranged as an aggregate {a' n } 
of type u). In this aggregate { a n }, some of the numbers af, of > ••• * n 
general be less than a/, but others must be greater than a/ ; for af cannot 
be greater than all the other numbers of the aggregate, there being in {a'} 
no greatest number. Let a' Vi be that number of {<*'„}, with the smallest 
index, such that a' P2 > of ; similarly let a' Vz be that number, with the 
smallest index, such that a' P3 > a' P2 , and so on. We have now an infinite 
sequence r > » 

a i > a VI > a P3 ’ * • * 

of numbers, such that they are in ascending order, and such that their 
indices are also in ascending order. Since n s. p n , we have a n £ a ; 
hence, for every number a which is less than a, there exists a number a' Pn 
which is > a '. Since a is the number which follows next after all the 
numbers a', it is also the number which follows next after all the numbers 
a/, a'^, a' Pa , ..., which we may write as a lf ctg, a 3 , ... a n , ...; thus 

a = lim a n . 


It has thus been shewn that there are two kinds of numbers of the 
second class, (1), those which have an immediate predecessor in the aggre- 
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gate of all such numbers arranged in ascending order, and (2), those which 
have no such immediate predecessor, and are called limiting numbers. 

A number of the first kind is obtained by means of the first principle 
of generation, (see § 65), from the immediately preceding number, 

A number of the second kind is obtained by the second principle of 
generation, as the number a which next follows all the numbers a n , of 
some sequence {a n } of numbers of the second class. 

THE CARDINAL NUMBER OF THE SECOND CLASS OF ORDINALS 

175. The totality of the numbers of the second class , arranged in ascending 
order , forms a normally ordered aggregate. 

If A a denotes the ordered aggregate of all those numbers of the second 
class which are less than the given number a, then A a is normally ordered, 
and of type a — co. For the aggregate {a'} of numbers of the first and 
second classes, which consists of {n} and A a , has been shewn, in § 173, to 
be normally ordered, and thus 

{a'} ({n}, AJ, 

hence {«'} = {n} + A a , or A a = a — co. 

Let M denote any part of the aggregate {a}, of all the numbers of the 
second class, such that, in {a}, there are numbers which are greater than 
all the numbers in M ; and let be one such number: then M is a part 
of A H . n , which is such that all the numbers of M are less than at least 
one number of A^ +1 . Since A^+i is normally ordered, there must be 
a number a, of A^^, being itself consequently a number oi {a}, which 
is the next greater number than all the numbers of M. Thus, since {a} 
has a lowest number co, the conditions are satisfied that {a} is a normally 
ordered aggregate. 

It follows, by applying the results of § 165, that: 

Every part of the aggregate {a}, of all numbers of the second class, 
has a least number. 

Every such part, in order, is normally ordered. 

It will now be shewn that the aggregate {a}, of all the numbers of the 
second class , has a cardinal number greater than tto. 

Consider the two aggregates {a} and {n}, where n denotes the finite 
integers. The aggregate {a} has a part, viz. that consisting of 
co -f- 1 , <o -f- 2, ... co -f~ n, ..., 

that is equivalent to {n}. It has been shewn in § 151 that every part of 
{n} is either finite, or that it is equivalent to {n} itself. In order to shew 
that the cardinal number of {a} is greater than Ko, it is therefore only 
necessary to shew that it is not equal to Hq . 
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If {«} == N». the numbers of {«} could be arranged in the form 
Yi> Yz> •••> 

of type to, in which of course the order would not be that of generation. 
Starting from y 3 , let y Vl be the y, with the smallest index, which is such 
that y Vi > y A ; then let y iJ3 be that y, with the smallest index, such that 
y v3 > y p ,; and so on. We obtain in this manner a sequence 

yi* Y»^ Yv ••• 

in ascending order, the indices 1, p. 3 , ... being also in ascending order. 

In accordance with § 171, there must be a definite number 8 of the second 
class, namely 8 — limy />?j , such that 8 >> y Vn , for every p n , and conse- 

II — TT 

quently such t hat 8 is greater than every y„; but this is impossible, since 
{y n } contains every number of the second class; hence {«} cannot equal X 0 , 
and is therefore > 

Every part of the aggregate {«}, of all numbers of the second class, has 
either the cardinal number of {«}, or else the cardinal number X 0 , unless it 
is a finite part. 

Every such part, when the elements of it are in order of generation, 
being part of the normally ordered aggregate {a}, is either similar to {«}, 
or else to some segment A H , of {a}; hence the cardinal number is either 
that of {«}, or is A ao a 0 — to, and this last is either X () , or is finite. 

The cardinal number of {a} is the cardinal number next greater than N () . 

If there exist a cardinal number less than {a}, and greater than i$ 0 ,'it 
must be the cardinal number of some part of {a} ; but it has been shewn 
that every such part of {«} has either the cardinal number of {«}, or is H*,, 
or is finite. 

The cardinal number of {a}, or of 7j {N () }, is denoted by Nj. 

THE GENERAL THEORY OF ALEPH-NUMBERS 

176. It has now been shewn that the ordinal numbers of the second 
class, in their order of generation, form a normally ordered aggregate, of 
which the cardinal number is K x , the next cardinal number to Ko. The 
ordinal type of the normally ordered aggregate {a} of all numbers of the 
second class, is a number Q, which is the smallest number of the third 
class. In analogy with the definition of the second class, and in accordance 
with what Cantor has denominated the principle of limitation (Hemmungs- 
prinzip), the third class is taken to include all the ordinal types of normally 
ordered aggregates, of which the cardinal number is Nj, and this class is 
consequently denoted by Z (fc^). The number Q, which is the order-type 
of all the numbers of the first and second classes, in the order of genera- 
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tion, and which comes after all those numbers, is not the limiting element 
of any sequence « 2 , ... a n , ... of numbers of the second class; for, as 
we have seen, every such sequence has a limiting number within the second 
class. From the point of view adopted by Cantor in his earlier writings, 
and explained in § (>5, in which t he successive ordinal numbers are regarded 
as successively generated, in accordance with postulated principles of 
generation, the number 12 must be regarded as generated by a third 
principle of generation, different from the two principles of generation 
employed in the case of the numbers of the first and second classes. This 
third principle of generation affirms that every set of ordinal numbers 
similar to the aggregate of all the numbers of the first and second classes, 
in their order of generation, is immediately succeeded by a new number, 
ordinally greater than all the numbers of the set, so that every number 
which is less than this new number is also less than some of the numbers 
of the set. When, proceeding from 12, t he numbers 

12 f- 1, 12 + 2, ... 12 -f 12, ... 

are formed, all three principles of generation will be required, in forming 
the numbers of the third class. 

From the point of view adopted later by Cantor, and explained in the 
present chapter, f2 is simply defined to be the order-type of the totality 
of the numbers of the first and second classes, in their normal order. The 
numbers higher than 12 are then defined in the same manner, each one 
as the order-type of the totality of the preceding numbers in normal 
order. 

The existence of a whole series of classes of order-types of normally 
ordered aggregates, i.e. of ordinal numbers, has been speculatively asserted 
by Cantor*, who has however, in his published works, confined his detailed 
investigations to numbers of the first and second classes. 

To each of the successive classes of numbers, there corresponds a single 
cardinal number, that of the totality of the ordinal numbers up to, and 
including all the ordinal numbers of that class. The first ordinal number 
of each class is the order-type of all the numbers of the preceding classes, 
in their order of generation. A new principle of generation is required for 
the first number of each new class, since that number cannot be regarded 
as the limiting number of any sequence of which the ordinal number is 
less than that of the number in question. All the successive principles of 
generation are however included in the one principle, that an aggregate of 
normally ordered ordinal numbers has itself an order-type which is a new 
number; and thus, from this point of view, all the principles of genera¬ 
tion, from the second, onwards, are replaced by this one principle. 

* See Math. Annahn , vol. xxi (1883), pp. 587, 588, also vol. xlvi (1895), p. 495. 
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177. In accordance with this theory, there exists an ordered aggregate 
1, 2, 3, ... n, ... oj, co 4- 1, ... co 2 , ... co w , ... ft, ft + 1, ... y, ... 

which contains every ordinal number of every class; and there also exists 
a similar aggregate 

1, 2, 3, ... 71, ... Kq, ••• ... ... 

of cardinal numbers, each element of which is the cardinal number of a 
single class of numbers of the first aggregate. 

That the first of these aggregates is normally ordered may be seen by 
remarking that, if it contained any part, of the type *co, then such part 
would also be part of the normally ordered aggregate formed by the 
numbers 1, 2, 3, ... co, ... a; where a is the highest number in the hypo¬ 
thetical part, of type *co. This is impossible, and hence the first aggregate 
is normally ordered. 

Cantor has given a proof that Xq is less than, or equal to, the cardinal 
number of any transfinite aggregate, and that X x is the cardinal number 
next greater than Xq . In his proof the possibility of two cardinal numbers 
being incomparable was disregarded. Accordingly, the first theorem in 
§ 151 has been made to precede Cantor’s theorem, which has however 
also been given. A proof has been given by Jourdainf, that X 2 is the next 
greater cardinal number than Xj , who has also considered, in some detail, 
the ordinal numbers of the third class, and has given indications of ex¬ 
tension to the higher classes. 

The question whether every transfinite cardinal number is necessarily 
an Aleph-number, which is equivalent to the question whether every 
aggregate is capable of being normally ordered, has engaged a considerable 
amount of attention. That the answer should be an affirmative one, was 
regarded by Cantor as probable. Some discussion of attempts which have 
been made to settle this matter, will be considered in § 203. A case of 
great importance is that of the continuum, which is defined as a simply 
ordered, but not as a normally ordered, aggregate. No proof has yet been 
discovered of the correctness of Cantor’s view that c = X x . In case c 
occurs at all in the aggregate of Aleph-numbers, the continuum is capable 
of being normally ordered. The possibility has also been contemplated 
that c may be greater than all the Aleph-numbers. 

THE ARITHMETIC OF ORDINAL NUMBERS OF THE SECOND CLASS 

178. The ordinal numbers of the second class have been defined as 
the order-types of normally ordered, enumerably infinite, aggregates; and 
the operations of addition and multiplication have been defined for these 
numbers, in §§ 169 and 170. It now remains for us to define exponentials, 

t See Phil, Mag. (6), vol. vn (1904), p. 294, “On the transfinite cardinal numbers of number- 
classes in general.” 
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for numbers of this class; and the definition is founded upon the following 
theorem : 

If f is a variable of which the domain consists of the numbers of the 
first and second classes, including zero, and if y, 8 denote two constants 
belonging to the same domain, such that 8 > 0, y > 1, then there exists a 
single-valued determinate function / (f), which satisfies the conditions 
(1). / (0) = 8. 

(2) . If £" are any two values of £ such that fthen/(f') < /(£"). 

(3) . For every value of f, / (f h 1) = / (£).y. 

(4) . If {£ n } is a sequence, of which f is the limiting number, then 

{/ (f«)} is a sequence, of which / (£) is the limiting number. 

In the case 8 --= 1, the function f (£) is denoted by y f ; and then/(£), 
satisfying the above conditions, defines the exponential function y f , for 
all numbers y, £ of the first and second classes. 

To prove the theorem, in the first place we have 

/(l) = 8y, /(2)-8y* f ...,/(n)^8y n ; 
thus /(l) < /(2) </(3), and the function is determined for every 
£ < <u. Next assume that the function is determined for every £< a, a 
number of the second class. If a is not a limiting number, 

/(«)-/!« - i]y >/L« - i]; 

and thus / (a) is determined. If a is a limiting number, and is preceded 
by the sequence {a n }, then {/(f„)} is a sequence, and f (a) = lim/(a n ). 

n~ao 

If {a n '} is another sequence such that a — lim a n ', then the two sequences 

n ~co 

{/( a n)}> {f (&»/)} are related to one another, and therefore have the same 
limit; and thus f (a) is uniquely determined. / (£) is now determined for 
every f: for if there were values of a for which it were not determined, 
there must be a smallest of such values; the theorem would then hold for 
£ < a, but not for f ^ a ; which is contrary to what has been proved above. 

179. If a , ft are numbers of the first or second class, y"+ 8 = y^.y 8 . 

The function <f> (f), = y“ +f , satisfies the conditions 

(1) , <f>(0) = y*. 

(2) , if e < r, <f> (n < 4> (n. 

(3) ,*(f + l) = ^(f).y. 

(4) , If {f n } is a sequence such that lim £ n = £, then (£) = lim <f> (f n ). 

n~oo n-'-oo 

It follows by § 178 that, if we take 8 = y*, then <j> (f) — ; hence if 

f = ft we have = y. 

Again, if a, /S are two numbers of the first or second class, 

y* = (r'Y- 
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If we put / (£) — y af , we find by applying the theorem of the last 
section, that, /(f) - (y a )C where y 1 replaces y. 

y being > 2, it can be proved that, for every £, y r s £. The theorem 
holds for £ 0, £ -= 1; and if it be assumed to hold for all values of £ 

which are less than a given number a , then it holds also for £ — a . 

For, first let a be not a limiting number: then, if \a - 1] ^ we 

have [a - JJy /; 

hence y 1 ^ [a - 1J {• [a — 1] [y — IJ: therefore since [a — 1] and [y — 1] 
are > 1, and [« — 1J 4 1 ^ we have y a a. If a is a limiting number 
— lim a n , then since a v < y"" we have lim * lirn y a », or a ? y a . If there 


were values of £ such that £ > y f , there would be one of such values which 
is the least of all; and if this were a , then f i-: y f , if f < a, but a > y“; 
which is contrary to what has been proved above. 


180. Of all the numbers of the second class, the smallest ones are 
those which are algebraical functions of o>, of the form 

<*>"-Pn + “>P\ I- Po> 

where p 0 , p 1} ... p n are finite numbers. If we write 


CO^I, w 3 — CO "a, 


then we obtain the number e 0 


=• lim a> n 

V ~oo 


This number e 0 is the smallest 


of a species of numbers of the second class which are characterized by the 
property e — co\ and which Cantor has designated c-numbers. Cantor has 
shewn that the €-numbers form a normally ordered aggregate, of type Q, 
and therefore similar to the whole second class of numbers. He has further 
shewn that every number «, of the second class, is uniquely representable 
in the form 


a — cu a o/c 0 {- w^/Ci 


CO a rK r 


where a 0 , a x , <%, ... a r are numbers of the first or second class which satisfy 
the conditions a 0 > cq > a 2 > ... > a r ^ 0, and *- 0 , k 2 , ... k t , r -f 1, are 

numbers of the first class which are different from zero. For the detailed 
investigation of the normal form, and for that of the special class of 
c-numbers, we must refer to Cantor’s original discussion*. 


THE THEORY OF ORDER-FUNCTIONS 

181. A method of representation of any mode of ordering a given 
aggregate M has been given by Bernstein f. When the elements of the 
aggregate are numbers, this method lends itself to a diagrammatic repre¬ 
sentation of the aggregate, as ordered in any particular order-type. 

* Math. Annalen , vol. xux (1897), pp. 235-248. 

| See his Dissertation, also W. H. Young, on “Closed sets of points and Cantor’s numbers,” 
Proc. Lond. Math. Soc. (2), vol. # i (1903), p. 230. 
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An aggregate M is ordered, in the most general sense of the term, when 
it is known as regards every pair of elements a, 6, whether a $ b; but a 
particular mode of ordering the aggregate can be represented by means 
of a function / (a, b) of the pairs of elements, which is defined by 
/ (a, b) — 1, if a < b \ f (a, b) = — 1, if a > b \ and / (a, a) = 0. 

This function / (a, 6) may be denominated an order-function of the aggre¬ 
gate Af; and there is one order-function for each possible mode of ordering 
the aggregate. The function must satisfy the condition that, if 

j (a, b) — f (b, c), 
then each is equal to / (a, c). 

Two order-functions (a, 6),/ 2 (a, b), of a given aggregate if, represent 
two methods of ordering the aggregate in one and the same order-type, 
provided there exist a reversible transformation <f >, of the aggregate M 
into itself, such that/ x {</> (a), cf> (6)} = / 2 (a, 6). 

All those order-functions of a given aggregate M , which correspond 
to an arrangement of Af in one and the same order-type, constitute a 
family of order-functions; and there is one such family of order-functions 
corresponding to each order-type in which the given aggregate M can be 
arranged. 

It is clear that the order-functions of a family corresponding to M 
form an aggregate with the same cardinal number as the group of trans¬ 
formations of M into itself. 

If, in particular, the aggregate M is that of the positive integers, then 
a pair of elements (a, 6) is represented by a cross-point of the rectangular 
trellis formed in the positive quadrant by drawing all the straight lines, 
x = i, y = i , for positive integral values of i, referred to rectangular 
Cartesian coordinates x , y. 

The natural order of the numbers 1 , 2, 3, ... will be represented by 
/ ( x , y ), defined for all the cross-points, so that / (x,y) = 1, when x < y, 
and / (x, y) = — 1, when x > y, and such that / (x, x) — 0. 

Any particular mode* of ordering the numbers 1 , 2, 3, ... will be re¬ 
presented by marking one set of cross-points -f 1, and another set — 1, 
those on the diagonal x = y being marked zero. 

It is, however, not every mode of so marking the cross-points that 
represents a possible ordering of the aggregate. That a mode of marking 
may represent a possible order, two conditions must be satisfied. First, 
we must have / {x, y) = — / ( y , x ); and thus points which are optical 
images, relatively to the diagonal y = x, must be marked with unities of 
opposite sign. Secondly, the condition that, if / (a, b x ) = / (b x , c), then 

* Some examples of order-types represented in this manner are given by W. H. Young, Proc. 
Land. Math. Soc. (2), vol. i (1903), p. 244. 
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each -- / (a, c), must be satisfied. This condition may be expressed as 
follows:—Join every cross-point which is marked -f 1, or 0, with every 
other such cross-point, then the resulting figure may be called the positive 
frame-work; join similarly all the pairs of points marked — 1,0; in this 
way we obtain the negative frame-work. Let lines joining the two pairs 
of points (x 1 ,y l ), (x 2 ,y 2 ) and (x ly y 2 ), (z 2 , y x ) be called conjugate lines. 
The condition which must be satisfied is that no side of the positive 
frame-work can be conjugate to a side of the negative frame-work. It can 
easily be seen that the condition so stated is necessary and sufficient. 

THE CARDINAL NUMBER OF THE CONTINUUM 

182. The arithmetic continuum has been defined as an aggregate of 
the order-type 6 (see § 163), and it is thus not normally ordered. It has 
been held by Cantor* that this aggregate, and perhaps every aggregate, 
is capable of being arranged as a normally ordered aggregate; but no 
universally accepted proof of the correctness of this view' has been ob¬ 
tained. If the continuum be capable of arrangement as a normally ordered 
aggregate, its cardinal number c must be identical with one of the Aleph- 
numbers; and in fact Cantor believed that c — Kj, the cardinal number 
of the aggregate of all the order-types of normally ordered enumerable 
aggregates. As evidence of the probable truth of this view, the facts may 
be cited that all the sets of points which have actually been defined in 
connection with the theory of sets of points have one or other of the two 
cardinal numbers X 0 and c, and that no such set of points has been defined 
of which it is known that the cardinal number is > Xq and < c . This 
negative evidence is however clearly insufficient to settle the question 
whether every part of the continuum has one of the powers N 0 or c, a 
question which has hitherto defied all attempts to obtain a conclusive 
answer. It has been shewn by Konig that the cardinal number of the 
continuum cannot be an Aleph-number of which the index is a limiting 
number. As has already been pointed out, it cannot be assumed that 
every two cardinal numbers are such as to be comparable with one another; 
but a proof has been propounded by G. H. Hardy j* that c, and presumably 
any cardinal number whatever, must either be an Aleph-number, or else 
be greater than all the Aleph-numbers. The mode of reasoning is a 
generalization of that employed by Cantor in his proof (see § 151) that 
Kq is less than any other transfinite cardinal number. 

Because c £ , it is possible to take elements from the number-con¬ 

tinuum corresponding to all the numbers of the first and second classes 
of ordinals. For, if this process came to an end, we should have c = Kq, 

* Math. Annalen , vol. xxi (1883), p. 550. 

t Quarterly J. of Math. vol. xxxv (1903), p. 87. 
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which has been proved by Cantor not to be the case. It follows that a 
set can be selected from the continuum equivalent to the aggregate of 
ordinal numbers of the first and second classes. Now if a set could be 
selected from this aggregate equivalent to the continuum, it would follow 
from the equivalence theorem, proved in § 153, that c — and if no such 
set could be selected it would follow from the definition of inequality in 
§ 146, that c > Kj; thus it follows that c X r . If now c > X 1? a similar 
proof would shew that c ^ X 2 , and so on. If c > X n for every finite n, 
then c k X w , and this process may be continued indefinitely through the 
Aleph series. The validity of this proof depends upon considerations which 
will be discussed in § 198. 

It is known that every infinite closed linear set of points has one or 
other of the two cardinal numbers Kq, c; and if a set of points can exist 
of which the cardinal number has neither of these two values, it must be 
unclosed, and may without loss of generality be taken as dense in itself. 
The difficulties of dealing with unclosed sets, dense in themselves, are so 
great that attempts to find a contradiction involved in the assumption of 
the existence of such a set, possessing a cardinal number different from 
both Kq and c, have hitherto been a complete failure. 


183. A very remarkable relation has been given by Cantor between 
the cardinal number of the continuum and that of the integral numbers. 
This relation is expressed by c = 2*o, or more generally c = where 
n is a finite integer. 

This theorem was applied by its discoverer to obtain a simple arith¬ 
metical proof that the Xo-dimensional continuum has the same power as 
the one-dimensional continuum. 


In accordance with the definition of an exponential, given in § 150, 
2 s o is the cardinal number of the numbers in the dyad scale, 


2 2 a 


+ - + 
^ 2 3 ^ 


where every b is either 0 or 1. In this aggregate, each number of the form 
w ^ ere P an< ^ ? are i n t e g ers > occurs twice; hence 

2*« = CT, 

where X is the aggregate of real numbers between 0 and 1, and s is an 
enumerable aggregate. 

It follows from the above, that 2*o = c + Xq. Now c -f Ko-c+2^,, 
since K 0 =2N 0 ; therefore c = c -f Kq ; whence we have 2 K o =* c. 

From this theorem, we deduce c.c = 2*o.2*o = 2 2 *o = 2*o = c; and 
hence by repeated multiplication by c, we find c n = c, where n is any finite 
integer. 
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Again c s « = (2 K o) s o = 2 K o* s o = 2 K o = c, 

and therefore the continuum of finite, or of enumerable, dimensions is 
equivalent to the one-dimensional continuum. 

The aggregate defined by all possible modes of distributing the numbers 
of the continuum upon themselves, has the power c c ee /; and this number / 
is greater than c. This has been proved in § 154; for a part of the new 
aggregate is equivalent to that obtained by replacing the numbers of the 
continuum either by A , or by B, and taking all possible aggregates which 
arise in this wav. Since this part has been shewn to have a cardinal 
number greater than that of the original aggregate, it follows then that 
/> c. 

More generally, if a is any cardinal number, we have cf > a. 

If the continuum be divided into any finite number n , of parts , such that 
all the parts have the same cardinal number , then that cardinal number is 
the same as that of the continuum *. 

The parts may consist of sets of points of any kind. 

The theorem may also be stated thus:—if na — c, then a = c. 

To prove it, we have na = c = nc\ and therefore, by applying the 
theorem of § 156, it follows that a = c. 

184. The continuum is equivalent to the aggregate of all possible order- 
types of simply ordered aggregates of cardinal number K,,. 

This theorem points the contrast between the aggregate of all order- 
types of simply ordered enumerable aggregates, which is of power 2 K o, 
and the aggregate of all the order-types of normally ordered enumerable 
aggregates, which is . The latter aggregate is, of course, a part of the 
former one, and thus the theorem s 2 s o can be deduced. 

The theorem may be also stated in the form: The total number of ways 
of ordering the integral numbers 1, 2, 3 , is c. 

If p is the order-type of an enumerable aggregate, arranged as a •simply 
ordered aggregate, then a part of an aggregate of the type 77, considered 
in § 162, can always be determined, which is of the type p. To establish 
this, it can be shewn that an aggregate of type p can always be changed 
into one of type 77, by insertion of new elements. If, between every pair 
of elements m x , ra 2 , of p, there are other elements, then p is of one of the 
types 77, 1 + 77, 7) 4- 1, 1 4- 77 H- 1; so that p is reduced to 17 by the removal 
of the lowest and the highest elements, when such elements exist. In any 
such case, if we add to p aggregates of type 77, at the beginning and at 
the end, we obtain an aggregate of type 77 + p -f- 77, which is of type 77, 
whichever of the types 77, 1 -f- 77,77+1,1-1-77+1 may be identical with p . 


* Bernstein, loc. cit . p. 31. 
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If pairs of elements exist in the aggregate of type p,, such that there are 
no elements between them, an Aggregate of type rj can be inserted between 
every such pair, until a new aggregate of one of the types 77, 1 -f- 77, 77 -f- 1, 
1 -f- 77 -f- 1, is obtained; then, as before, by adding aggregates of type 77, 
at the beginning and end, we obtain an aggregate of type 77. It has thus 
been shewn that any aggregate of type /z is a part of another aggregate 
of type rj. Since the rational numbers in their totality naturally exist in 
the order-type 77, it follows that an aggregate of any type /z can be made 
by taking a part of the aggregate of rational numbers, of type 77. It follows 
that the aggregate of all types /z has a cardinal number less than, or equal 
to, that of the aggregate of all part-aggregates of the set of rational 
numbers, arranged in type 77. 

Now every aggregate (r 1} r 2 , ...), of which all the elements are rational 
numbers, corresponds to a single point of a continuum of an enumerable 
number of dimensions, of which the coordinates are x x — r l , x 2 — r 2 ,.... 
Hence the cardinal number of the aggregate of all part-aggregates of the 
set of rational numbers is less than, or equal to, the cardinal number of 
the K 0 -dimensional continuum, that is, £ c; and therefore the cardinal 
number of the aggregate of all types fi is < c. 

It will now be proved that c ^ the cardinal number of the aggregate 
of all types /z. To every real number between 0 and 1, there corresponds 
an infinite sequence b 1 b 2 b 3 ..., where every b is either 0 or 1 , expressing 
the number in the dyad scale. After each b, insert an aggregate of type 7r, 
and we then have an aggregate b^b^b^ir ..., of type 

V ^ b L 4- 77- f b 2 -f 77 -f 63 -f tt 4-- 

Here, some of the b' s may be zero, and these may be simply omitted; 
thus 7r-f0f7r-7r + 7 T. Hence, to any real number x , between 0 and 1, 
there corresponds the type 

v — by -f tt -f b 2 H- • • • • 


It is now necessary to shew that the two order-types v, v , which 
correspond to two different numbers, x, x\ are necessarily distinct from 
one another. If v — v \ we can write the equality 

C x + TT + U = <V + TT + £/, 


where C x , C x ' are each either 0 or 1; and from this we obtain, by means 
of the theorem of § 164, C l = (7/, and = £/. The last equation can 


be written 


C 2 + TT + £2 = C 2 T TT 4- £ 2 ' y 


and from this we conclude that C 2 = C 2 , £ 2 = Cz ; anf l we can proceed 
onwards in the same manner. From b Y = 6/, b 2 = b 2 \ ..., we conclude 
that x = x'. It has thus been shewn that {x} — {^}; and from this we 
conclude that c £ the cardinal number of all order-types fi. 
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This part of the theorem is due to Cantor*, and the first part to 
Bernstein. By combining the two results, the complete theorem is estab¬ 
lished. 

This important result may also be expressed by saying that the totality 
of all permutations of the sequence of positive integers has the pcnver of the 
continuum. 

It may also be shewn that the totality of all parts of the sequence 1, 2, 3, ... 
has the power of the continuum. 

For, if we form a sequence, by writing 0 for each of the numbers 
1, 2, 3, ... which does not occur in a given part of (1, 2, 3, ...); and 1 for 
each number which does occur in the given part, then the sequence of 0’s 
and Fs, thus obtained, corresponds to a real number expressed in the 
dyad scale, and therefore the numbers of the continuum are put into 
correspondence with the parts of the sequence (1, 2, 3, ...). 

It was thought probable by Cantor that 2 K o = , but no proof of the 

correctness of this hypothesis has been obtained. The hypothesis has 
been discussedf by Sierpiriski. 

185. It can be shewn that the aggregate of all sets of points in the 
n-dimensional continuum has a cardinal number greater than c. 

For, in the aggregate {P} of all points in an n-dimensional continuum, 
we can substitute 0 for each point P which does not occur in a given set 
of points of the continuum, and 1 for each point P which does occur; 
we then obtain an aggregate consisting of 0’s and l’s: but it is known that 
the totality of all such aggregates has the power 2 C , which is > c. 

On the other hand, the totality of all closed sets of points in the n-dimen¬ 
sional continuum has the same power c as the continuum. 

Every closed set is the derivative of an enumerable set of points; and, 
to every enumerable set of points, there corresponds a single closed set. 

It follows that the cardinal number of the totality of closed sets is 
^ the cardinal number of the totality of enumerable sets of points chosen 
out of the continuum. To shew that the latter is c, we observe that it is 
£ the aggregate of all combinations of points of the continuum in sets of 
H 0 elements, that is, ^ c s o, or ^ c. 

Again, every single point of the Ko-dimensional continuum corresponds 
to a single point of the one-dimensional continuum, and this point is an 
enumerable part of the continuum; hence the totality of enumerable sets 
of points of the n-dimensional continuum is £ c. On combining this with 
what has been proved above, we see that the totality of all enumerable 
sets of points in the n-dimensional continuum is c; hence the totality of 
all closed sets of points in the n-dimensional continuum is s c. 

* See Bernstein's Dissertation, p. 7. f Fundamenta Math. vol. v (1924), p. 177. 
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Again, the totality of all closed sets of points in the ^-dimensional 
continuum is ^ c. For one such closed set can be taken in each of an 
infinity of the domains x x = a, where x x is one of the n coordinates which 
determine the position of a point in the 71 -dimensional continuum; and 
the aggregate of all possible values of a has the power c. We thus obtain 
an aggregate of closed sets which has the power c; and it follows that the 
aggregate of all closed sets in the 7? -dimensional continuum is # c. 

Since the totality of all closed sets of points in the n-dimensional con¬ 
tinuum is c, and at the same time is <; c, it must have the cardinal 
number c. 

Since* every curve, or surface, in a continuum is formed by a closed 
set of points, we see that every possible curve, or surface, corresponds 
uniquely to a single definite real number. 

186. A method of constructing a set of points of which the cardinal 
number is Hj, has been givenf by G. H. Hardy. 

If we start from the sequence 

1, 2, 3, 4, 5, . (1) 

of integral numbers, a new sequence 

2, 3, 4, 5, . (2) 

is formed by omitting the first term. 

Continuing this process, we form 

3, 4, 5, 6, . (3) 

4, 5, 6, 7, . (4) 

5, 0, 7, 8, . (5) 

We now form a new sequence 

1, 3, 5, 7, 9, . (co) 

by traversing the above infinite array of sequences diagonally. Then 


3, 


7, 

9, 

11, ... 

... (co 

+1) 

5, 

7, 

9, 

11, 

13, ... 

... (w 

+ 2) 

7, 

9, 

H, 

13, 

15, ... 

... (a) 

+ 3) 

9 , 

11, 

13, 

15, 

17, ... 

... (co 

+ 4) 

1 , 

5, 

9 , 

13, 

17, ... 

... (CO, 

2) 

5, 

9 , 

13, 

17, 

21, ... 

... (co, 

.2 + 1) 

9 , 

13, 

17, 

21, 

25, ... 

... (co, 

.2 + 3) 

1 , 

9 , 

17, 

25, 

33, ... 

... (co, 

.3) 


Thus sequences corresponding to all the numbers co.p. -f v can be formed. 

* Bernstein, loc. cit. p. 43. 

t Quarterly Journal of Math. vol. xxxv, 1903, “A theorem concerning the infinite cardinal 
numbers.” See also Hausdorff, Leipz. Ber. vol. lix (1907), p. 166. 
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To form the sequences corresponding to w 2 , we take the array of 


sequences 

1 , 

3, 

5, 

7, 

9, . 

• M 


1 , 

5, 

9, 

13, 

17, . 

. (co.2) 


1, 

9, 

17, 

25, 

33, . 

. (co.3) 


1, 

17, 

33, 

49, 

65, . 

• (at.4) 


1, 

33, 

65, 

97, 

129, . 

. (at.5) 

and traverse it 

diagonally 

; we 

thus obtain 



1, 

5, 

17, 

49, 

129. 

. (at 2 ). 


Generally, if b l9 b 2 , b 3 , /> 4 , ... is the sequence corresponding to /?, the 
sequence b 2 , b 3 , b 4 , fe 3 , ... corresponds to -f 1. To obtain a sequence 
corresponding to a number y which is a limiting number of the second 
class, we take the array of sequences corresponding to any ascending set 
of numbers ft 2 , ... of which the limit is y, and traverse it diagonally. 
It is clear that, in this manner, a sequence can be found for any given 
number of the second class; but that the set of sequences so obtained 
is not unique. For example, a> might have been taken as the limit of 
1, 3, 5, 7, ..., or co 2 might have been taken as the limit of 
CO -f 1, co.2 + 2, co.3 -h 3, — 

It will be shewn that the sequences h lt b 2 , b 3 , ... can be so chosen that 
in every case b 1 <b 2 <b 3 < ...; and that, if the sequences b l , b 2 , ... and 
6,', b 2 , ... correspond to any two numbers /?', where fi < /3', then there 
exists a number N such that b n ' > b ni for n £ N; and thus that the se¬ 
quences are distinct from one another. 

Let us assume that sequences, corresponding to all numbers < y, have 
been constructed in such a manner that this condition is satisfied. First, 
let y be a non-limiting number, so that y •= y + 1. Then if < y f , there 
is a number N such that a n ' > b n , for n ^ N, w here a x , a 2 , a 3 ' , ... form the 
sequence which corresponds to y'. But if a,, a 2 , a 3 , ... form the sequence 
which corresponds to y, we have a n — a' n+1 > a n ' > b n , for n ^ N. 

Hence, if the construction is possible for all numbers < y, it is possible 
for all numbers s y, where y is a non-limiting number. 

Next, let us suppose that y has no immediate predecessor, and that 
y = lim then also y = lim (fi rn 4- v m ), where the v m are finite numbers. 

m ~oo m~ao 

Now there is a number N lt such that b 2 , n > b hn , for n N ly where 6 m>n 
denotes the nth number in the sequence corresponding to . A fortiori , if 
y m = Pm + v m , we have c. t , n = b t . n+ , t > b 2 , n > b hn ; for n >, N lt where c m ,„ 
is the nth number in the sequence corresponding to y m . But if we take 
v z > we have c^ n = b %n+V2 ^ n + v 2 > b ltNt _ x > b 1>nt for n < N x ; and 

hence we have c a>n > 6 1>w , for all values of n. Similarly, v z can be so chosen 










186 ] Cardinal Number of the Continuum 249 

that y 3 > y 2 , and c 3 , v > c 2t „, for all values of n; and so on generally. If 
we write for ft l , and c Un for b hn , we have a doubly infinite array 

C l,l > C l, 2? c i,3> ••• 

C 2,l> C 2, 2> C 2,3> ••• 

C 3,1 > C 3,Z’ C 3,3 J * * * 

and we define the sequence corresponding to y , by traversing it diagonal^, 
so that c n = c n , n . If then ft < y, we can find m so that ft < y m ; then there 
is a number K, such that c nun > />„, for n>, K. But if n> m, we have 
c n = c n . B > c w , n ; and thus, if n is greater than the greater of the two 
numbers m, K , we have c n > 6 n . It has thus been shewn that, if the 
construction is possible for all numbers < y, it is possible for all numbers 
£ y, whether y is a limiting number or not. 

Tn this manner, a sequence is obtained which corresponds to any 
assigned number y of the second class, and this sequence is distinct from 
those which correspond to the numbers < y, such sequences being also 
distinct from one another. 

The sequences may be correlated with points in the linear continuum 
(0, 1). To correlate a sequence b ,, b 2 , b 3 , ..., we may take the binary radix 
fraction in which the 6,th, /; 2 th, /> 3 th, ... figures are all 1, arid the remaining 
figures all 0. In this manner a set of points is shewn to exist, such that 
one point of the set corresponds to each number of the first, or of the 
second, class. This amounts to the construction of a set of points of 
cardinal number K x . Just as an enumerable set of points is determinate, 
when the point which corresponds to any assigned number n, of the first 
class, is determinate, so the set of cardinal number N 1 is determinate, in 
the sense that a definite point is determined, corresponding to any assigned 
number ft, of the first, or of the second, class. 

It may be remarked that a set of points of cardinal number , or of 
any cardinal number > X 0 , when arranged in normal order , cannot possibly 
be in the order in which they occur in the continuum. 

For, if a set of points, in the order in which they occur in the con¬ 
tinuum, forms a normally ordered aggregate, each point and the next 
succeeding one define a linear interval of which they are the end-points. 
We have thus a set of intervals which must have the same cardinal number 
as the given set of points. Each interval of the set abuts on the next 
one, and thus the end-points together with their limiting points define an 
enumerable closed set. Hence the given set must be enumerable. 
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GENERAL DISCUSSION OF THE THEORY 

187. Cantor’s definition of an aggregate, given in § 145, implies that 
the elements of an aggregate are logically prior to the aggregate itself. 
The definite and distinct objects are, by an act of synthesis, regarded as 
a single collection, or aggregate. Such an act of synthesis involves a 
postulation, which is subject to the law of contradiction. It appears to 
have been fully recognized by Cantor himself, in connection with a certain 
so-called “inconsistent” aggregate, to be discussed below, that this is the 
case. The postulations of the existence, as single objects, of the aggregate 
of integral numbers, and of that of real numbers, are instances of such 
an act of synthesis. A somewhat different view of the nature of an aggre¬ 
gate is however widely spread. In accordance with this view, the elements 
of an aggregate consist of all objects that satisfy some prescribed condition 
or conditions; and thus the aggregate consists of a certain class of objects; 
and there has been a tendency to identify the conception of aggregate 
with that of “class,” employed in Formal Logic. The aggregate itself is, 
from this point of view, logically prior to the elements it contains; in 
fact, in any given ease, it may be doubtful whether any such elements 
exist; or it may even be known that no element exists, in which case the 
aggregate, or class, is called a null-aggregate, or null-class. Thus the 
relation of the aggregate to its elements is inverse to that which is in 
accordance with Cantor’s synthetic definition. The following form of 
definition expresses this view of the nature of an aggregate: 

All objects which are such as to satisfy a prescribed normi are said to 
belong to an aggregate defined by that norm . The norm, consists of a set of 
specified conditions , or of a set of alternative specified conditions; and this 
norm must be sufficient to render it logically determinate, as regards any 
particular object whatever , whether that object belongs to the aggregate or not. 

In the case of a finite aggregate the norm may take the form of indi¬ 
vidual specification of the objects which form the aggregate, but such 
exhaustive individual specification of the elements of a non-finite aggregate 
is not possible. The condition that it must be logically determinate whether 
a particular object is an element of a particular aggregate, or not, covers 
cases in which effective determination is not possible, either because such 
determination is impracticable, on account of the length of the process 
which would be requisite actually to make the decision, or because we are 
not in possession of the requisite means for making the decision. For 
example, the determination of the millionth digit in the value of the 
number tt is merely impracticable, although it can be carried out by a 
finite process. Again, the question whether a particular number, such as 
Mascheroni’s constant, belongs to the aggregate of transcendental numbers, 
has a logically determinate answer, although we may not be in possession 
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of a method by which that answer can be obtained. The definition admits 
the logical determination as sufficient when effective determination is 
impossible or difficult. There may even be cases in which the effective 
determination of the question whether a defined aggregate is a null- 
aggregate, or not, is impossible, by the employment of existing means, or 
is so difficult as to be impracticable. 

It is clear that the elements of an aggregate, being subject to a common 
norm, must have a certain community of nature, their class mark, which 
constitutes the ground of the aggregation. 

188. The discussion of the theory of aggregates which has taken place 
amongst mathematicians, since the publication of Cantor’s theories, has 
led to the examination of certain aggregates which appear to satisfy, 
prima facie, the definition of an aggregate, and yet which are such that 
their existence, at all events if they are regarded as possessing cardinal 
numbers or order-types, involves certain contradictions. In this manner, 
what are known as the antinomies of the theory of aggregates have arisen. 
Both Cantor s definition of an aggregate, and the definition, given above, 
of an aggregate as a class, lead to antinomies of this kind, the most im¬ 
portant of which will be discussed below. It was, however, recognized 
by Cantor* that the collection of elements into an aggregate is only valid 
provided a conception free from contradiction is the result of such syn¬ 
thesis. Suggestions have been made in various quarters, in the direction 
of limiting the concepts of the theory by means of postulates, in such a 
manner that the antinomous aggregates will not arise. At the present 
time however, no such generally accepted scheme of postulations and 
axioms is in existence. The final goal of such investigations would be 
attained if we were in possession of a definition of an aggregate, of such a 
character that it could be shewn a priori to be free from contradiction, 
without waiting for the discovery of antinomies. Such a definition should 
also be sufficiently wide; so that all aggregates, which are fitted for em¬ 
ployment in a mathematical theory, would fall under it. 

On systems of axioms which are suited for the building up of a theory 
of aggregates of such a character that no contradictions will arise, the 
writings of Hilbertf, ZermeloJ, Russell §, Schoenflies||, and Huntington^f 
may be consulted. Developments of the theory which go beyond those 

* See Jourdain, Phil. Mag. (6), vol. vn (1904), p. 07. 

t Verhand . d. Math. Kongrcss z. Heidelberg (1904), p. 174; Jahre.sber. a. Math. Vereinigung, 
vol. viii, 1 (1900),.p. 180; Math. Annalen , vol. xcv (1926), p. 161. See also Mollemp, Math . 
Annalen , vol. lxiv (1907), p. 231. 

f Math. Annalen, vol. lxv (1908), p. 261; Acta Math. vol. xxxn (1909), p. 186. 

§ Proc. Lond. Math. Soc. (2), vol. iv (1905), p. 29. 

|| Math. Annalen , vol. lxxii (1912), p. 551, vol. lxxxiti (1921), p. 173, and vol. lxxxv (1922), 

p. 60. 

Ann. of Math. (2), vol. vi (1904), p. 151, and (2), vol. vn (1905), p. 15. 
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of which an account is given in the present chapter will be found in the 
writings of Hausdorff*, Ffessenbergf, SchoenfliesJ, Bernstein §, Jourdain|| 
and Whitehead^j. A treatise has been published by Konig**, in which the 
subject is treated from his own point of view. 

An interesting discussion of the objections which have been raised in 
various quarters against Cantor’s theory of aggregates has been givenfj* 
by Fraenkel, who has also developed in some detail the “axiomatic” 
treatment of the theory due to Zermelo, in which no direct definition of 
an aggregate, such as that of Cantor, is employed. 

189. It is not clear that an aggregate, defined as a class of objects, is 
necessarily capable of being ordered at all. For example, it is difficult to 
see that such an aggregate as that of “all propositions” could con¬ 
ceivably be ordered; where it is assumed that the meaning of the word 
“proposition” is taken as so definite, that this aggregate has a norm in 
accordance with the definition above. Again, to take an example among 
aggregates of the kind usually considered in Mathematical theory, we may 
consider the aggregate obtained by distributing the aggregate of real 
numbers upon itself. This aggregate which has the cardinal number / = c c , 
is equivalent to the aggregate of all the functions of a real variable; it is 
difficult, if not impossible, to see how order could be effectively imposed 
upon this aggregate. If then, a transfinite aggregate is to be given as an 
ordered aggregate, or is to have an order imposed upon it, or rather dis¬ 
covered in it, it would appear to be necessary that the norm, which 
constitutes the definition of the aggregate, should be of such a character, 
that a principle of order, is contained therein, or can at all events be 
adjoined thereto; so that, when any two particular elements are con¬ 
sidered, the conditions which they satisfy in virtue of their belonging to 
the aggregate, when individualized for the particular elements, may be 
sufficient also to allow of relative rank being assigned to those elements, 
in accordance with a principle of order. This is in fact the case in such 

* Orundzilge der Mengenlehre, Leipzig, 1914; also Math. Annalen, vol. lxv (1908), p. 435, and 
vol. lxxv (1914), p. 428; Leipz. Her. vol. Lin (1901), p. 460; vol. Lvni (1906), p. 106, and vol. lix 
( 1907), p. 84; Jahresber. d. Math. Vereinigung, vol. xm (1904), p. 670. 

t Grundbegriffe der Mengenlehre, Gottingen, 1906. 

t Math. Annalen, vol. Lvm (1904), p. 195; vol. lix (1904), p. 129; vol. lx (1905), p. 181, and 
p. 431. 

§ Math. Annalen , vol. lxi (1905), p. 117. 

|| Phil . Mag. (6), vol. ix (1905), p.42; (6), vol. vi (1903), p. 323, and vol. vn( 1904), pp. 61 and 294. 

If Amer. J. of Math. vol. xxiv (1902), p. 367, and Archiv d. Math. u. Physijc (3), vol. x (1906), 
p. 254; also Quarterly Journal , vol. xxxix (1908), p. 375. 

** Neue Grundlagen der Logik, Arithmetik und Mengenlehre , Leipzig (1914), also Math. Annalen, 
vol. lx (1905), p. 177, and p. 462. Also Comptes Rendus, vol. oxun (1906), p. 110, and Verhand. d. 
Math. Kongrem z. Heidelberg (1904), p. 144. 

ft See hie work, Einleitung in die Mengenlehre, 2nd ed., Berlin (1923). See also Chwistek, 
Math. Zeitschr. vol. xxv (1926), p. 439. 
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aggregates as those of the integral numbers, the rational numbers, or the 
real numbers, In the case, for example, of the positive rational numbers, 
the relative rank of any two particular elements (p, q), (p', q') is assigned 
by the system of postulations, contained in § 12, which defines the aggre¬ 
gate. It may, of course, also be possible in other cases, as in this one, to 
re-order the aggregate, in accordance with some other law, extrinsically 
imposed upon the aggregate; but the nature of the elements must be such 
that this is possible. 

We can now state that: 

In order that a transftnite aggregate, defined as in § 187, may be capable 
of being ordered, a principle of order must be explicitly or implicitly contained 
in the norm by which the aggregate is defined. 

The relative order of any two elements chosen from, an ordered aggregate 
depends upon the individual characteristics of those elements, in accordance 
with the principle of order. 

In the definition of order-type given by Cantor (see § 157), according 
to which the order-type of an aggregate is obtained by making abstraction 
of the particular nature of the elements of the aggregate, it is assumed 
that the aggregate is given as an ordered aggregate. Again, in his definition 
of cardinal number (see § 145), Cantor has assumed that the aggregate is 
given as an ordered one; the cardinal number there appears as the result 
of a double abstraction, viz. of the particular nature of the elements, and 
of the order in which they are given. The question however arises, whether 
the definition of cardinal number should not be such as to be also applic¬ 
able in the case of aggregates which are not given as ordered aggregates. 
Cantor has himself, in fact, in his theory of exponentials involving trans- 
finite cardinal numbers, contemplated certain aggregates as having cardinal 
numbers, whilst such aggregates were not given as ordered aggregates, 
and prima facie, at all events, are not capable of being ordered. 

190. Taking the case of an aggregate defined as an ordered aggregate, 
we now approach the consideration of the fundamental question, whether, 
and under what conditions, if any, such an aggregate can be regarded as 
having a definite order-type, and a definite cardinal number. This is 
equivalent to asking whether, or when, meanings can be given to those 
terms, in accordance with general definitions, of such a character that 
they can be treated as permanent objects for thought, or as mathematical 
entities which may themselves be elements in aggregates. 

With reference to Cantor’s definition (see § 145), of the cardinal number 
of a transfinite aggregate, by abstraction, in accordance with which the 
cardinal number is represented by replacing each element by an abstract 
unity, it must be observed that such a substitution would replace the 
given aggregate by another one which had no longer any intelligible 
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relation with the norm by which the original aggregate is defined. The 
abstract unities would be indistinguishable from one another, and the new 
aggregate would be indistinguishable from any other non-finite aggregate 
of such unities. It would be impossible to decide, as regards any par¬ 
ticular abstract unity, whether it belonged to the aggregate or not; in 
fact, to make complete abstraction of the individual nature of the elements 
of an aggregate is to destroy the aggregate. A definition by abstraction 
could be justified only by the interpretation that abstraction is made of 
those characteristics only, in which the elements of the aggregate differ 
from the corresponding elements of all possible equivalent aggregates. 
Thus the existence of aggregates equivalent to the given aggregate would 
appear to be essential, if the latter is to be regarded as having a cardinal 
number to which any definite meaning can be attached. On the grounds 
stated, the definition of a cardinal number, as the characteristic of- a class 
of equivalent aggregates, is to be preferred to the definition given by 
Cantor. Accordingly, an aggregate has a cardinal number, only when it 
is one of a plurality of equivalent aggregates, distinct from one another. 
In all cases the correspondence between equivalent aggregates must be 
definable by some norm. 

We are thus led to the following statement, containing a definition of 
cardinal number: 

The members of any particular class of equivalent aggregates have a quality 
in common in virtue of their equivalence. This quality is the cardinal number , 
and may be regarded as characteristic of each aggregate of the particular class. 

A cardinal number has been defined* by B. Russell to be a class of 
equivalent aggregates. It may then be urged that such a class may contain 
only one member, and that this is sufficient for the existence of the cardinal 
number of that member. In fact, Russell infersf the existence of the 
number n -f 1 from that of the numbers 0, 1, 2, 3, ... n. Russell objectsj 
to the conception of a number as the common characteristic of a family 
of equivalent aggregates, on the ground that there is no reason to think 
that such a single entity exists, with which the aggregates have a special 
relation; but that there may be many such entities, and that there are in 
fact an infinite number of them. The mind does, however, in point of fact, 
in the case of finite aggregates at least, recognize the existence of such a 
single entity, viz. the number, or degree of plurality, of the aggregate; 
and this creation, by abstraction, is a valid process, subject to the law of 
contradiction. 

191. In Cantor’s definition of the order-type of a simply ordered trans¬ 
finite aggregate (see § 157), abstraction is made of the nature of the 
elements, their order in the aggregate being alone retained. The order- 

* Principles of Mathematics, vol. i (1903), pp. 111-116. f /WdL p. 497. % Ibid. p. 114. 
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type is then regarded* as represented by an aggregate of abstract unities, 
in the order of the elements of the given aggregate. In any ordered 
aggregate, it is however the individual characteristics of any two elements 
which determine their relative order in the aggregate, in accordance with 
some principle of order, valid for the whole aggregate. If complete ab¬ 
straction be made of the characteristics of the various elements, order has 
then disappeared from the aggregate. It must be supposed that, in 
Cantor’s representation of the order-type, there are attached to the abstract 
unities marks of some kind, which may in particular cases be marks 
indicating position in space or time, by which the order of the various 
abstract unities is denoted; the given aggregate is then really replaced by 
an aggregate of these marks, and the abstract unities are superfluous. 
These marks, by which order is determined, must also have been associated 
with the elements of the original aggregate. It thus appears that, in a 
definition by abstraction, it can be only those characteristics (if any) of 
the various elements which are irrelevant in determining the order, of 
which abstraction is made: thus the aggregate is really replaced by a similar 
one. On these grounds, that definition of an order-type is to be preferred, 
in which the order-type is defined as the characteristic, or class-name, of a 
class of similar aggregates. Accordingly, in order that a given aggregate 
may have an order-type, to which a definite meaning can be attached, it 
is necessary that the aggregate be one of a plurality of similar aggregates. 

We may accordingly state that: 

The members of any particular class of similar aggregates have a quality 
in common , in virtue of their relation of similarity . This quality of mutual 
similarity possessed by the aggregates is their order-type , and may be repre¬ 
sented by a name or symbol , regarded as characteristic of each aggregate of 
the particular class. 

The considerations above adduced may be applied in the case of an 
aggregate which is a segment of the hypothetical aggregate of all ordinal 
numbers. In this case it is impossible to make abstraction of the nature 
of the individual elements of the aggregate, without destroying the order, 
because the elements are themselves nothing more than marks indicating 
order. 

192. The question has frequently been discussed whether the con¬ 
ception of an ordinal number is necessarily prior to that of a cardinal 
number. The view is held that the very earliest conception of a number 
was developed as the result of immediate intuition of the degree of plurality 
in a very small group of objects, for example of a pair, or of a triplet, or 
of the unity in a single object, without recourse to the process of counting. 
In this manner the conception of the first few integral numbers may have 


* See Math. Anna,Jen, vol. xlvt (1895), p. 497. 
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arisen, and the conception would be that of cardinal number; the objects 
in a group not being regarded as in any particular order. This mode of 
apprehension of the number of objects in a group must certainly have 
been extremely limited in its scope, and the conception of numbers, after 
the first few, must have arisen, as explained in § 1, in connection with the 
process of counting, in which the objects are taken in some definite order; 
thus leading in the first instance to the ordinal number of the group. It 
appears certain, then, that the actual order of development of Arithmetic, 
except for the case of the first few numbers, must have been one in which 
the ordinal number has a certain priority to the cardinal number. 

In the systematic development of the theory of numbers, finite and 
transfinite, the notion of an ordered aggregate has certainly been funda¬ 
mental, and not derivative. A reversal of the historical procedure, in which 
ordered aggregates, having definite ordinal numbers or order-types, were 
first considered, and afterwards the notion of the cardinal number has 
arisen, would lead to probably insuperable difficulties. It is no doubt 
possible to define the cardinal number of a transfinite aggregate, as has 
in fact been done above, in such a way that the notion of order does not 
explicitly appear, but the question of the comparability of cardinal numbers 
so defined at once arises, and the consideration of ordered aggregates would 
appear to be indispensable for the development of any systematic arith¬ 
metic of cardinal numbers. Cantor’s Aleph-numbers, as forming an ordered 
family of cardinal numbers, are essentially dependent upon the theory of 
the order-types of normally ordered aggregates. It is difficult to see, for 
example, how it could be known, in respect of a particular aggregate, that 
it has the cardinal number X 0 , unless a correspondence of the elements of 
the aggregate with the ordered set of finite numbers could be shewn to 
exist. It may be asserted that, since any non-finite part of an enumerable 
aggregate is itself enumerable, there may exist a part of the integer se¬ 
quence 1, 2, 3, ... n, ... which cannot be defined by means of a finite set 
of rules, and yet that such an aggregate must have the cardinal number 
N„. A distinction is accordingly drawn, by some writers*, between those 
enumerable aggregates for which a correspondence with the integer se¬ 
quence 1, 2, 3, ... can be effectively defined, and those for which this is 
not the case. It can, however, be shewn that there exists no part of the 
integer sequence {n} which is inherently incapable of finite definition. It 
has been shewn, in § 63, in the case of the numbers of the continuum, that 
those numbers, in any interval, that are capable of finite definition, form 
an aggregate which has all the properties of the continuum itself; and in 
view of the theory of the order-type of the continuum given in § 163, 
that aggregate is identical with the continuum. 


* for sample, Borel, Annales sc . de Vdcole normals , (3), vol. xxv (1908), p. 447. 
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Any non-finite part of the integer sequence {n} can, however, be made 
to correspond with a particular number of the continuum in the interval 
(0, 1); we take such a number expressed as a radix fraction in the binary 
scale, defined by the rule that the rath digit is 0 or 1, according as the 
integer ra occurs, or does not occur, in the part of the integer sequence 
which is under consideration. Since there is a (1, 1) correspondence be¬ 
tween the parts of the integer sequence and the numbers of the continuum 
within the interval (0, 1), it follows that, since, as has been shewn in § 63, 
no number of the continuum is inherently incapable of finite definition, 
the same is true of the parts of the integer sequence. It thus appears that 
no enumerable aggregate is inherently incapable of being placed into 
correspondence with the numbers of the integer sequence, by a finite set 
of rules, although a particular fixed apparatus of definition may be in¬ 
sufficient for this purpose. 

193 . We proceed to consider those aggregates which consist of ordinal 
numbers in their order of generation. 

There are two distinct methods of establishing the existence of a class 
of mathematical entities. 

(1) . Their existence, as definite objects for thought, may be shewn to 
follow as a logical consequence of the existence of other entities already 
recognized as existent, or of principles already recognized as valid; so that 
the existence of the new entities in question cannot be denied without 
coming into contradiction with truths already known, or at all events with 
postulations already made. This method may be termed the genetic method. 

(2) . The existence of the entities may be directly postulated; and their 
mutual relations, and their relations with other entities already assumed 
to exist, may be defined by means of a complete system of definitions and 
postulations. Accordingly, the objects in question are a relatively free 
creation of our mental activity. The validity of the scheme thus set up 
is established when it is shewn to be free from internal contradiction. Its 
utility is to be judged by its applicability to the general purposes of the 
science, and by the light it may throw upon the fundamental principles 
of that science, in virtue of the scheme containing a generalization of what 
was previously known. This method may be termed the method of postu¬ 
lation. It may, however, be urged that the failure to discover contradic¬ 
tions within a scheme of postulations is no proof that such contradictions 
do not exist, and that such proof can only be supplied by the exhibition 
of a system of entities already assumed to exist, such that the relations 
between them are in accordance with those postulated in the scheme in 
question. 

Both these methods have been employed by Cantor in his theory of 
transfinite numbers and order-types. In his earlier treatment of the subject, 
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he employed the second of the above methods. The existence of the new 
number co, and of the limiting numbers of the second class, was postulated, 
in accordance with the second principle of generation. Freedom from 
contradiction, and utility in connection with the theory of sets of points, 
which suggested the postulations, were relied upon as the grounds upon 
which the system of new numbers was to be justified. The first number £2, 
of the third class, was introduced by a new postulation. 

In his later and more abstract treatment of the subject, an account of 
which has been given in the present chapter, Cantor applied the genetic 
method. The existence of the number a) is not directly postulated, but is 
taken to follow from the existence of the aggregate {/?}, of integral numbers; 
to is defined to be the order-type of this aggregate, and it is assumed that 
such order-type is a definite object which can itself be an element of an 
aggregate. The successive ordinal numbers of the successive classes are 
obtained by assuming as a general principle, that an ordered aggregate 
necessarily possesses a definite order-type which can be regarded as itself 
an object, the ordinal number coming immediately after all those that 
are the elements of the aggregate of which it is the order-type. 

It will be seen later that the assumptions that an ordered aggregate 
necessarily possesses a definite order-type and that it also possesses a 
definite cardinal number, both of which can be regarded as objects, lead 
to the contradiction pointed out by Burali-Forth It appears, therefore, 
that the class of entities, which is constituted by the ordinal numbers of 
all classes, and the similar aggregate of Aleph-numbers, do not satisfy the 
condition of being subject to a scheme of relations which is free from con¬ 
tradiction. In fact, the principle, in accordance with which their existence 
is inferred, conflicts with the definition of the aggregates as containing 
respectively every ordinal number and every Aleph-number. It would 
then appear that the genetic process which led to the definition of the 
aggregates of all ordinal numbers, and of all Aleph-numbers, cannot be a 
valid one without some restriction. Thus the principle that every ordered 
aggregate has a definite order-type, which may be regarded as a permanent 
object of thought, cannot be accepted as a universal principle to be used 
in a genetic mode of establishment of the existence of a class of entities. 
A denial of the validity of this principle does not however preclude the 
less ambitious procedure of postulating the existence of definite ordinal 
numbers of a limited number of classes, in accordance with Cantor’s earlier 
method. So long as the postulation of the existence of ordinal numbers 
does not go beyond some definite point no contradiction will arise, and 
the validity of the scheme, for purposes of representation, will suffice to 
justify the postulations which are made. An attempt to examine the 
structure of such a class of ordinal numbers, as that of the cuth class, with 
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cardinal number or that of the £2th class, with cardinal number 
will lead to the conviction that such conceptions are unlikely to prove 
capable of useful application in any branch of Analysis or of Geometry. 

THE PARADOXES OF BURALI-FORTI AND RUSSELL 

194 . In accordance with Cantor’s general theory of ordinal numbers, 
and of Aleph-numbers, there exist two aggregates 

1, 2, 3, ... n, ... co, ~f* 1, ... 12, £2 -j- 1, ... /3, ... 

K 0 , K n+ i,... 

the first, the aggregate of all ordinal numbers, which may be denoted 
by W ; and the second, that of all X cardinal numbers. These aggregates 
are similar to one another, and they contain, respectively, every ordinal 
number, and every cardinal number which belongs to a normally ordered 
aggregate. 

The aggregate W is normally ordered. For, if be any part of W y 
such that W contains one or more elements of higher order than all the 
elements of W \, it follows that W l is a part of some segment of W that 
also contains one or more such elements. Since such segment is normally 
ordered, there exists one element of that segment, and therefore of W, 
which immediately follows the part-aggregate W x . It is thus seen that, 
since the conditions of § 165 are satisfied, W is normally ordered. In 
accordance with the principle which is fundamental in the whole theory, 
that every normally ordered aggregate has a definite order-type, which is 
its ordinal number, and has also a definite’ cardinal number, the aggregate 
W has an ordinal number y, and a cardinal number X y . The ordinal 
number y must itself occur in the aggregate W, and must therefore be 
the greatest ordinal number, i.e. the last element of W. There can, how¬ 
ever, be no last ordinal number; for, on the assumption of the existence 
of y, an aggregate, of ordinal number y + 1, can be formed, by adding 
to IT a new element e, of higher rank than all the elements of W. Therefore 
a contradiction has been arrived at. 

This contradiction in the conception of the aggregate W, of all order- 
types of normally ordered aggregates, was pointed out by Burali-Forti*, 
but as is stated by Jourdainf, it was discovered by Cantor in 1895, and 
communicated by him to Hilbert in 1896, and to Dedekind in 1899. The 
contradiction has been discussed by various writers, and the conclusions 
they draw from its existence are not in full agreement with one another. 
Burali-Forti inferred from it that ordinal numbers are not comparable 
with one another, but this is refuted by Cantor’s theorem in § 168. Jourdain 
shews that W is normally ordered, but concludes that it possesses neither 

* Rend, del circolo mat. di Palermo , vol. xi (1897), p. 164. 
f Phil. Mag. (6), vol. vn (1904), pp. 67, 70. 
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order-type nor cardinal number. He describes W as an “ inconsistent ” 
aggregate, a term which had been previously employed by Cantor in the 
same connection. It would thus appear that there are aggregates which 
have no order-type, and no cardinal number, and that the above aggre¬ 
gates belong to such class. Such aggregates are called inconsistent aggre¬ 
gates, in virtue of the fact that the act of synthesis, by which they are 
formed, cannot be carried out without leading to a conception which contains 
an element of contradiction. 

It was pointed out by Schoenflies* and by Bernsteinf, that the hypo¬ 
thetical aggregate W lacks one property that belongs to all its segments, 
and to all normally ordered aggregates, viz. that a new normally ordered 
aggregate can be formed by the addition of a new element, of higher rank 
than all those of the original aggregate. In fact the first principle of 
generation of the successive ordinal numbers in Cantor’s scheme is that 
to each ordinal number there follows a next one, and it is clearly impossible 
that the aggregate of all ordinal numbers can have an ordinal number that 
is subject to this principle. It thus appears that W can have no ordinal 
number, or order-type. This being so, the question, whether W can be 
properly said to be an aggregate, or not, depends on the precise meaning 
assigned to the term aggregate. It would appear desirable that such a 
limitation should be given to the definition of the term that such hypo¬ 
thetical aggregates as W would not fall under the definition. All ordinal 
numbers form a class, and thus doubt is thrown upon the advisability of 
making the term aggregate synonymous with that of the “class” of Logic. 
Moreover, it would appear desirable that the definition of an aggregate 
should be so restricted that every aggregate has a cardinal number and, 
if normally ordered, an order-type. It has been pointed outj by Mirimanoff 
that the antinomy of Burali-Forti would arise, even if the existence of all 
the ordinal numbers in the scheme of Cantor be not admitted. Instead 
of W, the hypothetical aggregate of all existing ordinal numbers could be 
contemplated, and the same reasoning would shew that it would be con¬ 
tradictory to ascribe an ordinal number, or order-type, to this aggregate. 
Let us assume, for example, that the finite ordinal numbers were the only 
ordinal numbers whose existence was admitted; it would then be contra¬ 
dictory to regard the totality of all these ordinal numbers as having an 
ordinal number, or order-type; in fact this totality would be an “incon¬ 
sistent” aggregate, in the sense employed by Cantor and Jourdain. 

The mode of generalizing the ordinal numbers in Cantor’s scheme re¬ 
quires a fresh postulation of existence, in the case of the lowest numbers 
of each of the successive classes. Although the contradiction of Burali- 
Forti cannot be invoked to shew that such a postulation leads to contra- 

• Berichty 2ter Theil, p. 27. f Bernstein, Math. Annalen , vol. LX (1905), p. 187. 

X See L'enseignement mat.y Jan. to March, 1917. 
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diction, as long as we refrain from considering such an aggregate as that 
of all ordinal numbers, or of all existing ordinal numbers, it cannot be 
directly shewn that at no stage of the successive postulations does some 
contradiction arise. It would appear then that the validity of the whole 
scheme, being subject to the possibility of the discovery of contradictions 
at some stage or other, the existence of the numbers in Cantor’s scheme 
must be regarded, from the point of view of the Mathematician, only as a 
working hypothesis. The utility of such hypothesis must be judged by its 
consequences and its applications in Analysis. 

195 . The antinomy* of Russell depends upon the consideration that 
two species of aggregates may be contemplated. An aggregate M, of the 
first species, does not contain M itself as an element; but an aggregate M 
of the second species does contain M itself, as an element. An example 
of an aggregate of the second species is the aggregate of all aggregates, 
rt is clear that an aggregate of the second species contains an element 
which is itself an aggregate of the second species. It follows that an 
aggregate, all of whose elements are aggregates of the first species, is itself 
of the first species. The aggregate which contains, as elements, all aggre¬ 
gates of the first species is a notion essentially affected with contradiction. 
For it must be itself of the first species, since all its elements are of that 
species; on the other hand, it must be of the second species, since it must 
contain itself as element ; for the aggregate itself is one of the aggregates 
of the first species. 

It thus appears that the aggregate of all aggregates of the first species 
cannot exist as a notion free from contradiction. 

This contradiction may be considered in relation both to Cantor’s 
definition of an aggregate, and to the definition of an aggregate as a class. 
In accordance with Cantor’s definition, the elements of an aggregate, being 
logically prior to the aggregate, must be definable in a manner which does 
not make use of the aggregate itself in the definitions; and thus it would 
appear that no aggregate which contained itself as an element would fall 
under Cantor’s definition of an aggregate. On the other hand, the defini¬ 
tion of an aggregate as a class would not prima facie preclude the contem¬ 
plation of such a conception as the class of all classes. 

It is clear that the proper definition of an aggregate should be such 
as to exclude all hypothetical aggregates which contain elements that 
cannot be defined without employing the aggregate itself; in the language 
of Poincar6, non-predicative definitions should be excluded. 

196 . With a view to the elucidation and generalization of the anti¬ 
nomies of Burali-Forti and Russell, a classification of aggregates has been 

* See Principles of Mathematics, vol. I (1903), chap. x. 
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given* by Mirimanoff, according to the modes in which their elements are 
composed of other elements. The elements m, of an aggregate M, may be 
simple elements which are not decomposable into other elements. On the 
other hand, an element m may itself be an aggregate composed of elements e ; 
thus m = {e}. Again, the elements e, of which m is composed, may them¬ 
selves be aggregates composed of other elements e! ; thus e s {e'}. 

It is clear that this process of decomposition of the elements of an 
aggregate into aggregates of other elements, and of these elements into 
aggregates of other aggregates, may proceed until only simple elements, 
no longer decomposable, are obtained; or on the other hand the process 
may go on indefinitely. 

An aggregate is said, by Mirimanoff, to be an ordinary aggregate, when 
each element m is such that, after a finite number of decompositions, 
simple undecomposable elements are obtained. An aggregate which does 
not possess this property is said to be an extraordinary aggregate. 

Two aggregates are said to be isomorphic when (1), they are equivalent, 
i.e. when there is a (1, 1) correspondence between their elements, and 
(2), when, m, m' being two corresponding elements, which are decom¬ 
posable, there is also equivalence between the two aggregates of which 
they consist, and when such equivalence also holds in case further de¬ 
composition can take place, and so on, so long as any continued decom¬ 
position of elements can be made. 

An aggregate is said to be of the first species if it is not isomorphic 
with any of its elements, and it is said to be of the second species if it is 
isomorphic with one at least of its elements. The hypothetical aggregate, 
which consists of all aggregates of the first species, or also that of all 
ordinary aggregates, involves the same contradiction as in the case of 
Russell’s antinomy. 

An aggregate of the second species is an extraordinary aggregate, but 
an aggregate of the first species may also be extraordinary. 

THE MULTIPLICATIVE AXIOM 

197 . An axiom, known variously as the “multiplicative axiom,” the 
“principle of Zermelo,” and the “general principle of selection,” which 
can be expressed in several equivalent forms, was first stated explicitly 
in connection with a proof given by Zermelo|, in 1904, that every aggre¬ 
gate can be ordered in normal order; although it had been implied in the 
reasoning of various writers in connection with special theorems in the 
theory of sets of points, and in the theory of functions. The axiom has 
been statedj by Zermelo in the following form: 

* Venseignement mat., Jan. to March, 1917, p. 42. 

f Math . Annalen, vol. lix (1904), p. 514. J Math. Annalen , vol. lxv (1908), p. 110. 
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An aggregate S, which falls into an aggregate of separate parts A, B,C, ..., 
each of which contains at least one element , possesses at least one part S, which 
has in common with each of the parts A, B, C 9 ..., exactly one element. 

In accordance with the definition, given in § 149, of the product of a 
finite, or infinite, set of aggregates, as an aggregate of which each element 
consists of an association of elements, one from each of the aggregates of 
the given set, the above principle is equivalent to the assertion that the 
product of the aggregates of the given set contains at least one element; 
and thus the principle may be termed the “multiplicative axiom,” in that 
it asserts the existence of the product. 

The principle can be stated in the less objective form that a choice 
can be made of a single element, from each one of a set {M} of aggregates 
M ; an aggregate thus being formed by an infinite number of such acts of 
choice. When viewed in this manner the principle is spoken of as the 
“general principle of selection” (das Prinzip der Auswahl). 

Tn the discussions to which this principle, or axiom, has given rise, 
much divergence of opinion on the part of mathematicians has emerged. 
Borel, who pointed out that Zermelo’s proof that an aggregate can be 
normally ordered simply establishes that this theorem is equivalent to the 
principle of selection, distinguished* between an enumerable and an un- 
enumerable set of acts of choice, and rejected, as outside the domain of 
Mathematics, all reasoning founded upon the supposition of an unenumer- 
able set of acts of choice. The impossibility of proving the principle was 
expressly stated by Zermelo, and emphasized by Borel, and by Peanof. 
Poincare, in the course of his discussionj of the “Logistic” of Peano and 
Russell, expressed the view that the principle, although incapable of proof, 
is an indispensable axiom. 

198 . By some Mathematicians, the employment of an infinite set of 
acts of choice, even if that set be enumerable, is rejected as outside the 
domain of Mathematics, on the ground that the hypothetical entity, the 
existence of which is asserted in such a case by the principle, is not properly 
defined. 

In the course of a discussion § by Hadamard, Borel, Baire, and Lebesgue, 
the distinction was taken into account between independent acts of choice 
and such as are dependent upon acts of choice that have been previously 
made. 

In those special cases in which the aggregates are of such a nature that 
it is possible effectively to define an aggregate which consists of one element 

* See Math. Annalen , vol. lx (1905), p. 195. f Rivwta di Mai. vol. vm, No. 5, p. 145. 

X See the Revue de Mitaphysique et de Morale, vols. xui (1905), p. 816 and xiv (1906), p. 866. 

§ “Cinq lettrea sur la th^orie des ensembles,” Bull . de la soc. math, de France , vol. xxxm 
(1905), p, 261. 
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belonging to each of the given aggregates, the principle is not required as 
an axiom, but such special cases may be regarded as instances in which 
the truth of the principle is verified. In a case in which we are unable to 
give such effective definition, the principle amounts to an assertion of the 
existence of the aggregate in question, apart from any question of effective 
definition, and at least asserts that we are warranted to make the same 
deductions as if we were in possession of such effective definition. In this 
connection it may be pointed out that there is a certain relativity in the 
notion of effective definition, as has already been shewn (§ 63) in the case 
of the definition of numbers of the arithmetic continuum; in fact, the 
possibility of effective definition of an object is relative to an apparatus 
of definition which cannot be considered as once for all, and finally, fixed. 

Even if the number of aggregates from which the selection is to be 
made be finite, there remains a question as to the scope of the axiom. It 
is sufficient to consider whether, and in what sense, it is always possible 
to select a single element from a single given aggregate. Whether this can 
be done effectively will depend upon the mode in which the aggregate is 
defined. For the possibility of such choice, it must in the first place be 
known that the aggregate is not a null-aggregate, i.e. that it contains at 
least one element. The assumption of this knowledge being made, the 
effective determination of an element may be impracticable, or difficult; 
but this will not affect its logical possibility. It appears, however, that 
there are cases in which even the logical possibility of determination 
cannot be demonstrated; and in such cases recourse must be had to the 
axiom of existence, if an element of the aggregate in question is required 
for the purpose of any process of reasoning in which such element is em¬ 
ployed. 

Some of the objections raised to the axiom amount to the assertion 
that the axiom is meaningless, if it is interpreted as asserting the existence 
of an aggregate of which the elements cannot, even in theory, be effectively 
defined. The difference of opinion upon the point, as to whether, or not, 
an aggregate can be said to exist, when none of its elements can be effec¬ 
tively defined, would appear to reveal a fundamental difference of view 
on a matter of Ontology, which cannot be resolved within the domain of 
Mathematics, as it involves an irreconcilable divergence of attitude in 
general Philosophy. It was pointed out by Hadamard (loc. cit.) that the 
real question at issue is whether it is possible to demonstrate the existence 
of mathematical entities which cannot be precisely defined; a question 
which Hadamard himself answers in the affirmative, though with this 
answer the other writers do not appear to be in agreement. By some 
mathematicians, the principle is accepted in a pragmatic sense, as a useful 
instrument of Mathematical research, and as a guide which suggests results 
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that may be verifiable by other means, although the principle cannot be 
regarded as standing upon an absolutely firm basis. 

The case for the acceptance of the axiom rests partly upon the fact 
that the postulation of its validity had been implicitly made, as self- 
evident, by many writers, in various investigations in the theory of sets 
of points, and in the theory of functions, before it was stated explicitly; 
and that it has since not only proved fruitful in many further developments 
of those theories, but that results obtained by its use have been verified 
in many important cases. A detailed analysis has been given* by Sierpinski 
of general results in the theories of sets of points, and of functions, which 
cannot be established without the aid of the axiom, and of other cases in 
which the employment of the axiom is not necessary. The method of 
G. H. Hardy (§ 186), for the construction of a set of points of cardinal 
number is a case in which the principle of selection is required, since 
there exists an indefinite number of sequences of the ordinal numbers, 
preceding a limiting ordinal number y, of each of which y is the limiting 
number. It is not possible to give a norm by which these sequences can 
be determined. 

199. A limiting point P, of a set of points G, has, in § 52, been defined 
as a point such that in every neighbourhood of P there are points of G . 
It has been pointed out by Sierpinski that, in the general case, the inference, 
from this definition, that there exists a sequence P x , P 2 , ... P„, ... of points 
of (7, which converges to P as sole limiting point, so that the distance PP n 
converges to zero, as n is indefinitely increased, cannot be made without 
having recourse to that case of Zermelo’s axiom in which the selections 
are made from an enumerable sequence of sets of points. 

For simplicity, let us consider the case in which G is a linear set; this 
differs in no essential respect from the case in which G is a set in p dimen¬ 
sions, where p > 1 . Let us suppose that the point x — 0 is a limiting point 
of G , in accordance with the definition of § 52. Let G n be that part of G 

which consists of points x that satisfy the conditions - s | x | < ~ . 

Thus G n is in the two half-closed intervals (——,, -V (— - 7 — 7 ^ . 

\n -f 1 n) \ n n -f 1J 

Consider now the sequence of sets G 1 , G 2 , ... G n , ... of which the sum is 
the relevant part of G. The axiom of Zermelo asserts the existence of a 
set of points P r , P r+1 , ... P n , ...; such that P n belongs to G n , for all the 
values r, r 4- 1 , ... of n . Unless it is possible to define by a norm such a 
set of points, its existence can only be assumed in virtue of the axiom. 
This set of points is a sequence such as is required, which converges to 
the point x = 0 . 


* Bull, de Vacad. des sciences de Cracovie , April—May, 1918. 
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There is one important case in which Zermelo’s axiom is not required, 
viz. when G contains a known enumerable set //, for example the set of 
rational points, that is everywhere dense in G. The point P n may then 
be defined as that point of H, of lowest rank when H is arranged in enu¬ 
merable order of type a>, which belongs to G n . 

It has been shewn conversely by Sierpinski that the following theorem 
follows as a consequence of the assumption that, if P be a limiting point 
of a set G y in accordance with the definition in § 52, there exists a sequence 
{ P n }, of points of G , which converges tb P as its sole limiting point. 

If G x , G 2 , ... G n , ... be sets of points , no two of winch have an element in 
common, there exists a sequence P 1 , P 2 , ... P n , ... of points, such that each 
one belongs to one of the sets of {G n } and that no two of them, belong to the same 
set of {<?„}. 

By means, for example, of the transformation 

^ 2 n (n + 1) (f V | x | h 2n + *) ’ 

the set G n can be put into correspondence with a set Q n , all the points f 

of which are interior to the interval (-, 1 ) . Let Q denote the set of 

\n n+ 1/ 

points £, given as the sum of all the sets Q l , Q 2 , ... Q n ,, ...; then the point 
f 0 is a limiting point of the set Q. 

Now let it be assumed that there exists a sequence {^ w }, of points of Q, 
which converges to f — 0 as its sole limiting point. We may suppose that 
there is only one of these points in any one set Q n , for, if q m belongs to 
a set of {Q n } to which any one of the points q x , q 2 , ... q m ~ x belongs, we may 
remove q m from the sequence of points. Let us suppose that q n belongs 
to Qn ; then by applying the inverse transformation to that of x into 
we have a sequence of points P ly P 2 , ... P n , ... which belong to the sets 

G l9 G 2 , ... G Tk , ... 

respectively. 

200 . The question of the justification of the assumption that every 
infinite aggregate contains an enumerable aggregate as a part is related 
to the principle of selection. It was assumed by Cantor that it is always 
possible to pick out of a given aggregate, successively, elements that 
correspond to the numbers of the integer sequence. In a large class of 
cases this can be effected by means of a suitable norm, but in the general 
case an infinite number of acts of choice, each of which is affected by the 
choices already made, are required. In case, however, a given aggregate M 
can be divided into an enumerable set of parts M x , M 2 , M z , ..., no two 
of which have an element in common, one element may be selected from 
each of these parts, in accordance with the principle of selection; and thus 
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the existence of the enumerable part of M follows as a consequence of the 
principle. In case M is an aggregate of points, in one or more dimensions, 
an enumerable set of non-overlapping cells, or intervals, may be defined, 
each of which contains a part of M ; and we may take the parts 

M\ , M 2 , Af 3 , ..., 

of Jf, to be those parts that are contained in the cells, or intervals, of 
the set. 


MULTIPLE CORRESPONDENCE 

201. A propos of a criticism*, by Levi, of Bernstein’s proof (see § 185), 
that the aggregate of all closed sets iff the ^-dimensional continuum has 
the power c of the continuum, Bernsteinf has endeavoured to avoid the 
difficulty involved in the use of a correspondence which cannot be effec¬ 
tively defined, by introducing the conception of "multiple equivalence.” 
Thus, if there are two aggregates M and N, for which an aggregate O = {<£} 
of reversible (1,1) correspondences <f> exists, in which no element is special 
(ausgezeichnet), then the two aggregates are said to be multiply equivalent. 
The cardinal number O = / is then termed the multiplicity of the corre¬ 
spondence. In case the multiplicity is unity, the aggregates are said to 
have a one-valued correspondence. The difficulty of this conception is 
the same as that in the multiplicative axiom itself, viz., that the aggregate 
<l> employed is such that no single element of it is capable of definition, and 
that the elements are consequently indistinguishable from one another. 

THE NORMAL ORDERING OF AN AGGREGATE 

202. It was regarded by Cantor as highly probable that every aggre¬ 
gate is capable of being normally ordered; and this is equivalent to the 
theorem that every transfinite cardinal number is an Aleph-number. If 
this principle could be accepted as valid, the particular theorem would 
follow, that the arithmetic continuum is capable of being normally ordered; 
and the only question which would remain open, as regards this aggregate, 
would be as to which particular Aleph-number is the cardinal number c 
of the continuum. 

No proof of the correctness of his surmise was published by Cantor 
himself, but a proof was given by Jourdainf, which depends upon an 
argument in which the ‘‘ inconsistent ” aggregate W (§ 194) was employed 
to shew that no cardinal number can be greater than every Aleph-number. 
Apart from the employment of the concept of the aggregate W , which 
has been shewn to be affected with contradiction, this proof involved 
implicitly the principle of selection. 

* Lomb. 1st. Rend . (2), 1902, p. 863. t OdUinger Nachrichien, 1904, p. 557. 

X Phil. Mag. (6), vol. vii (1904), p. 67; see also Math. Annalen , vol. lx (1905), p. 465. 
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A second, and more cogent, proof was given* by Zermelo, in connection 
with which the axiom associated with his name was explicitly stated as 
necessary for the purpose of his proof. He has later published! a new 
form of the proof, together with a reply to criticisms of his earlier proof. 

It is assumed that, in each part M', of a given aggregate M, one 
element m', called the special (ausgezeiehnetes) element of M', can be 
chosen. The possibility of doing this for all the parts of M follows from 
the axiom. Each clement M ', of {M'}, corresponds to a special element m f 
which belongs to M ; and this particular mode of distributing the elements 
of M upon the elements of {M'} is called a “covering’’ y; the employment 
of a particular “covering” y is essential to the proof. A y-aggregate is 
then defined as follows:—Let M y be a normally ordered aggregate con¬ 
sisting of different elements of M , such that, if a be any arbitrarily chosen 
element of M y , and if A be the segment of M y defined by a, which segment 
consists of all the elements of M y that precede a, then a is always the 
special element of M — A. Every such aggregate M y is a y-aggregate. 
If every element of M which occurs in a y-aggregate be called a y-element 
of M, it is shewn that the aggregate L y , of all y-elements, can be so ordered 
that it is itself a y-aggregate, and contains all the elements of the original 
aggregate M. It follows that M can be normally ordered. 

It will be observed that the theorem that any aggregate can be 
normally ordered does not assert the possibility of effectively carrying 
out, in the case of any given aggregate, the process of arranging the 
elements of the aggregate in normal order. The theorem must rather be 
regarded as an existence theorem, deducible from certain postulates which 
include the multiplicative axiom. 

In the later form of his proof, Zermelo states the theorem in the 
following manner: 

If there corresponds to each part of the aggregate M , one element of 
that part, its special element, then the aggregate U (M), which contains 
as its elements all parts of M , has one and only one element M , such that 
there corresponds to each part P, of M, one and only one element P 0 , 
of M, such that P 0 has P as a part, and an element of P as its special 
element. The aggregate M is normally ordered by means of M. 

The existence of the aggregate U ( M ) as a valid conception is assumed. 

THE COMPARABILITY OF AGGREGATES 

203. From the theorem that every aggregate can be normally ordered, 
it would follow that the cardinal number of any infinite aggregate must 
be an Aleph-number, or the equivalent theorem of the “comparability of 
aggregates,” that of any two aggregates one at least is equivalent to a 
* Math,. Annalen , vol. lix (1904), p. 514. f Math. Annalen , vol lxv (1908), p. 107. 
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part of the other. It has been proved* by Hartog, conversely, that if the 
comparability of every pair of aggregates be assumed, the theorem that 
any aggregate can be normally ordered can be deduced. This theorem is 
established on the basis of a system of axioms givenf by Zermelo. 

It follows from Hartog’s result, combined with Zermelo’s theorem, 
that the three principles of selection, of comparability of two aggregates, 
and of the normal ordering of an aggregate, are completely equivalent to 
one another, in that any two of them can be deduced from the third. In 
case none of the three principles is assumed to be valid, Hartog’s investi¬ 
gation proves that there exists no aggregate whose cardinal number is 
greater than all the Aleph-numbers. The proof depends upon a classifica¬ 
tion of all the normally ordered aggregates of which the elements are also 
elements of a given aggregate M ; the validity of the conception of the 
aggregate of all such normally ordered aggregates, corresponding to a given 
aggregate M, being assumed. 

204 . The discussion of the fundamentals of the theory of aggregates, 
given in the latter part of the present chapter, in which the divergence of 
view amongst mathematicians, as regards some parts of the theory, has 
been indicated, makes it clear that, for some time to come, a critical 
attitude is likely to be maintained in some quarters as regards the theory 
of the transfinite, at least in some of its developments. The extreme 
sceptical attitude has been expressed by Poincare, in the wordsj “II n’y 
a pas d’infini actuel; les Cantoriens Font oublie, et ils sont tombes dans 
la contradiction.” Even if the general theory of classes of order-types and 
of Aleph-numbers be regarded by many mathematicians as still in the 
region of speculation, nevertheless the debt which Mathematical Science 
owes to the genius of G. Cantor will not be materially diminished. The 
fundamental distinction between enumerable and unenumerable aggre¬ 
gates, the interpretation of the arithmetic doctrine of limits, the ordinal 
theory of the arithmetic continuum, and the conception of the ordinal 
numbers of the second class, with their application to the theory of sets 
of points, remain as permanent acquisitions, independently of the accept¬ 
ance of the whole of the higher developments of the abstract theory of 
aggregates. This order of ideas has become indispensable, for purposes of 
exact formulation, in Analysis and in Geometry; it is constantly receiving 
new applications, owing to its admirable power of providing the language 
requisite for expressing results in the theory of functions with the highest 
degree of rigour and generality. Cantor’s creations have rendered in¬ 
estimable service in formulating the limitations to which many results in 

* Math. Annalen , vol. lxxvi (1915), p. 438. 

f Math. Annalen , vol. lxv (1908), p. 261. 

J Revue de Metaph. et de Morale , vol. xiv (1906), p. 316. 
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Analysis, formerly supposed to be universally valid, are subject. The 
outlying parts of the theory, although they open up a most interesting 
field of investigation, from which much may be hoped for in the future, 
do not appear as yet to be comparable in importance, for the general 
purposes of Analysis, with those parts which are accepted as fully estab¬ 
lished by all except the most extreme finitists. The fact that the general 
theory of Aleph-numbers, as an abstract development of the theory of 
order, has received but few applications in the theory of functions, differ¬ 
entiates it from the theory of normally ordered enumerable aggregates, 
which has now become a most useful instrument of discovery in the theory 
of functions of one or more variables. All aggregates of points of a con¬ 
tinuum, which are defined by the methods in ordinary use, have either 
the power of the aggregate of rational numbers, or else that of the arith¬ 
metic continuum itself. The theories of these two kinds of aggregates, 
including as they do a complete arithmetic theory of limits, would thus 
appear to afford a sufficient basis for all the ordinary parts of Analysis. 



CHAPTER V 


FUNCTIONS OF A REAL VARIABLE 

205 . If we suppose that an aggregate of real numbers is defined, the 
aggregate being either enumerable, or of the power of the continuum; 
such an aggregate is said to be the domain, or field, of a real variable. 
It is necessary for the purposes of Analysis to be able to make statements 
applicable to each and every real number of the aggregate, and which shall 
be valid for any particular number that may, at will, be selected. This is 
done by employing the real variable , denoted by some symbol other than 
those used to denote real numbers; and the essential nature of the variable 
consists in its being identifiable with any particular number of its domain. 
The symbols used for denoting variables differ from those employed in 
the case of numbers in being non-systematic. Operations involving real 
variables x n) , x {2) , x {3) , ..., with, or without, particular numbers, are 
carried out in conformity with the same formal laws as hold in the arith¬ 
metic of real numbers. The result of any such operation is itself a variable 
with a domain of its own, which may, or may not, be identical with that 
of any of the constituent variables. 

The numbers being used to designate in the usual manner the points 
of a set on a straight line, the variable may then be taken to refer to the 
points of the set. 

If the given set of points be bounded, in the sense explained in § 47, 
then the domain of the variable is said to be limited , or bounded. When 
the domain of the variable is not limited, it is said to be unlimited , or 
unbounded , in one or in both directions. 

The variable is said to be continuous, in a given interval (a, 6), when 
all the points of the interval, including a and 6, belong to the domain of 
the variable. If the points a, b do not belong to the domain, but every 
internal point of the interval does so belong, the variable is said to be 
continuous in the open interval (a, 6), or within the interval (a, b). The 
corresponding definitions apply to the case of an aggregate of any number 
p of dimensions, which is regarded as the domain of p independent variables 
x a \ x {%) > ... x (v) . In this case a closed, or an open, connex domain of p 
dimensions takes the place of a closed, or an open, interval. A variable 
point (x (1) , x {2) , ... x {p) ), of such a domain, is often conveniently repre¬ 
sented symbolically by a single variable x. 

The term “ variable” has been commonly associated with the con¬ 
ception of a point moving in a straight line or in a curve. It has however 
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been pointed out in the course of the discussions of the continuum, con¬ 
tained in the earlier Chapters, that the continuum cannot legitimately be 
regarded as a synthetic construction formed by a set of points determined 
successively. Successive determination is applicable only in the case of 
any enumerable sequence which may be defined within the continuum, 
and such a sequence may represent a succession of positions of a point 
moving in a straight line. It is however unnecessary to proceed to a 
detailed analysis of the conception of motion, because the Theory of 
Functions has no need of the conception of temporal succession. The 
theory makes continual use of simply infinite sequences determined in the 
continuum; and any such sequence may be regarded as a series of distinct 
determinations of the variable in which the elements are in logical suc¬ 
cession, each element after the first being preceded and succeeded by 
definite elements. 


THE FUNCTIONAL RELATION 

206. If, to each point of the domain of the independent variable x , 
there be made to correspond in any manner a definite number, so that all 
such numbers form a new aggregate which can be regarded as the domain, 
or field, of a new variable y , this variable y is said to be a (single-valued) 
function of x. The variables x } y are called the independent and the de¬ 
pendent variable respectively; and the functional relation between these 
variables may be denoted symbolically by the equation y = f (x). In this 
definition no restriction is made a priori as regards the mode in which, 
corresponding to each value of x , the value of y is assigned; and the 
conception of the functional relation contains nothing more than the notion 
of determinate correspondence in its abstract form, free from any impli¬ 
cation as to the mode of specification of such correspondence. In any 
particular case, however, the special functional relation must be assigned 
by means of a set of prescribed rules or specifications, which may be of 
any kind that shall suffice for the determination of the value of y corre¬ 
sponding to each value of x. Such rules may, in any particular case, be 
embodied in a single arithmetic formula, from which the value of y 
corresponding to each value of x is arithmetically determinable; or the 
rules may be expressed by a set of arithmetic formulae each one of which 
applies to a part of the domain of the independent variable. In case these 
formulae be reducible to a set of mutually independent formulae, that set 
must be a finite one. In case the function be defined by an enumerably 
infinite set of formulae, each applicable to a part of the domain, these 
formulae cannot be mutually independent, but must be subject to some 
norm. 

It should be observed that when, for any particular value of x , the 
corresponding value of y is given by means of any arithmetic formula, the 



Functional Relation 


273 


205, 206] 

numerical value of y is in general only formally determinate; for only a 
finite number of elements of a convergent aggregate which defines the value 
of y can in general be actually found, and thus the value of y can be 
specified only to any required degree of approximation, but it is still 
regarded as perfectly determinate. 

The domain of x consisting of a set (P) of points, the values of y, in 
the case of a given functional relation y ----- f ( x ), may be represented by 
points Q on a linear interval; all such points forming a linear set (Q). The 
set (Q) is said to be the functional image of the set (P), determined by 
the function / (x) ; to each point of (P) there corresponds a single point 
of ( Q ), if / ( x) be a single-valued function; but to each point of (Q) there 
may correspond a finite, or an infinite, number of points of (P). 

If x be employed to denote a variable point P (x n) , x {2i > ... x (p) ) in a 
/^-dimensional field, all that has been said will apply to the case of a 
function y t of p independent variables. The linear set (Q) will be the 
functional image of the p -dimensional set (P). 

The perfectly general definition of a function which has been given 
above is the culmination of a process of evolution which has proceeded 
largely in connection with the study of the representation of functions by 
means of trigonometrical series. By the older mathematicians, a function 
was understood to mean a single formula, at first usually only a power of 
the variable; but afterwards it was regarded as defined by any one 
analytical expression, and was extended by Euler to include the case in 
which the function is given implicitly by a formal relation between the 
two variables. In connection with the problem of the determination of 
the forms of vibrating strings, which led to the discussion of functions 
represented by trigonometrical series, the conception arose of a single 
function defined in different intervals by means of different analytical ex¬ 
pressions. The arbitrary nature of a function given by a graph was dis¬ 
tinctly recognized by Fourier; thus the notion of a function was emanci¬ 
pated from the restriction that an a priori representation of it is necessary 
by a single formula. 

The idea that a function can be defined completely, in the case when 
the domain of the independent variable is a finite continuous interval, by 
means of a graph, arbitrarily drawn, leaves out of account the essentially 
unarithmetic nature of geometrical intuition. A curve that is drawn is 
indistinguishable by the perception from a sufficiently great number of 
discrete points; and thus all that is really given by an arbitrarily drawn 
graph consists of more or less arithmetically inexact values of the ordinates 
at those points of the #-axis at which we are able to measure ordinates. 
In order that a curve may be really known, sufficiently to serve for the 
purpose of defining a function, a series of rules must be prescribed, by 
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means of which the values of the ordinates can be formally determined 
at all points of the x-axis. It is sometimes said, in order to illustrate the 
generality of the functional relation, that a function is definable in the 
form of a table which specifies values of y corresponding to values of x, 
this table being of a perfectly arbitrary character. The inadequacy of 
such illustration is manifest, if we consider that, even if the table were 
an endless one, as has been remarked in § 187, no aggregate of //-values 
can be defined by an endless set of numbers, apart from the production 
of a norm by which those numbers are defined. Moreover, even if the 
table were subject to a definite norm, it could only theoretically suffice to 
define a function of a variable whose domain consisted of an enumerable 
set of points, and would be totally inapplicable to the case in which the 
variable has a continuous domain, unless some special restrictive assump¬ 
tions as to the nature of the function be introduced, by means of which 
the values of the function are made determinate at the remaining points 
of the continuous domain. 

It thus appears that an adequate definition of a function for a con¬ 
tinuous interval (a, b) must take the form first given to it by Dirichlet*, 
viz. that y is a single-valued function of the variable :r, in the continuous 
interval (a, b) K when a definite value of y corresponds to each value of x such 
that a 5 x < 6, no matter in what form this correspondence is specified. 
A particular function is actually defined when y is arithmetically defined 
for each value of x. 

The theory relating to the properties of functions which retain their 
complete generality, in accordance with the abstract definition given 
above, has been, at present, but little elaborated, since comparatively few 
deductions of importance can be made from that definition which will be 
valid for all functions. When, however, the nature of a function is in some 
way restricted, either in the whole domain, or in the neighbourhoods of 
special points of that domain, there is room for the development of a 
detailed theory which shall deal with the peculiarities that follow from 
such restriction upon the complete generality of functions. 

207 . The functions defined in accordance with the above definition 
are known as single-valued functions, since, to each value of x in the domain 
of x, there corresponds a single value of y. The definition may be so 
generalized as to be applicable to multiple-valued functions. This is done 
by replacing the requirement that, to each value of x in the domain of x, 
there shall correspond a single value of y, by the more general statement 
that, to each value of x there shall correspond a definite aggregate of 
values of y. The aggregate of values of y may, for any particular value 
of x f consist of a finite, or of an infinite, set of numbers. A particular 

* See Dirichlet’s Werke , vol. i, p. 135. 
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function is then defined when the aggregate of values of y is arithmetically 
determinate for each value of x, in accordance with the criteria for the 
determinancy of a linear aggregate which have been developed in the 
theory of aggregates. Although the Theory of Functions, as developed 
in the present work, is mainly concerned with single-valued functions, it 
is necessary, or at least convenient, in the course of the examination of 
particular functions and classes of functions, to make use of auxiliary 
functions which are multiple-valued at certain points of the domain of 
the independent variable. Moreover, Dirichlet’s definition, in its original 
form, has the inconvenience that it excludes from the category of functions 
those represented by analytical expressions which, for particular values 
of the independent variable, cease to define a single number. For example, 
an infinite series which, for particular values of the variable, either diverges, 
or ceases to converge to a single definite limit, does not define a single¬ 
valued function in accordance with Dirichlet’s definition, for the whole 
domain of the variable, and yet it is convenient so to extend the meaning 
of the term “function *' that a function may be defined for the whole domain 
by such a series. 

The distinction has been considered in detail by Broden* between those 
functions for which the relation between the dependent variable y and the 
independent variable x is formally the same for the whole domain of x, 
and those functions for which the domain of x is divisible into a plurality 
of parts, for which the forms of the relation between x and y are different. 
He remarks that the distinction is one relating to the character of the 
definitions rather than to the nature of the functions themselves; in the 
former case the function is said to be homonomically defined, and, in the 
latter case, to be heteronomically defined. Broden has given a formal proof 
that, when a function is heteronomically defined, the number of parts 
into which the domain of x can be divided, so that the relations of y to x 
in any one part are completely independent of the relations in the other 
parts, must be finite. 

The Theory of Functions of a Real Variable is concerned with the 
classification of functions, according as they possess various special pro¬ 
perties, e.g . continuity, differentiability, integrability, throughout the 
domain of the independent variable, or at, or near, special points which 
form part of that domain. The theory requires the introduction of^precise 
arithmetical definitions of the scope and meaning of these characteristic 
properties, and is largely concerned with the determination of criteria 
which shall suffice to decide, in the case of a function defined in some 
special manner, what can be inferred as regards the possession by such 
function of properties other than those that are immediately apparent 

* Acta Univ . Lund . vol. xxxm (1897), “Functionentheoretische Bemerkungen und Satze.” 
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from the definition itself. Much of the theory is concerned with a minute 
examination of functions, and of classes of functions, which possess pro¬ 
perties that do not occur in the case of those functions which are employed 
in ordinary analysis and in its applications to Geometry and Physics; 
and the theory has in consequence frequently been described as the 
Pathology of Functions. It appears however, from the theory itself, that 
many of those peculiarities, which from the point of view of traditional 
Analysis would be described as exceptional, have no claim to be so 
described; that in fact it is in the functions of ordinary Analysis that the 
abnormalities really occur, such functions occupying an exceptional posi¬ 
tion in relation to a scientific analysis of the properties of functions in 
general. An important result of the labours of those who .have developed 
the modern theory of functions of a real variable has been that restrictive 
assumptions, which had previously been unconsciously made in the pro¬ 
cesses of ordinary Analysis, have been placed in a clear light; and it has 
been shewn that modes of reasoning which had their origin in an uncritical 
application of ideas obtained from spatial intuition would fail to yield 
correct results when applied to cases of unrestricted generality; the un¬ 
soundness of the logical basis of such reasoning being thereby demon¬ 
strated. 

In ordinary Analysis the domain of the independent variable is taken 
to be a limited, or unlimited, continuous interval. In the theory of 
functions, on the other hand, it has been found advantageous to consider 
also the properties of functions defined for a domain which is not a con¬ 
tinuous one. It appears, in particular, that a non-dense perfect set of 
points, or more generally any closed set, is well suited to be the domain 
of a function, inasmuch as, for such domains, the principal peculiarities 
of functions, such as continuity, differentiability, &c., are capable of 
precise formulation, and can serve for purposes of classification, exactly 
as in the case of functions defined for a continuous domain. Much of the 
recent progress in the subject is due to a recognition of the parity of all 
perfect sets of points, not only as regards their internal structure, but also 
in relation to their fitness for forming the domains in which functions can 
be defined, without loss of any of the characteristic properties that serve 
for the classification of functions of a real variable, or of several such 
variables. 

EXAMPLES 

1. A function/(x) may be defined for the interval (0, 1) as follows: 

for -^x> /(x) = - x 2 , and for x = 0 , /(0) =1, 

n n +1 n 

n denoting any positive integer. In this case, the norm by which the function is defined 
is expressible by an enumerable set of formulae which are however not independent of 
one another. 
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2. A function may be defined as follows: 

for 1 £x > i, /(x) = for ~£x>~, /(*)=|; for * £x> i,/(x) = |, ..., 

11 a; 

and in general, for - g a: > -r, / (a:) — 

° n n + l 9 J w P n ’ 

where P n denotes the 7?th of the prime numbers 2, 3, 5, 7, .... If the function is to be 

defined at the point x =0, this may be done by assigning to / (0) any arbitrarily chosen 

value we please. It will be observed that the values ~ are in this case not representable 

* n 

by a single expression which involves n and x only, though they are definite single-valued 
functions of these. 


Any number x, of the interval (0, 1), except 0, can be uniquely expressed in the form 


6 , b 9 u n 

— + ” -f 

2 ^2 n 


where b n has, for every value of n, one of the values 0, 1, and it is stipulated that all the b n 
are not to be zero from and after any fixed value of n . 

1 

A multiple-valued function* may be defined by y—x n > where n has all positive integral 
values for which b n = 1. This is a homonomic definition, although no analytical expression 
of a unitary character can be given for the representation of y. 


FUNCTIONS OF A VARIABLE AGGREGATE 

208. The conception of the functional relation, as it has been described 
above, has been restricted to that of a determinate correlation of the points 
of a domain, linear, plane, spatial, or p-dimensional, forming the domain 
of the independent variable, with a set of numbers, or points, forming the 
domain of the dependent variable. The modern development of the notion 
of the functional relation has extended it to the more general case in which 
a prescribed family of objects of any specified kind takes the place of the 
field of the independent variable. If {0} denotes a family, or aggregate, 
of objects defined in accordance with some norm, and if we have also a 
norm by means of which a definite number is made to correspond to each 
member 0, of the family, a variable y , identifiable with each such number, 
is regarded as a function of the objects 0 of the given family. The ordinary 
definition considered above of a function of a variable .r, or of a number p 
of variables, is the particular case of the more general conception of a 
function, which arises when the objects 0 are single numbers, or points, 
belonging to a prescribed linear, or ^-dimensional, domain. Another case 
of the functional relation which has recently become of considerable im¬ 
portance in Analysis is that in which each of the objects 0 consists of an 
enumerable set of numbers x {1 \ x {2) , ... x {n) ,_ Thus the functions con¬ 

sidered may be regarded as functions of a point in space of an infinite 
number of dimensions; the field of the independent variable being any 
specified domain in such space. The particular case in which the numbers 

* Brod^n, loc. cit. p. 4. 
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x {1) , x {2) , ... # (n) , ... are so restricted that the sum of their squares forms 
a convergent series, in which case the domain of the point is said to he 
in Hilbertian space, has recently been applied in connection with the 
theory of integral equations. A theory of functions in Hilbertian space 
can be developed in this connection. 

In arithmetical analysis the most general conception of a function is 
that in which the independent variable has as its field a family of sets of 
points, either linear, plane, spatial, or ^-dimensional. Having given, by 
means of an adequate definition, a family of sets of points, either discrete 
or continuous, a function of the sets is defined by a norm which assigns a 
definite number to each set of points of the given family. An important 
case is that in which the sets of points are all measurable; the measure of 
a set, as defined in accordance with the theory developed in Chapter in, 
is then a function of the variable set in a given family of such measurable 
sets. 

In connection with the modern theory of integration, more general 
functions of measurable sets of points will emerge. The theory of functions 
of curves of prescribed families, developed by Volterra and others, comes 
under this head. 

The Calculus of Variations deals with the theory of maxima and 
minima of functions of continuous .sets of points; the functions taking 
the analytical form of integrals which determine a set of numbers corre¬ 
sponding to the continuous sets of points belonging to prescribed families. 

In the present work the ordinary case of functions of one variable, or 
of a finite number of variables, will, for the most part, be the only one 
dealt with, but incidentally, as in the case of the theory of integration, 
functions of sets of points will be considered. 

The present Chapter is concerned, in the first instance, with the pro¬ 
perties of functions of a single variable, of which the domain is an interval, 
or other linear set of points. The case of functions of two or more variables, 
i.e. of points belonging to a domain in plane, or higher dimensional, space, 
will also be considered; especially in those respects in which the properties 
of such functions are not an immediate extension of the properties of 
functions of a single variable. 

THE UPPER AND LOWER BOUNDARIES AND LIMITS OF FUNCTIONS 

209 . A function y =■ / (x), being defined for the domain of x, we have 
seen that the values of y form a set of points, determined as usual upon 
a linear interval, which is called the functional image of that set of points 
which forms the domain of x. In case the set of points, which represent 
the values of y, is a bounded set, the function / (x) is said to be bounded 
in the domain of x. 
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When the set of values of y is bounded, either boundary may be a 
limiting point, or only an extreme point, of the set. For convenience, and 
in accordance with usage, the terms “ upper limit” and “ lower limit ” will be 
applied to denote the upper and lower boundaries of the derivative of 
this set. Thus the upper and lower limits are the extreme limiting points 
of y in the domain of x. Accordingly we may say that: 

If the set of points y, which represents the functioned image of a function 
f (x), defined for a given domain of x, have an upper and a lower boundary , 
then the function f (x) is said to be a bounded function , and the boundaries 
are said to be the upper and lower boundaries of f (x) in the domain of x. 
The upper and lower limits of the function are the upper and, lower boundaries 
of the. limiting points of the set of points y. 

The upper limit of a function / (x) for its given domain may, or may 
not, be attained , i.e. there may, or may not, be a value of x , in the domain 
of x , for which the functional value is equal to the upper, or to the lower, 
limit. This is the case whether, or not, the upper, or the lower, limit is 
identical with the upper, or the lower, boundary. An upper, or a lower, 
boundary that is not identical with the upper, or the lower, limit, must- 
be attained. 

In case y have no upper boundary, or no lower boundary, for the 
domain of x f the function f (x) is said to be an unbounded function. In 
this case there exist values of the function, of one sign, or of both signs, 
which are numerically greater than any arbitrarily assigned number A. 

When y has no upper boundary in the domain of x, the function is 
said to have the improper boundary + oo, in the domain of x. Similarly, 
when y has no lower boundary, it is said to have the improper boundary 
— oo. It is frequently said, for the sake of brevity, that the upper or the 
lower boundary of the function is infinite. 

The ex,cess of the upper boundary of a function , in its domain , over its 
lower boundary , is called the fluctuation (Schwankung) of the function in 
the domain. 

In case the upper or the lower boundary is infinite, the function is said 
to have an infinite fluctuation in its domain. 

Instead of the whole of the domain of x , we may consider that part 
which lies in a given interval (a, b), closed or open, and the preceding 
definitions may be applied to this portion of the domain; thus: 

The upper boundary of a function f (x) in an interval (a, b), closed or 
open , is the upper boundary of the function when only those points of the 
domain of x which lie in the interval (a, b) are taken into account . A similar 
definition applies to the case of the upper limit, and also to the cases of 
the lower boundary and the lower limit. 
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The excess of the upper boundary of f (x), in the closed , or open , interval 
(a, b), over its lower boundary in that interval , is called the fluctuation of 
f (x) in the closed , or open , interval (a, 6). 

In case one, or both, of the boundaries is infinite, the fluctuation of 
the function in (a, b) is said to be infinite. 

If the upper boundary of / (x) in (a, b) is attained, i.e. if there exists 
a value c of x such that / (c) is the upper boundary, where c is a point of 
the domain in (a, b), then this upper boundary is said to be the upper 
extreme of the function in (a, b) ; and a similar definition applies to the 
lower extreme . 

If the end-points a, b of the interval be left out of account, in case 
they belong to the domain of x , the fluctuation is called the fluctuation 
in the open interval (a, b). This is sometimes spoken of as the inner 
fluctuation of the function in (a, b), and is determinable as the limit of the 
fluctuation in the interval (a -} e, b — e), when € is indefinitely diminished. 

Precisely similar definitions are applicable to the case of a function 
of p variables; a cell (a, b) taking the place of an interval (a, b). 

210. In accordance with the definition which has been given for a 
function in any domain, the value of the function at any particular point 
of the domain has a definite finite value. It may happen that a point P, 
of the domain of x, is such that, in any arbitrarily small neighbourhood 
of P, either the upper, or the lower, boundary of the function, or both, 
may not exist; so that, in any neighbourhood of P however small, there 
exist functional values numerically greater than any number that may be 
assigned. In that case, the point P is said to be an infinity , or point of 
infinite discontinuity of the function ; although the function has a definite 
finite value at the point P itself. 

Although / (#) is not properly defined at a point P (x 0 ) y unless a definite 
numerical value be assigned to f (x 0 ), nevertheless an improper definition 
of the functional value at the point P is sometimes admitted, of the form 

. j = 0; in this case the function is said to possess an infinity at P. 
J (*o) 

This infinite discontinuity is said to be removable , provided that, when the 
functional value at P is altered to some finite value, the function have 
finite upper and lower boundaries in a sufficiently small neighbourhood 
ofP. 

There are other cases in which an improper definition of the functional 
value at a point x^ of the domain of # is admitted. The function may be 
defined by means of an infinite series, of which the terms are given func¬ 
tions of x . This series may diverge at the particular point x 0 ; but it is 
nevertheless frequently convenient to regard the series as defining the 



209-2ii] Upper and Lower Boundaries 281 

function for all values of x in some interval which includes x Q . The 
functional value at x 0 is then regarded as infinite, + oo, or — oo. 

In accordance with strict arithmetic theory, the function is regarded 
as undefined at points where no definite finite value of the function is 
specified. For the most part, in the theory which will be developed here, 
this restriction will be rigorously adhered to. It will be found, however, 
that in cases, such as in the theory of infinite series, in which it is con¬ 
venient to admit improper definitions of functions at particular points, 
no essential change in the main results of the theory will be necessary. 

In some cases it will be found convenient to remove the restriction 
that at each point of the domain of the independent variable the function 
shall be single-valued, and to define the function in such a manner that, 
at single points, or at each point of some set belonging to the domain of x , 
the function may possess finite, or infinite, multiplicity. It will bo found, 
in the cases in which it is convenient to make this extension of the meaning 
of a function, that no difficulty arises as regards the use of results primarily 
applicable to functions which are single-valued at all points of the domain 
of the variable, without exception. 

THE CONTINUITY OF FUNCTIONS 

211. Let the domain of the independent variable x be continuous, 
and either bounded or unbounded; and denote the function y , at the 
point x, by f (x). When the domain of x consists of the points of a con¬ 
tinuous interval (a, 6), that interval will be taken as closed unless the 
contrary is stated. 

The function f (x) is said to be continuous at the point a of the domain 
of x, if , corresponding to any arbitrarily chosen positive number e whatever , 
a positive number 8, dependent on e, exists , such that 

I f (a + V) -/(«) I < «> 

for all positive or negative values of rj which are numerically less than 8, and 
which are such that a + rj is in the domain of x. At an e fid-point of a limited 
domain , the values of p will have one sign only. 

In accordance with this definition, a neighbourhood (a — 8, a -f 8) of 
the point a exists, such that the function, at any point in the interior of 
this interval, differs numerically from its value at a, by less than e. It 
follows that the inner fluctuation of the function in (a — 8, a 4- 8) is less 
than 2c, and it is obvious that the fluctuation in any interval interior to 
(a - 8, a -f 8) is less than 2e. The condition of continuity of the function 
/ (x) at the point a may thus be stated to be that a neighbourhood of the 
point can be determined in which the fluctuation of the function is as small 
as we please. 
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The above definition of continuity at a point is that due to Cauchy, 
and is a particular case of the definition of continuity for a function of 
any number of variables. If we denote by f (x a) , x {2) , ... x (v) ) a function 
of the variables x {l) , x {2) , ... x ip) , defined for any continuous domain, the 
condition of continuity at the point (« (1) , « (2) , ... a (p) ) is that, corresponding 
to every arbitrarily chosen positive number e, a number 8 , dependent on e, 
can be found, such that 

!/(«<*> + h™, tt< 2) 4- h {2 \ ... «<*> + h'P>) — f (« (1) , « (2) , ... a' p) ) | < 6 , 
provided // (1) , h {2) , ... h [p) have any values which are numerically less 
than 8 . In this case, a neighbourhood (a — 8 , a \- 8 ), of a point of a linear 
domain, is replaced by an equilateral cell, which is a square in the case 
of a two-dimensional domain. 

The definition of continuity has been stated by Heine* in a form which 
depends upon the notion of a convergent sequence of numbers or of points. 
Let (P 1 ,P 2 , ... P n , ...) be a convergent sequence of points in the given 
domain, and of which P is t he limiting point. The condition of continuity 
of the function at P is that, for every such convergent aggregate which 
has P as limiting point, the numbers /(P,),/ (P 2 ), •••/ (P„), ... form a 
convergent sequence which represents the number/ (P). It is easily seen 
that a function w hich is continuous in accordance with Cauchy’s definition 
is also continuous in accordance with that of Heine. For if the sequence 
P t ,P 2 , ... P n , ... of points of the domain of x converges to P (a), we may 
define a sequence of numbers rj 1) rj 2 , ... rj ft , ... corresponding to these 
points, such that rj n ~ 0 , as n ~ oo, and such that PP n < rj n ; it then follows 
by Cauchy’s definition that, for a sufficiently large value of n, 

l / ( x ) - f («> i < «■. 

for all the values of x which correspond to the points P n , P w+1 ,_ Since 

this holds for all values of €, w r e see that the sequence {/(P w )} converges 
to /(«). 

That a function / (x), which is continuous at the point a, in accordance 
wdth Heine s definition, is also continuous in accordance with that of 
Cauchy, cannotf be proved in the general case without the employment 
of Zermelo’s axiom (see § 197). To establish this equivalence, with the 
assumption of the axiom, let H (t) ( 17 ) denote the set of points within the 
interval (a — 7 ], a -f rj) for which | / (#) - f (a) | <- e. Assuming that there 
exists a set of points H {t) (rj), of the domain of x , with e fixed, however 
small rj may be, (which will be the case if Cauchy’s definition is not 
satisfied), we may consider a monotone sequence {rj n } of values of rj con¬ 
verging to zero. It is impossible that the sets H (t) (rj n ) should be all 
identical, from and after a fixed value of n , for the intervals (a — rj n , a 4 - rj n ) 

* Crelle’s Journal , vol. lxxiv (1872), p. 182. 
t See Sierpiriski, Compter Rendu .9, Paris, vol. olxitt (1916), p. 688. 
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have no points common to an infinite number of them, except the point a 
which does not belong to the sets. We may consequently, without loss 
of generality, assume that the sets II ie) (rj n ) are all different from one 
another. Considering the sets 

//«> ( Vl ) - //«> (rj 2 ) y //«> ( V2 ) - ( V3 ), ... //«> (77J - fo n4l ), ..., 

a set of points p 2 , ... p n , ... exists in accordance with the axiom, such 
that j) n belongs to H M (77J — 7 / (e) (r} n ^), for every value of n. We have 
now |/(/>„) - f (a) | > 6, for every value of n\ and thus a sequence {p„\ 
of points converging to a exists, such that {f ('p n )} does not converge to 
/(a), and hence Heine’s definition of continuity at the point a is not 
satisfied. It follows that, if Heine’s definition is satisfied, the set H ic) (7/) 
cannot exist for all values of 77; and thus that for a sufficiently small value 
of 17, Cauchy’s condition | f (x) — f (a) | < € is satisfied if | x — a | < 77, 
for all points x , of the domain of x. 

A function which is not continuous at a point a may satisfy the con¬ 
dition that, in a neighbourhood of a, on the right, the fluctuation of the 
function is as small as we please when the neighbourhood is small enough; 
the function is then said to be continuous on the right at a. A similar 
definition applies to continuity on the left. 

A function is said to be continuous in the interval (a, b) if it satisfies the 
condition of continuity at every point in the interval. 

For a function that is continuous in the closed interval (a, b), the upper 
boundary is identical with the upper limit, and the lower boundary is 
identical with the lower limit of the function in the closed interval. This 
does not necessarily hold for an open interval. 

212. The domain of the independent variable has hitherto been con¬ 
sidered to be continuous; it is however clear from a consideration of the 
definition of continuity, either in Cauchy’s or in Heine's form, that the 
definition is applicable in case the domain of the independent variable is 
not continuous, but consists of any set of points which contains limiting 
points that belong to the set. It is, of course, only at such a limiting 
point that the question of continuity arises; for, at an isolated point of 
the aggregate, there are no values of the function, other than that at the 
point itself, in any sufficiently small neighbourhood of the point. If P 
be a point of the domain of x which is a limiting point of the domain, the 
function is continuous at P when Cauchy’s definition of continuity is 
satisfied, those points only, in any neighbourhood of P, being taken into 
account, which belong to the domain for which the function is defined. 
If the function be continuous at every limiting point of the domain of x 
it is said to be continuous relatively to the given domain; and thus the 
notion of continuity of a function is applicable whatever be the domain 
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of the independent variable, except when it consists of an isolated set of 
points. 

Let Pi, P2, Pg, ... be a convergent sequence of points of the domain 
of x t of which P w is the limiting point; and let P w also belong to the 
domain of x. Supposing the functional image, corresponding to / (x), to 
contain the points ft, ft, ft, ... which correspond to P x , P 2f P 3 , ..., let 
ft, ft, ft, ••• form a convergent sequence of which the limiting point Q « 
corresponds to P w . If this condition be satisfied, however the convergent 
sequence be chosen in the domain of x, the aggregate (4?), of values of y , 
is said to be a continuous functional image of the domain (P) of x. 

It is clear that the continuous functional image of a closed domain is 
itself closed. For there corresponds to the points of a convergent sequence 
(ft, ft, ft, •*•)> in (ft, an aggregate (P lf P 2 , P 3 , ...), in (P), which must 
have at least one limiting point, and all such limiting points belong to 
the domain (P), and must correspond to the limiting point of (ft, ft , ft , • • • )> 
which therefore belongs to the aggregate (ft. Moreover if (P) be perfect, 
the continuous functional image ( Q) is perfect also; for, corresponding to 
any particular point ft, of (ft, we may take a point P', of (P), for which 
ft is the image. P' is the limiting point of a convergent sequence of points 
of (P), and to this convergent sequence there corresponds a convergent 
sequence in ( Q ), of which ft is the limiting point. It has thus been shewn 
that ( Q ) contains no isolated points, and therefore (Q) is perfect. 

If ( Q) be a continuous functional image of the closed set (P), and if 
only one point of (P) correspond to each point of ( Q ), then (P) is a con¬ 
tinuous functional image of ( Q ). 

To the points of any convergent sequence (ft, Q 2 > •••)> (Q)> which 

ft is the limiting point, there corresponds a convergent sequence 

(^i > -P* > P '.3 > • • •) > 

in (P), of which P w is the limiting point; and P w is the functional image 
of ft. 

213 . The theorem has been given by Weierstrass that, if (a, b) be any 
interval containing points of the domain of a function , then one point at least 
exists in the interval , which is such that, in any arbitrarily small neighbour¬ 
hood of that point , the upper boundary of the function is the same as the upper 
boundary of the function in the whole interval (a, b). 

This theorem holds for all functions without restriction, and it makes 
no difference whether the whole interval (a, b), or only a set of points in 
that interval, belongs to the domain of the independent variable. 

Let a system {D n } of nets with half-closed meshes be fitted on to the 
interval (a, 6). If M be the upper boundary of the function in that part 
of the domain of the independent variable contained in the closed interval 
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(a, b ), it is clear that in none of the meshes d l , of 1 \ , can the upper boun¬ 
dary of the function be greater than M , and that in one at least of these 
meshes the upper boundary of the function must be M. Take the mesh 
dj of lowest rank in T) Y for which this is the case. In one at least of the 
meshes of I) 2 that are contained in d x , the upper boundary of the function 
must be M ; if there are more meshes than one that satisfy this condition, 
take that one which is of lower rank than the others, and let this 
mesh be d 2 . Proceeding in this manner, we obtain a sequence of meshes 
d x ,d 2 , ... d n , ... each containing the next, and such that in each of them 
the upper boundary of the function for all the parts of the domain in 
the mesh is M. The single point P defined by this sequence of meshes 
satisfies the prescribed condition; for any neighbourhood A, of P, will 
contain all the meshes d n , from and after some value of n ; therefore in 
this neighbourhood the upper boundary of the function is M. 

A similar result holds for the lower boundary of a function. 

It is clear that this proof can be applied, in the case of functions of 
more variables than one, to prove the corresponding theorem that one 
point exists in a domain such that in its arbitrarily small neighbourhood 
the upper boundary of such a function is the same as in a cell. 

In the case of a function which is continuous in the interval (a, b), it 
follows from the foregoing theorem that the upper and lower boundaries 
of the function in (a, b) are both finite, and thus that a function which is 
continuous in an interval is bounded in that interval. 

For consider that point x l , in (a, 6), in the arbitrarily small neigh¬ 
bourhood of which, (x t — e, x x -f e), the upper boundary has the same value 
as for the whole interval (a, b). Since the function is continuous at x lf 
corresponding to a given number S, a number e can be determined such 
that | / (x) - f (*i) | < 8, provided x lies in (x x — e, x x + c); consequently 
the upper boundary of / (x) in this interval must be finite, and hence / (x ). 
has a finite upper boundary in (a, b). It may be shewn in a similar manner 
that the function has a finite lower boundary. 

A function which is continuous in the closed interval (a, b) is such that 
its upper limit and its lower limit are each actually attained at one point at 
least in the interval. 

For suppose, if possible, that / (jej) has a value A , different from M ; 
and consider an arbitrarily small interval (x 1 — e, x ± + c) for which M is 
the upper limit of the values of the function; then points exist in this 
interval for which the function differs by less than an arbitrarily small 
number 8, from M . These values of the function would differ from / (a^) 
by an amount which is not arbitrarily small, and this would be incon¬ 
sistent with the condition of continuity of the function at the point x x . 
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It follows that we must have / (a^) = M . Similarly it may be shewn that 
the lower limit m is reached at least once in the interval (a, b). 

A function that is continuous in an open interval (a, b) may have no 
upper, or no lower, boundary. 

CONTINUOUS FUNCTIONS DEFINED FOR A CONTINUOUS INTERVAL 
214 . It will now be shewn that, iff (x) be continuous in the closed interval 
(a, b), and if f (a), f (b) have opposite signs, then there is at least one value 
of x in the interval , for which f (x) vanishes . 

Let us suppose that f (a) < 0, f (b) > 0; and let a system of nets be 
fitted on to the interval ( a, b ). Since /(/>)>(), in a sufficiently small 
neighbourhood (b — e, b) of the point b,f(x) must be everywhere positive. 
If n be sufficiently large, one or more meshes of D n are interior to (b — e, b), 
and in these meshes / (x) is positive. There exists consequently a mesh d n 
of lowest rank in D n , for which/ (#) is negative at the left hand end-point, 
and such that/ (x) is positive at the right hand end-point; unless/ (^) — 0 
at the left hand end-point, in which case a point such as the theorem 
requires is determined; we assume that this latter case does not arise. 
There is then among those meshes of D nn which are contained in d n , one 
of lowest rank, such that f (x) is negative, or zero, at its left hand end¬ 
point, and is positive at its right hand end-point. Proceeding in this 
manner, we obtain a sequence d n ,d n+1 , ... d n+m , ... of meshes, each of 
which contains the next, and such that, either, for some value of m, 
f (x) is zero at the left hand end-point of d n+m , or else, such that, for the 
unending sequence of values of m, f (x) is negative at the left hand end¬ 
point of d n+m , and is positive at the right hand end-point. In this latter 
case, there exists a point c, in all the meshes {d n+m }. If / (c) were negative, 
in a sufficiently small closed neighbourhood of c, / (x) would be everywhere 
positive; and this is not possible, because d nArm would be interior to that 
neighbourhood, provided m were sufficiently large. In a precisely similar 
manner, it is seen that / (c) cannot be positive. Therefore f (c) = ()., and 
thus the theorem is established. 

From this theorem we can deduce that, whatever values f (a), f (b) may 
have , there must be in the interval (a, b) at least one value of x, for which 
f (#) has any prescribed value lying between f (a) and f (b). 

Let this value be C , and suppose / (a) < C <f (6); then the function 
f (x) — C is continuous in the given interval, is negative when x — a, and 
positive when x = 6; thus it vanishes at least once in the interval (a, b). 

A continuous function has frequently been defined as a function such 
that, if / (a), / (6) be its values at any two points a and 6, then the function 
passes through every value intermediate between / (a) and / ( b ), as x 
changes from a to 6. The property contained in this definition has been 



Continuous Functions 


287 


213-21 r>] 

shewn above to hold of every function which is continuous in accordance 
with Cauchy’s definition; but the converse theorem does not, in general, 
hold. The definition just referred to is accordingly not equivalent to that 
of Cauchy, which is here adopted as the basis of the treatment of con¬ 
tinuous functions. As an example of the non-equivalence of the two 

definitions, we may consider the function defined by y ----- sin *, for x *5 0, 

x 

and by y 0, for x 0. For this function there are values of x between 
a (< 0) and b (> 0) for which f (x) has any assigned values c lying between 
f (a) and/(6); but the function is not continuous, in accordance with 
Caucby'.s definition, in any interval (a, b) which contains the point 0. It 
is, in fact, easily seen that the point 0 is a point of discontinuity of the 
function; for an arbitrarily small neighbourhood of the point 0 contains 
points at which the function has all values in the interval (-- 1, 1). 

As another example* of a function which satisfies the condition re¬ 
ferred to, but is discontinuous in accordance with Cauchy’s definition, let 
the number x in the interval (0, 1) be expressed as a decimal 

■<h<h { h • 

then consider the decimal •« 1 a 3 (/ 5 a 7 — If this last decimal is not periodic, 
we take / (x) ^ 0; if it is periodic, and the first period commences at a 2n . j, 
we take / (x) — -a 2n a 2rH2 a 2n+i — The function so defined for the interval 
(0, 1), of x , has every value between 0 and 1, in every arbitrarily small 
interval in the domain of x\ thus the function is discontinuous at every 
point. A value of x for which/ (.r) has any prescribed value -p 1 2 > 2 ... p n ... in 
■a^Aa^B ... ^'a 2 „_,^ 1 a 2w p 2 a 1! „ +2 

where * a x a 3 ... is any periodic decimal, the first period of which begins at 
a 2n _j, and A, B, ... K are arbitrarily chosen digits. Nevertheless there are 
values of or, between a and ft, at which the function takes any assigned 
value intermediate between / (a) and / (/?). 

CONTINUOUS FUNCTIONS DEFINED AT POINTS OF A SET 

215 . It will now be shewn that, if a function f (x), having prescribed 
values at each point of an infinite set of points in the interval (a, b), be con¬ 
tinuous in that interval , then the values of the function are determinate at each 
point of the derivative of the set. 

Suppose tq, a 2 , a 3l ... a n , ... to be a convergent sequence of points, for 
which x 1 is the limiting point, and suppose/ (cq),/ (a 2 ), .../(a n ), ... to be 
known; it will be shewn that these functional values form a convergent 
sequence whose limit is / (a^). An interval (x t — 8, x x -f 8) can always be 
found, corresponding to any fixed number €, such that the function at 

* See Lebesgue, Lemons sur V intdgration, p. 90. 
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any point of this interval differs from / (x x ) by less than the arbitrarily 
small number e; this follows from the continuity of the function. A number 
n can be found such that all the points a n) a n+1 , a w42 , ... lie within the 
interval (x l — 8, x 1 -f 8). It follows that 

I / (*i) - / (««) | and | / (xj - f (a n+ .,) |, 

&c. are all less than c, which is arbitrarily small; hence/ (a^) is the limit 
of the sequence / (af) y f (a 2 ), ..., and thus f (x x ) is determinate. From this 
special case it follows that, for all the limiting points of a given set of 
points in (a, b), the values of the continuous function are determinate. 
It thus appears that the function is determinate for all points which belong 
to the derivative of the given set, for the points of which the values of 
the function are known. 

In particular, if a continuous function have prescribed values for points 
of a set which is everywhere dense throughout the interval (a, b ), then its 
values are determinate for all points of the interval. 

A special case of such a set would be that of the rational points within 
the interval. It follows that a continuous function whose values are 
known for all the rational points in an interval is determinate for all the 
irrational points. A continuous function which is known to be constant 
for all the rational points has the same constant value for all the irrational 
points in the interval. 

It is clear that, if cells be employed, instead of intervals, the method 
in the proof is applicable to shew that a continuous function of any 
number of variables is determinate at each point of the derivative of a set 
of points at which the values of the continuous function are assigned. 

A generalization of the above theorem is that, if a function is con¬ 
tinuous with reference to a domain which consists of a set (P), and is 
known for all points of a sub-set which is everywhere dense in (P), then 
the function is determinate for every point of (P). This may be seen by 
remembering that every point of (P) is a limiting point of the sub-set, 
and applying the same reasoning as before. 

216 . From the theorem established above, that a continuous function 
is determinate when its values at an everywhere dense enumerable set of 
points are prescribed, we may deduce that the cardinal number of the aggre¬ 
gate of all continuous functions of a real variable is the cardinal number c, 
of the continuum. 

We may suppose the values of a function to be prescribed at the 
rational points. The cardinal number of the aggregate of all functions 
defined for the rational points only is the cardinal number of the ways of 
distributing on the aggregate of rational numbers the aggregate of numbers 
of the continuum. This number is c*, which has been shewn in § 183 to 
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be equal to c. Only some of the distributions of this kind are such as will 
give rise to continuous functions; hence the aggregate of all continuous 
functions is a part of the aggregate of all possible distributions on the set 
of rational numbers of the numbers of the continuum. It follows that the 
cardinal number of the aggregate of all continuous functions is s c. Again, 
this cardinal number is '<s c; for among the continuous functions are those 
which are constant, and everywhere equal to any assigned number of the 
continuum; and thus the aggregate of all continuous functions contains 
a part which has the cardinal number c. Since the cardinal number is 
£ c, and also s c, it is equal to c. 

This theorem was established by Borel*, who also shewed that the 
aggregate of all analytical functions of two or more variables has the 
cardinal number c. 

The cardinal number of the aggregate of all functions of a real variable 
is that of all distributions of the continuum upon itself; this is, in accord¬ 
ance with the definition in § 150 , denoted by c c , for which we may write/. 

Each particular distribution of the numbers of the continuum on them¬ 
selves is definable by a definite norm, and corresponding to each such 
distribution there is a definite function for a continuous domain. Let the 
aggregate of all such functions be denoted by F: it will then be proved 
that the cardinal number/, of F, is > c. 

First, F has a part which is equivalent to the continuum. This is at 
once seen, since the functions / ( x) — c, where c is any number of the con¬ 
tinuum, constitute such a part. It follows that / £ c. 

Next, let it be assumed, if possible, that F is equivalent to a part of 
the continuum. As has just been proved, such a part cannot have a 
cardinal number < c; we therefore assume that F is equivalent to the set 
of numbers of the continuum. This amounts to the assumption that F 
can be ordered in the same type as the continuum, so that, to any assigned 
number £ of the continuum, there corresponds a definite set of rules Rt , 
which defines a function ft (#). The correspondence between $ and Rt must 
itself be defined by a set of rules, so that when £ is assigned, R iy and 
therefore the function ft ( x ), is defined. The aggregate {/ f (x)} must contain 
every definable function of a real variable. The number £ being assigned, 
ft ( x ) is producible; and its existence implies that, at any assigned point 
the functional value ft (£') can be determined arithmetically. We may 
take, for example, f; and thus, if £ is assigned, ft (£) is known. We 
may regard ft (f) as a function of $; for its value at any point £ can be 
arithmetically determined, and it is therefore an element of the aggregate 
F of all functions. With this understanding as to ft (£), choose a fixed 
number, say unity; then the function </> (£) = ft (f) + 1 has a definite 

* See Lemons sur la theorie des fonctions , p. 127. 
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norm; for we have only to add to the rules by which ft (f) is defined, the 
further rule, that, at each point unity is to be added to the value of 
ft (f). We have now a now definable function <f> (x) ; but this cannot possibly 
belong to the aggregate F, for if it do so belong, there must be some on© 
point of the continuum, with which it corresponds; but <f) {x) cannot be 
identical with j\ x ( x ), for </> (^) and ft , (f 3 ) differ by unity. Since </> (£) is 
not contained in F, contrary to the hypothesis, it follows that F cannot 
be equivalent to the continuum, and thus the theorem, /> c, is estab¬ 
lished. It has therefore been shewn that / > c , and consequently that— 
the aggregate of all functions of a real variable ha* a cardinal number /, 
greater than c. 

UNIFORM CONTINUITY 

217 . It will now be shewn that, if the domain of x be a continuous closed 
interval, then a continuous function is uniformly continuous through the 
domain of x. Ft will be proved that a number 8 can be found, corresponding 
to any given e, such that, for all values of x, the fluctuation of / (x) within 
the neighbourhood (x — 8 , x f 8 ), or for all of this neighbourhood which 
lies within the domain, is less than the number e. The essential point is 
that 8 is independent of x. 

Consider a symmetrical system {/)„}, of nets with closed meshes, fitted 
on to the interval (a, b). 

If € be a prescribed positive number, there must be some smallest 
integer n lf such that, in each of the meshes of the net l)„ lt the fluctuation 
of f (x) is <: \e. For suppose that no such integer n x exists; then, however 
large n may be taken, there is one mesh at least of D n in which the fluctua¬ 
tion of / (x) is ■ - |e. Let D n ' be the set of all the meshes of D n for which 
this is the caso; the sequence of sets of meshes D/, Df, ... D n ', ..., each 
of which contains the next, defines a closed set of points in all of them. 
Let P be such a point, and let a neighbourhood A', of P, be constructed, 
within which the fluctuation of the function is < |e. This neighbourhood 
A' will contain, in its interior, a mesh of D n \ from and after some fixed 
value of n, and in each of these the fluctuation of / (x) is ^ |e, and thus 
there is a contradiction in the assumption that such a point as P exists. 
Therefore D n ' cannot exist for all values of n, and thus, for some integer n x , 
the fluctuation in every net of D ni is < Je. 

If x be any point of the closed interval (a, 6 ), and the meshes of D nx 
be all of breadth 28 , the interval (x — 8 , x + £), or the part of it that is 
in (a, b), is in the interval formed by at most two consecutive meshes of 
D ni , and the inner fluctuation of / (x) in (x — 8, x -f 8 ) is consequently < e. 

It has thus been shewn* that it is unnecessary, for a function of a real 

* This theorem was first stated and proved by Heine; see CreUe's Journal,’ vol. lxxi (1870), 
p. 361, and vol. lxxiv (1872), p. 188. 
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variable, to draw a distinction, as has sometimes been done, between 
functions which are uniformly, and those which are non-uniformly, con¬ 
tinuous in the continuous domain of x; for all continuous functions are 
uniformly continuous. 

The theorem may also be stated in the following form: 

If f (x) be continuous in the closed interval ( a, b), then, corresponding to 
any arbitrarily chosen positive number e, a number 77 can be determined , such 
that the condition \f(x y ) — / (x 2 ) | < e, is satisfied, for any two points x l , x 2 , 
in (a, b), such that \ x l — x 2 | < 77. 

The following theorem can be immediately deduced: 

If a f unction be continuous in a finite closed interval, then the interval can be 
divided into a finite number of sub-intervals, in every one of which the fluctua¬ 
tion of the function is less than a prescribed positive number. 

It is in fact clear that, if e be the prescribed number, the condition is 
satisfied when the interval is subdivided in any manner such that the 
length of the greatest of the sub-intervals is < 77. 

Another proof of the above theorem, in an extended form, will be given 
in § 234 , by employing the Heine-Borel theorem. 

It is clear that the above proof applies also to the case in which the 
domain of x is not a continuum, but is any closed set; because the essential 
point of the proof depends upon the limiting points all belonging to the 
domain. Those meshes of the nets which contain no points of the domain 
of x are disregarded. For domains which are not closed the proof does 
not apply; thus a function which is continuous, relatively to an aggregate 
which is not closed, is not necessarily uniformly continuous. 

In the case of a function of p variables, when the domain of the 
variables is a closed set, the neighbourhood 

(x x — l, x 2 — S, ... x v — l\ x x 4 - h, x 2 -f 8 , ... + S), 

of the point (x l9 x 2 , ..., x v ), may be (§ 49 ) denoted by (x — S, x 4 - S). The 
condition of uniform continuity takes then the same form as in the case 
p — 1 . If we take the cell in which the domain is contained to have its 
edges all equal, and employ a system of nets in which the edges of each 
mesh are all equal, the above proof can be applied to shew that a function 
that is continuous in a domain that consists of a closed set of points is 
uniformly continuous. 

ABSOLUTE CONTINUITY 

218 . A function f (x), defined in the closed interval (a, b), is said to be 
absolutely continuous in (a, b) if, corresponding to an arbitrarily chosen 
positive number e, another positive number 77 can be so determined that . in 
every enumerable , or finite , set of non-overlapping intervals 

(*!,*,')» ■■■ (*«.*/). 
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in the interval (a, 6), and such that the total measure of the intervals is < rj, 
the sum y or limiting sum y 2 | / (x r ) — f (x/) | is < e. 

r 1 

It is clear that a function which is absolutely continuous in (a, b) is 
also continuous, in accordance with the definition of § 211 ; but the con¬ 
verse does not hold. We may take the set of intervals to consist of a single 
interval of length < 77; thus the condition of uniform continuity is satisfied. 

It is easily seen that an equivalent form of the definition is that, in 
every non-overlapping set of intervals of total measure < 77, the sum, or 
limiting sum, 2 {/ (x r ) — /(a?/)} is in absolute value < e. For we can 

r 1 

divide the set of intervals into two parts, those for which f (x r ) --- f (xf) 

is positive and those for which it is negative. 

The sum and the product of two functions which are absolutely continuous 

in (a, b) are absolutely continuous in the interval. 

Let /j ( x),f 2 (x) be absolutely continuous in ( a , 6); and let their sum 

be denoted by / (x). We have then 

2 | / (*r) - / «) | =S 2 | /, (x r ) - /, (x/) | + 2 | / 2 (X r ) - / 2 (x/) I • 

r = 1 r 1 r 1 

If e be arbitrarily chosen, and if the measure of the set of intervals is 
less than some number ^, the first sum on the right hand side is < Je; 
also there exists a number ry 2 such that the second sum on the right hand 
side is less than |e, provided the measure of the set of intervals is less 
than t) 2 . If rj be the smaller of the numbers tj 19 t) 2i we have 

s | / (av) - / (V) | < «, 

r *« 1 

provided the measure of the set of intervals is < 17; therefore f (x) satisfies 
the condition of absolute continuity. 

Next, let / (x) = / 2 ( x ) / 2 (x) ; then since 

/ (x r ) - f (x/) = /, (x r ) {/ 2 (x T ) - / 2 (X/)} + / 2 (X/) {f, (x r ) - (X/)}, 

we have 

2 I / ( x r) -/(*/) I s M x S | /, (Xr) -/, (X/) I + M t Z I/, (x r ) (X/) |; 

r-1 r = 1 r-1 

where M 1 , M 2 are the upper boundaries of \f x (x) |, \f 2 (x) | in (a, 6). 
A number 77 can be so chosen that 

f 2| /, (*r) - /l (V) I < 2 ^. r Z | k (Xr) - /, (*,') | < 2 € Mi , 

provided the measure of the set of intervals is < 77. When this last con¬ 
dition is satisfied, we have 

2 |/(z,) -/(*/) I < e; 

r -1 

and therefore the product (x)f 2 (x) is absolutely continuous in (a, b ). 
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If f (x) be absolutely continuous in the interval (a, b ), of x , and <f> (t) 
be absolutely continuous in the interval (a, £), of t , then, if x and t be 
related by the condition x — <f> (£), the function / {<f> (£)} can easily be shewn 
to be absolutely continuous, in case the function <f> (t) is also monotone 
(see § 239 ). It was however pointed out by Dunham Jackson* that this 
is not necessarily the case when the absolutely continuous function <f> (t) 
is not monotone. De la Vallee Poussin has given a sufficient conditionf 
to be satisfied by <f> (t), in order that f {<f> (£)} may be absolutely continuous 
for all absolutely continuous functions / (x); and the matter has been 
further investigated by FichtenholzJ. The latter has shewn that the 
necessary and sufficient condition that / {</> (£)} should be absolutely con¬ 
tinuous, where f (x) and <f> (t) are absolutely, continuous, is that f {cf> (/)} 
should be of bounded variation (see § 243 ). 


THE CONTINUITY OF UNBOUNDED FUNCTIONS 


from which it follows that 


219 . Let it be assumed that the continuous function <f> ( x ), defined in 
the interval (a, 6), of x , has 1 and — 1 for its upper and lower boundaries, 
these being, in virtue of the continuity of the function, also its upper and 
lower limits. 

Consider the function / (x) = - ——, - v"-— - 

1 “ I 9 \ x ) 

f (x) 

<f> ( x ) = r ^y — ; the functions cf> (x), f (x) have the same sign at each 

point x. When <f> (x) = 1 , we may suppose the improper value oo to be 
assigned to / (x); and when cf) (x) = — 1, we may suppose the improper 
value — oo to be assigned to / (x). In this manner, by adjoining to the 
real numbers the two elements oo, — oo, we ensure that a (1, 1) corre¬ 
spondence exists, without exception, between the values of <f> (x), f (x). 
Thus, to the closed interval (— 1 , 1 ), there now corresponds the closed 
interval (— oo, oo), and, in the correspondence so defined, the relative 
order of two elements is preserved without change; thus, as <f> (x) increases 
steadily from — 1 to 1,/ (x) increases steadily from — oo to oo. 

The function / (x) is continuous for any finite value a, of x , which is 
not such that <f> (a) has one of the values 1, — 1,0. 

<f> (*) - (a) _ 

f («) |}’ 


For 


f(x)-f (a) 


{1 - I -A (*) |} {1 - 

provided / ( x) and f (a) have the same sign. A neighbourhood of a can 
be so determined that, at every point of that neighbourhood, 

1 - | <A (£) | > h, 


* See de la Vallee Poussin, Trans. Amer. Math. Soc. vol. xvi (1915), p. 462, footnote, 
t Loc . cit. p. 462. 

t Bulletin de Vacad . roy. de Belgique (1922), Nos. 6, 7, p. 430. 
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where h is some positive number so chosen that 1 — | <f> (a) | > A, and also 
such that | <f) (x) — <f> (a) | < h 2 e , where e is an arbitrarily chosen positive 
number. In this neighbourhood we have then \f(x)—f(a)\< €, and thus 
/ ( x) is continuous at a. 

If </> (a) ~ 0, we have | <f> (x) | < e, in a sufficiently small neighbour¬ 
hood of a, and therefore \f (a) | < < 2c, if e < Hence f(x) is 

continuous at a. 

At a point a at which <f (x) = 1, the function / (x) has the value oo. 
If N be an arbitrarily large number, a neighbourhood of a exists in which 
N 

<f> (x) > y j> in virtue of the continuity of <f> (x), at a; so that/(a) > N 

in that neighbourhood. In a similar manner, it may be shewn that, at 
a point a, at which f (a) — — 1, a neighbourhood of a exists such that 

/(«)< - A T - 

It is convenient to extend the definition of continuity of a function 
at a point in such wise that the points at which / (x) has the values 
oo, — oo are points of continuity of the function, corresponding to the 
continuity of the function <f> (x) at such points. 

We are thus led to the following definition of the continuity of an 
unbounded function at a point in the neighbourhood of which the function 
has indefinitely great values: 

A function f (x) is said to be continuous , in the extended sense , at a 
point a, when , corresponding to any arbitrarily chosen positive number N, a 
neighbourhood of a exists , such that , at every point of that neighbourhood , 
/ (x) > N. It is also said to be continuous at a, if the above conditions be 
replaced by f (x) < — N. The value of the function at a may be regarded 
as oo, in the first case , and — oo in the second case. 

An unbounded function / ( x ) is regarded as continuous in the interval 
(a, b), for which it is defined, if it is continuous at every point / (x) in the 
neighbourhood of which the function is bounded, and is continuous, in 
the extended sense, at every other point. 

Thus, in the above case, the unbounded function / (x), continuous in 
this sense, corresponds to <j> (x) which is continuous in the more restricted 
sense of the term. It can easily be seen that the continuity of <f> (x) is a 
consequence of that of / (#). 

In the above extension of the conception of a continuous function to 
the case of unbounded functions, the points oo and — oo have been re¬ 
garded as distinct. 

It is however possible to regard these points as not distinct from one 
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another, so that when we consider the functions f (x), j ^ x y a va l ue at 

which / (x) — 0 corresponds to a single value i oo, of j ^ ■ 

In this case a function f (x) is said to be continuous at a point a for which , 
corresponding to each arbitrarily chosen positive number N, a neighbourhood, 
of a exists , in the whole of which | f (x) | > N. 

It is clear that this condition may be satisfied in eases in which the 
condition of continuity is not satisfied when oo and — oo are regarded as 
distinct values of a function. 


T11E LIMITS OF A FUNCTION AT A POINT 

220 . Let a be a limiting point of the set of points which forms the 
domain of the independent variable x\ the point a may, or may not, itself 
belong to the domain of x. Let (a, a f h) be a neighbourhood of a on the 
right, and let IJ (h), L (h) denote the upper and lower limits of a given 
function / (x) for all the points of the domain of x which are inferior to 
the interval (a, a -f h). It will be observed that f (a), if it exists, is not 
reckoned amongst the functional values of which U (h)> L (It) are the 
upper and lower limits. 

Let a sequence of diminishing values be assigned to h, which converges 
to zero; denoting this sequence by h x , h 2 , h 2 , ...; the corresponding numbers 
U (h^), U (h 2 ), ... U (//„), ... form a sequence of which the members do not 
increase, and therefore they have in general a definite lower limit, which 
is called the upper limit of f (x) at a on the right. It is easily seen that 
this limit is independent of the particular sequence {& n }, employed in 
defining it. It may happen that all the upper limits U(h) are infinite, in 
which case we say that the upper limit of / (a;) at a on the right is -J- oo; 
or it may happen that the sequence U (h x ), U (h 2 ), ... U (h n ), ... has no 
lower limit, in which case we say that the upper limit of / (#) at a on the 
right is — oo. In any case, the finite, or infinite, upper limit of f (x) at a, 
on the right, is denoted by / (a 0 ). 

The numbers L (/q), L (h 2 ), ... L (h n ), ... form a sequence of which the 
elements do not diminish, and they have an upper limit, which is called 
the lower limit of f (x), at a on the right , and may, as in the former case, 
have infinite values oo, or - - oo. This limit is denoted by / (a h 0). 

Corresponding definitions apply to the left of the point a; and the 
limits of / (x) at a on the left are denoted by / (a — 6), / (a — 0) respec¬ 
tively. In case the point a is a limiting point of the domain of x, on one 
side only, the two limits of the function at a on the other side are non¬ 
existent. 
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In defining / (a + 0) it is immaterial whether we employ the upper 
limit, or the upper boundary, of the function in the open interval (a, a + h). 
If U (h) be the upper limit, and U (h) the upper boundary, all except a 
finite number of values of / (x) in the open interval (a, a f h) are such 
that/ (x) < U (h) 4 - e h ; where e h is an arbitrarily chosen positive number. 
A number h' < h can be so chosen that U (h') £ U (h f ) < U ( h) 4- e A . If 

this be done in such a manner that lirn e h ■= 0, it follows that U (h) and 

__ 

U (h) have the same lower limit, as h ~ 0. 

A similar remark applies to / (a 4 0), f (a — 0), f (a — 0). 

The definitions of the upper and lower limits at a point a may be 
stated shortly as follows: 

The upper limit f (a 4~ 0) of a function at a, on the right , is the limit of 
the upper limit of f (x) in the open interval (a, a 4- h) y when h is indefinitely 
diminished. 

The lower limit f (a 4 0) of a function at a t on the right , is the limit of 
the lower limit of f (x) in the open interval ( a , a 4- h ), when h is indefinitely 
diminished . 

The definitions for the left of a may be stated in a precisely similar 
manner. 

It is to be observed that the four functional limits / (a 4- 0), f (a 4- 0), 
f(a - 0), f (a - 0) are entirely independent of / (a), in case a belongs to 
the domain for which / (x) is defined. Any arbitrary alteration in the 
value of / (a) will not affect these four limits of / (x), at a. 

The conditions that the point a may be a point of continuity of the 
function / (x) are that / (a 4 0), / (a 4- 0), / (a — 0), / (a — 0), f (a) should 
all have the same finite value. 

It may happen that / (a), / (a + 0), / (a f 0) have one and the same 
finite value, but that either, or both, of / (a — 0), / (a — 0) have not this 
value; in this case / (x) is said to be continuous at a , on the right. Con¬ 
tinuity at a, on the left, is defined in a similar manner. 

If the four functional limits at a be all finite and equal, but f (a) have 
a different value, then the function is said to have a removable discontinuity 
at the point a. In this case the function would be made continuous at a 
merely by properly altering the value of / (a). 

The four functional limits at the point x = 0 are usually denoted by 
/ (+ 0), /(+ 0 )./ (- 0)> /( — o ) respectively. 

If, at a point a, 

f(a)=f (a 4 0 ) = / (a 4 0 ) = / (a - 0 ) = / (« - 0 ) = 4- oo, 
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or if / (a), and all the four limits, are — oo, the function is said to be con¬ 
tinuous at a, in the extended sense of the term; the distinction between 
+ oo and — oo being preserved. 

Tn case / (a) = / (a + 0) --- / (a + 0) = -f oo, or - oo, the function is 
said to be continuous at a, on the right, in the extended sense of the term. 

The four numbers f (x \- bj, f (x -f 0), f (x — 0), f (x — 0), which are 
dependent on x, may be regarded as defining functions of x called the upper, 
and the lower, right hand, or left hand, limiting functions associated 
with f (x). 

The upper associated function A (x) may be defined to be a function 
whose value at x is the greater of the two numbers / ( x + 0), f (x - 6); 
and the lower associated function a (.r) may be defined to be a function 
whose value at x is the lesser of the two numbers / (x -f 0), / (x — 0). 

The greatest of the three numbers f (x), f (x -f 0), f (x — 0) may be 
regarded as the value, at x, of a function M (x), called the maximalfunction 
associated with f (x). 

The least of the three numbers / (x), f (x -f 0), / (x — 0) may be re¬ 
garded as the value, at x , of a function m ( x ), called the minimal function 
associated with f (x). 

221. If the upper and lower limits of / (x) at a, on the right, have the 
same value, this common value is called the limit of f (#) at a, on the right, 
and is denoted* by f (a i 0). If the upper and lower limits of / (x) at a, 
on the left, have the same value, this is called the limit of / (x) at a, on 
the left, and is denoted by / (a — 0). Either of the limits, on the right, or 
left, at a point, when such limit exists, may be either finite or infinite. 

The limit at x = 0, on the right, is denoted by / (-f 0); and the corre¬ 
sponding limit on the left is denoted by /(- 0). 

The limit at a point P, on one side, may be also defined as follows:— 
Let (Pj, P 2 , P 3 , ...) be any convergent sequence of points belonging to 
the domain of x, which is such that a, or P, is its limiting point, and such 
that all the points of the sequence are on the one side of P. The values 
of f (x) at Pj, P 2 , P 3 , ... form an aggregate which may be a convergent 
sequence; let us suppose it to be so, and also that its limit has a value 
which is independent of the particular sequence, which is however subject 
to the conditions above stated. In that case, this limit is denoted by 
/(« + 0), or by / (a — 0), as the case may be, and is called the limit of 
/ (#) at a, on the right or left. 

It may be observed that the necessary and sufficient condition for the 
existence of a definite finite limit, on the right, at a , is that, corresponding 

* This notation was introduced by Dirichlet; see Werke, vol. i, p. 166. 
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to every arbitrarily small number e, a neighbourhood (a, a -f 8) can be 
found, such that the difference of the values of the function at every pair 
of points of the domain of x, which are in the interior of this interval, is 
numerically less than e. 

The necessary and sufficient condition that / (a -f 0) should exist, and 
be equal to -f oo, is that, if A be an arbitrarily chosen positive number, 
8 can be so determined that at every point interior to (a, a + 8) the con¬ 
dition f (x) > A is satisfied. In order that / (a + 0) may exist, and be 
equal to - oo, the corresponding condition is that f (x) < — A. 

It is possible that one of the limits / (a -f 0), / (a — 0) may exist and 
not the other. If the domain of x be either a continuous interval, or a 
perfect set of points, a may be taken to be at any point of the domain. 

When the condition for the existence of / (a -f 0), or of / (a — 0), at a 
point «, is not satisfied, the convergent sequence (P 1? P 2 , ... P n , ...), of 
which P (a) is the limiting point, may be such that 

is either not a convergent sequence, or else that its limit depends upon 

the particular choice of the points P lf P 2 , ... P n ,_ In this case the 

fluctuation of / (x) within an arbitrarily small neighbourhood (a, a -f- 8), 
on the one side of a, is either a finite number which has not zero for its 
limit, when 8 is indefinitely diminished, or else it is indefinitely great, 
however small 8 may be. 

222 . If x l9 x 2i x 3 , ... x n9 ... be a convergent sequence of points be¬ 
longing to the domain of x , with a for its limiting point, then the sequence 
f(x 1 ),f ( x 2)i '"f( x n)> ••• may not be convergent; but, if it bo convergent, 
its limit may have either (1), a single value independent of the mode in 
which the convergent sequence is chosen, in which case a is either a point 
of continuity of / ( x ), or a point of removable discontinuity of the function; 
or (2), one of two values, in which case both the limits / (a f 0), / (a — 0) 
exist; or ( 3 ), one of a finite, or an indefinitely great, number of values, 
which all lie between the greatest and least of the four functional limits at a. 

If the sequence/ (x l ),f(x 2 ), ... / ( x n ), ... be not convergent, by omitting 
a certain set of the numbers of the sequence, we shall obtain a convergent 
sequence, or else one which diverges to -f oo, or to — oo. We can therefore 
without loss of generality suppose that x l9 x 2 , ... x n , ... are so chosen that 
this is the case. 

The aggregate of all possible values of the limits of such a sequence 
/ ( x i)>f ( x 2)> •••f ( x n)> •••» f° r all sequences { x n } which converge to a, each 
of which limits has a finite value, or is oo, or — oo, is called the aggregate 
of functional limits at the point a. Of the numbers x l9 x 2 , ... x», ... an 
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infinite number are on one side of a; we may therefore without loss of 
generality assume that x l9 x 2 , ... x ny ... are all on the same side of a. 

It will be shewn that the aggregate of functional limits at a is a closed 
set (which may be finite), provided, in case any functional limit is infinite, 
we regard the point oo, or — oo, as belonging to the set. 

Suppose U is the limit to which such a sequence converges. If the 
aggregate of all values of U is not finite, let U 1 , IJ 2 , ... U n , ... be a con¬ 
vergent sequence of values of U , of which is the limit. It will be shewn 
that U w is itself a value of U. First suppose that is finite. Let U r be 
the limit of a sequence {/ (a; n (r) )}, for n — 1 , 2 , 3 , ...; we may choose r so 
large that | f7„ - U T \ < {e, 

for this value, and all greater values, of r. We can then choose n so that 

| U r — f (x n (r) ) | < ^6, 

for that value, and all greater values, of n. It follows that, for all 
sufficiently large values of r and n , \ U u — f (x n {r) ) | < e. 

As € is arbitrarily small, we can obtain a sequence of numbers x, such 
as x n ir) , for which / (x n (r) ) converges to U whilst the numbers x n (r) con¬ 
verge to a. Hence U„ belongs itself to the aggregate of functional limits. 

In case U n is infinite, we can choose r so that | U r | > A, an arbitrarily 
chosen number; then, as before, n can be so chosen that 

| Ur -fix*') |< *€, 

and thus | f (x n {r) ) I > A - Taking an increasing sequence of values 
of A, and a diminishing sequence of values of e, we obtain a sequence of 
points x n {r) , for which \f (x n ir) ) | is divergent. 

It thus appears that the aggregate of functional limits at a point a is 
closed, in the ordinary sense, when the upper and lower functional limits, 
at a, are all finite; and that, when this is not the case, the set will be 
closed if we regard one, or both, of the points oo, — oo as belonging to it. 

The aggregate of functional limits may be finite, or may consist of a 
closed set, of any type as regards density. 

223 . If the domain of x be unbounded, in one, or in both, directions, 
it may happen that a point x 1 (> 0) of the domain can be found, corre¬ 
sponding to every arbitrarily chosen positive number e, such that the 
difference between the values of / (x), for any two values of x which are 
both greater than x x , is numerically less than 6. In this case the function 
has a definite limit, as x is increased indefinitely in its domain; and this 
is called the limit of f (x) for x ~ oo. Under a corresponding condition 
/ (x) may have a definite limit for x ~ — oo. 

If, as x increases, a point x x of the domain of x, corresponding to each 
assigned positive number A, chosen as great as we please, can be found, 
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such that f (x) > A, for all values of x which belong to the domain, and 
are > x lf then the limit of f (x) is said to be oo, as x is increased inde¬ 
finitely. If f (x) < — A, for all such values of x lf then the limit of f (x) 
is said to be — oo. Similar definitions apply to the case in which x has 
indefinitely great- values in the negative direction. 

In case the limit / (a -4- 0), at a point a on the right, do not exist as a 
definite number, and be not infinite with a fixed sign, it is frequently 
convenient to regard f (a -4 -0) as still existent, but indeterminate, and 
capable of all values belonging to some closed set of which f (a -f 0), 
/ (a 4 0) are the extreme values. It is then said that f (a -b 0) is indefinite 
in value, and that f(a + 0), / (a -4- 0) are its limits of indeterminancy. 
A similar remark applies to / (a — 0), which may also be either definite, 
or indefinite, with / (a — 0),/ (a — 0) as its limits of indeterminancy. One, 
or both, of the limits of indeterminancy, in either case, may be infinite. 

THE DISCONTINUITIES OF FUNCTIONS 

224 . Let us suppose the domain of x to include all points in a suffi¬ 
ciently small neighbourhood of a point a; or, in any case, let a be a 
limiting point of the domain of x. 

The fluctuation of the function / (#) in the closed, or open, neighbour¬ 
hood (a ~ 8, a + 8) of the point a, depends in general upon 8 , but cannot 
increase as 8 is diminished. It therefore has a lower limit, for values of 8 
which converge to zero. This limit, which may be zero, finite, or inde¬ 
finitely great, is called the saltus (Sprung), or measure of discontinuity, of 
the function/ (x), at a; thus: 

The saltus , or measure of discontinuity, of a function f (x), at a point a , 
is the limit of the fluctuation of the function in a neighbourhood (a — 8 , a + 8 ), 
as 8 converges to zero. 

The upper boundary of the function f (#) in the interval (a — 8, a 4- 8) 
has a lower limit, as 8 is indefinitely diminished , which is called the maximum 
M (a), of the function f (x), at a (see § 220). 

The lower boundary of the function , in the same interval , has an upper 
limit , as 8 is indefinitely diminished , which is called the minimum m ( a ), 
of f 0 * 0 * ol «■ Either the maximum or the minimum at a point may be 
indefinitely great. 

The saltus of / (x) at a is easily seen to be the excess of the maximum 
at a over the minimum. 

It is clear that the maximum of / (x), at a, is the greatest of the numbers 
f (a -f 0),/ (a — 0),/ (a), and that the minimum is the least of the numbers 
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/(« + 0), / (a — 0), / (a ); and thus that the saltus at a is the excess of the 
greatest over the least of the numbers 

7(«T0), /(« + 0), 77 ^ 0 ), / (« - 0) , / (a). 

At a point of continuity of /(x) the saltus is zero. Any point at which 
the saltus has a finite value (> 0), or is indefinitely great, is called a point 
of discontinuity of / (#), and in the latter case it is said to be a point of 
infinite discontinuity . 

If the closed neighbourhood (a, a -b 5 ), on the right of a, be taken, the 
lower limit of the fluctuation in this neighbourhood when 8 is indefinitely 
diminished is called the saltus at a, on the right . This is equivalent to the 
excess of the greatest over the least of the three numbers 

/m /(« + o) ,/(4 

A corresponding definition applies to the saltus at a on the left. 

225 . The points of discontinuity of a function may be classified as 
follows:—(1). If both the limits / (a -I 0), f (a — 0) exist, and have values 
which differ from one another, the point a is said to be a point of dis¬ 
continuity of the first kind , or a point of ordinary discontinuity. 

The difference between the greatest and least of the three numbers 
/ (a 0), / (a — 0), / (a) is the saltus, or measure of discontinuity, of the 
function at a . If a be not a point of the domain of x, | / (a + 0) — f (a ~ 0) | 
measures the saltus at a; and if a be a point of the domain, and / (a) lies 
between / (a + 0) and / (a — 0), then the saltus is also measured by 
|/(« + 0)-/(«-0)|. 

When / (a) does not lie between / (a f 0) and f (a — 0), the function 
is said to have an external saltus at a. 

In every case, the saltus on the right is measured by | / (a -f 0) — / (a) |, 
and that on the left by | / (a — 0) — / (a) |. 

Whether there be an external saltus at a, or not, the number 
| / (a + 0) — / (a — 0) | 

is said to measure the oscillation (Schwingung) at a. The oscillation at a 
point differs from the saltus in that the functional value / (a), at the point, 
is in the former case disregarded. 

If / (a) = / (a — 0), whilst / (a) /=/ (a + 0), the function is said to be 
ordinarily discontinuous at a on the right. If f (a) f (a — 0), whilst 
f {a) = f (a + 0), the function is said to have an ordinary discontinuity 
at a on the left . 

It may happen that f (a -f 0), / (a — 0) have equal values which differ 
from f (a). In that case the discontinuity at a is removable (see § 220); 
since by merely altering the functional value at the one point a, the func¬ 
tion can be made continuous at the point. 
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(2) . If neither of the limits / (a -f 0), / (a — 0) exists, the discontinuity 
at a is said to be of the second kind. 

The oscillation* at a is measured by the excess of the greater of the 
two numbers f (a - f 0), / (a — 0) over the lesser of the two numbers 
/(« + 0 ),/(a-0 ), the value of / (a) being left out of account. 

The differences / (a d- 0) — / (a d- 0), / (a — 0) — / (a — 0) may be 
spoken of as the oscillation at a on the right, and on the loft, respectively. 

By Dimf, a definition of the saltus is adopted which differs from the 
one which we have employed; he takes the greatest of the four differences 
I 7 F i 0) — / (a) | as the measure of the saltus, the greater of the two 
differences \f (a + 0) — / (a) | being taken as the measure of the saltus 
on the right. 

( 3 ) . It may happen that one of the two limits/ (a d- 0),/ (a — 0) exists, 
whilst the other does not. In this case, the point a may be said to be a 
point of mixed discontinuity. 

If f (a) exist, and be equal to that one of the two limits f (a d- 0), 
/ ( a — 0) which exists, then the function is continuous at a on one side, 
and has a discontinuity of the second kind on the other side. 

( 4 ) . If one or more of the four limits / (a db 0 ) be indefinitely great, the 
point a is one of infinite discontinuity. 

Under infinite discontinuities is sometimes included the case in which 
/ (a) is defined by 1// (a) = 0, or when / (z) is defined as the limiting sum 
of a series which, for the value a, becomes divergent. 

226 . In an arbitrarily small neighbourhood (a, a -f h ), on the right of 
a point a at which the limits /(ad- 0),/ (a }- 0) have different values, there 
must be an infinite number of points at which / (x) > /(« d- 0 ) — e, where 
e is an arbitrarily small fixed number. 

For, if there were only a finite number of such points in (a, a d- h), 
h could be chosen so small that all such points would be excluded from 
the neighbourhood; thus, in a sufficiently small neighbourhood (a, a + h), 
we should have at every internal point f (x) £ f (a + 0) — e; and then the 
upper limit at a on the right could not be / (a d- 0). In a similar manner 
it can be shewn that, in the arbitrarily small neighbourhood (a, a d- h), 
there must be an infinite number of points at which / (x) < / (a + 0) + €. 

* This definition of the “Schwingung” is given by Pasch in his Einleitung in die Differential- 
und IntegralrechnuTig, p. 139. 

| See Grundlagen , p. 55. 
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In this case we say that, in the arbitrarily small neighbourhood of a , 
on the right, the function makes an infinite number of finite oscillations. 
In case of the infinity of one, or of both, of the limits / (a + 0) and f (a -f 0), 
and, in the latter case, only if they be of opposite signs, the function makes 
an infinite number of infinite oscillations in the arbitrarily small neigh¬ 
bourhood of a . A similar remark applies to the case in which / (a — 0), 
/ (a - 0) have unequal values. It has thus been shewn that: 

A point of discontinuity of the second kind, is one such that , in its arbi¬ 
trarily small neighbourhood ,, the function makes an infinite number of finite 
or infinite oscillations . 

In an arbitrarily small neighbourhood on either side of a point of dis¬ 
continuity of the first kind, the function may make an infinite number 
of oscillations; but since the neighbourhood can be chosen so small that 
the fluctuation of the function in its interior is arbitrarily small, the 
oscillations, when they are infinite in number, are arbitrarily small, suffi¬ 
ciently near the point. 


EXAMPLES 


1. Let f (x) = sinx/x, when x^0, and / (a:) — A , when x -0. 

In this ease /( -» 0) -/( ())--], /(0)- A; thus f(x) has a removable discontinuity at 

x —0, unless A —1, in which ease the function is continuous in any interval. 

2. Let / {x) = - we have then f (a +0) = », / (a -0) = - <*, and /(a) is undefined. 

3. Let / {x) = (x - a) sin ^ * , and / (a) =0; then / (a +0) — 0, / (a -0) =0. This function 

is continuous at x = a, and maktfs an infinite number of oscillations in any neighbourhood 
of that point. 

4. If / (x) = * cosec * , then 

x a x — a 

/ (a +0) =oo, f (a +0)= -oo, f (a -0) =<*>, f (a -0) = -oo. 

This function has an infinite discontinuity of the second kind at the point a. 


l 

5. If f{x)=e x , we have /(-i (>)=«,, /( -0) =0. 

/(-(»=!. 


If /(x)-—i—, then /(+())=(), 
1 -e* 


6. If / (x) -sin x, lim / (x) is indeterminate, the limits of indeterminancy being +1, - I. 

X~ 00 

In the case / (x) =x sin x, the corresponding limits of indeterminancy are +qo, - oo. 

7. Let y=E(x), where E (x) denotes the integral part of x. This function is discon¬ 
tinuous when x has an integral value n ; we then have E (rt - 0) n - 1, E ( n ) = n, E (n + 0) = ti . 

8. Let (x) denote the positive or negative excess of x over the nearest integer; and 
when x exceeds an integer by let (x) =0. This function is continuous except for values 
x =t n + jb where n is an integer. We have (n + J) =0, (n + J -0) = J, (n + J +0) = - 



304 


Functions of a Real Variable 


|CH. V 


ORDINARY DISCONTINUITIES 

227 . Although the points of discontinuity of the first kind have, in 
accordance with usage, been spoken of above as ordinary discontinuities, 
it will be shewn that they are, in a sense, more exceptional than dis¬ 
continuities of the second kind. The following theorem will be established : 

The set of points at which a function f (a;) has ordinary discontinuities is 
enumerable (or finite , or absent); whereas the set of points of discontinuity of 
the second kind may be unenumerable. 

Let ^ be a positive number, and let S k denote that set of points of 
ordinary discontinuity in (a, b) at which the saltus of / (a;) is > k. If the 
set S k be not finite, let a be a point of its derivative. Any neighbourhood 
of a contains points of S k , and therefore contains pairs of points, both on 
the same side of a, such that the difference of the values of / (a:) at the 
points of such a pair is numerically > k. It follows that the point a must 
be a point of discontinuity of the second kind, and thus that a is not a 
point of S k ; and therefore the set S k is an isolated set, and is consequently 
enumerable. If k l9 k 2 , ... be a sequence of diminishing numbers that con¬ 
verges to zero, all the sets S kl , JS k2 , ... are enumerable (or finite, or non- 
existing), and therefore their outer limiting set, which contains all the 
points at which / (#) has an ordinary discontinuity, is enumerable (or 
finite, or absent). 


THE SYMMETRY OF FUNCTIONAL LIMITS 

228 . The question as to the nature of the set of points at which a 
function / (#) has functional limits, on one side, which differ in value from 
the corresponding functional limits on the other side has been investi¬ 
gated* by W. H. Young. 

The general result obtained is that: 

Except at points of an enumerable set , the upper and lower limits of a 
function , on the right , are equal to the upper and lower limits respectively , 
on the left , and there is no external saltus . 

Let us consider the set of points G ktA at which 

/ (x 4- 0) - / (x - 0) > k, f (x + 0) > A ; 
where A and k are positive numbers. If the set G ktA is not enumerable, 
it contains a component H , each point of which is a limiting point on both 
sides (§ 89 ). It follows that, at each point of H , / (x — 0 ) £ A ; and there¬ 
fore / (x -f 0 ) > A 4- k, at each point of H . Similarly, it is seen that, at 
each point of H> f (x 4- 0 ) > A + 2 k ; and by repeating the process we 


See Quarterly Journal of Math. vol. xxxix (1908), p. 67. 



227-229] Symmetry of Functional Limits 305 

have / (x + 0) > A -f nk , whatever value n may have. It then follows 
that at the points of H, f (x + 0) = + oo, and therefore / ( x — 0) = -f oo; 
which is contrary to the condition 

/ (x + 0) - / (x - 07 > k. 

Therefore the set G ktA is enumerable. Giving to A the values in a sequence 
of decreasing numbers diverging to — oo, we see that the set of all points 
at which / (x 4- 0) — / (x — 0) >-k is enumerable. Next, letting k have 
the values of the numbers in a decreasing sequence converging to zero, 
we see that the set of points at which / (x ~f 0) — / (x — 0) > 0 is enu¬ 
merable. Similarly, it may be shewn that the set of points at which 
f (x - 0 ) -f(x + oj > 0 is enumerable. Therefore, except at the points 
of an enumerable set, we have f (x 4- 0) = / (x — 0). 

By considering the function — f(x), for which an upper limit is ob¬ 
tained by changing the sign of the corresponding lower limit of / (x), we 
see that / (x + 0) = / (x — 0), except at the points of an enumerable set. 

Therefore / (x + 0) — / (x — 0) and / (x -j- 0) — / (x — 0), except for 
values of x belonging to an enumerable set. 

In the proof of the above theorem / (x) may be substituted for/ (x -f 0), 
without affecting the reasoning. It then follows that the set of points at 
which / (x) > / (x — 0) is an enumerable set. By changing / (x) into —f(x), 
it then follows that the set at which / (x) < / (x — 0) is also enumerable. 
Similarly the sets at which f (x) > f (x + 0), and at which f (x) < f {x -(- 0), 
are enumerable. Hence, except at the points of an enumerable set, we have 
/ (a? + 0) == f (x — 0) a f(x) 
and f (x + 0 ) = f (x - 0 ) s/ (x). 

The theorem has now been completely established. 

Also, it follows that: 

The set of points at which there is a removable discontinuity is enumerable 
(or finite, or absent ). 

FUNCTIONS CONTINUOUS IN AN OPEN INTERVAL 

229 . The theorem that a function that is continuous in a closed 
interval (a, b) attains its upper and lower boundaries (which are also 
limits), each at one point in the interval, may be extended to the case of 
a function which is continuous only in an open interval (a, b). 

We have in that case the following theorem: 

If a function f (x) be continuous in an open interval (a, b), and iff (a + 0), 
/ (b — 0) are both less than the upper boundary of the function in the interval , 

H I 


20 
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there is one interior point at least at which the function attains the value of its 
upper boundary. Similarly if f (a -f 0), / (b — 0) are both greater than the 
lower boundary of the function in the interval , there is one interior point at 
least at which the function attains the value of its lower boundary. 

For 6, e' may be so chosen that, for a < x ^ a f e, and for b — e' ^ x <b, 
the upper boundaries of the function are less than in (a, b). The original 
theorem may then be applied to the closed interval [a -f 6, b — c'), and 
the result follows. 

The theorem of § 214 may also be extended to the case of a function 
that is continuous in an open interval (a, b). We have the following 
theorem: 

If f(x) be continuous in the open interval (a, b), and if any one of the 
functional limits at a is of opposite sign to any one of those at b , there is at 
least one point of the open interval at which f (x) has the value zero. 

Let / (a 0) ^ A a / (a -f- 0), and f (b — 0) s B ^ / (b — 0), where A 
and B are any functional limits at a and 6, not necessarily extreme limits; 
and suppose A < 0, B > 0. Positive numbers e, e' can be so determined that 
f (a f e)< 0 ,f(b - e) > 0; 

if the theorem of § 214 be applied to the closed interval (a -f e, 6 - e) the 
result follows. * 

If A y B satisfy the conditions 

f (a + 0 )*A* f(a + 0 ), f (b - 0) £ B z f (b - 0 ), 

there is at least one value of x in the open interval at which f (x) has a pre¬ 
scribed value O, lying between A and B. 

For the theorem just proved can be applied to the function / (x) — C. 

If f (x) be continuous in the open interval (a, b), the aggregate of func¬ 
tional limits , on the right at a, contains every number in the interval 

(/(« + 0 j, f(a + 0 )). 

Let c be a number such that / (a -f 0) > c > f (a + 0), and let € be less 
than / (a + 0) — c, and than c — / (a 4- 0); thus 

/(a-f0) — €>c> /(a + 0) -f c. 

In an interval (a, a 4- h), where a + h < b, there is a set of points O h (1) y 
at each point of which/(a + 0) + e>/(x)>/(a + 0) and another 
set G h {2) , at each point of which / (a + 0) -f e > / (x) > f (a -f 0) — €. Let 
be the upper boundary of the set G h (1) , and £ 2 the upper boundary of 
the set G h l2) . We have then, since / (x) is continuous at and 

/ (« + 0) + « (fi) */(« + 0) - e, 

/(p + 0) + £ */(&) zfJa + 0) - €. 
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In the closed interval of which f x , £ 2 are the end-points, since c lies 
between / (f x ), / (f 2 ), there is a set of points at which the function is equal 
to c. This set must be closed; if £ be its upper boundary, we have / (f) — c ; 
where £ is a definite point in (a, a 4- h). If now we assign to h successively 
the values in a monotone sequonce that converges to zero, we obtain a 
sequence of points £ that converges to a, and at each point of the sequence 
the value of / (x) is c. Thus the aggregate of functional limits at a includes 
the number c. This sequence of values of £ is determinate, and its con¬ 
struction does not involve the use of the principle of selection. 

If we assign to e the values in a monotone sequence that converges to 
zero, there will be a corresponding sequence of values of f x such that, in 
that sequence, / (£ x ) converges to / (a + 0). Similarly the sequence / (f 2 ) 
will converge to / (a -f 0). 


SEMI-CONTINUOUS FUNCTIONS 


230. The condition of continuity of a function / (x), at a point x , 
namely that, if e be any prescribed positive number, an open interval 
(x — h, x 4- h) exists such that, for any point x' in it, | / (x') — f (x) | < €, 
can be divided into two separate conditions, viz. that / (x') <f(x) -f e, 
and that / ( z ') > f (x) - e. 


It is possible that, at a point x, one of these conditions may be satisfied 
and not the other. This consideration gives rise to the definition of a 
property called semi-continuity. 


If <f> (#) be a function defined for a continuous domain, and if, corre¬ 
sponding to every arbitrarily chosen positive number e, an open neigh¬ 
bourhood (x — h, x -f h) of a particular point (x) can be determined such 
that, for every point x' in this open interval, the condition, 

<f> (*') < <f> (x) + e, 

is satisfied; then the point x is said to be a point* of upjper semi-continuity 
of the function <f> (x). 

If an open neighbourhood of the point x can be determined, for each e, 
such that <f> (x') > <f> (x) — €, then the point x is said to be a point of lower 
semi-continuity of the function <j> (x). 

m 

That a point x may be a point of continuity of the function <f> (#), it 
is necessary that both the above conditions be satisfied. 

A function may have upper, or lower, semi-continuity at a point, on 
the right, or on the left. 


See Baire’s memoir “Sur lea fonctions des variables r^elles,” Annali di Mat. (3 a), vol. m, 
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If every point of the domain (a, b), for which the function <f> (x) is 
defined, is a point of upper semi-continuity, then the function <j> ( x) is 
said to be an upper semi-continuous function. 

A similar definition applies to a lower semi-continuous function . 

It is clear that, if <f> (x) be a lower semi-continuous function, then 
— (f> (x) is an upper semi-continuous function. Thus the properties of the 
one class of functions may easily be extended to the other class. 

If f (x) be a function, defined for the interval (a, b), and if M (x), m (x) 
denote the maximum and the minimum of f (x) at the point x, then M (#) is 
an upper semi-continuous function, and m (x) is a lower semi-continuous 
function. 

For a neighbourhood (x — h, x + h) of any point x can be determined, 
such that the value (see § 224) of / (x), at every point in this neighbour¬ 
hood, is less than M (x) + e, where e is a prescribed positive number. At 
every point in (x — h 1} x + hfj, where h x is chosen < h, the value of the 
function M (x) is less than M {x) -f €. Since this holds for every value 
of c, the function M (x) is upper semi-continuous at x. 

It is clear that the function m (x), where m (x) denotes the minimum 
of / (x) at the point x , is a lower semi-continuous function. 

The saltus M (x) — m (x), of the function / (x), may be taken to be the 
value of a function, w (x), which is called the saltus-function of / (x). 

1'he saltus-function w (x) of any function f (#) is an upper semi-con¬ 
tinuous function. 

For M (x) and — m (x) are both upper semi-continuous functions, and 
it is easily seen that the sum of two such functions belongs to the same class. 

Instead of considering the functions M (x), m (x) we may consider the 
upper and lower associated functions A (x), a ( x ) (see § 220). It can be 
proved in the same manner as in the cases of M ( x), m (#) that the upper 
associated function A (x) is upper semi-continuous, and that the lower 
associated function a (x) is lower semi-continuous. 

The function f (x - f- 0) is upper semi-continuous on the right, and 
/ (x + 0) is lower semi-continuous on the right. 

If 4* ( x ) b e any upper semi-continuous function , then the set of points for 
which <f> (x)^& a is a closed set, where a is any fixed number. 

For let S be a set of points at which the condition <f> (x) a is satisfied, 
and let P be a limiting point of S. Let us suppose that, if possible, 
<f> (P) < a ; then a neighbourhood of P can be determined, such that at 
every point in it the value of <f> (x) is less than a , and hence this neighbour¬ 
hood cannot contain any of the points of S; but this is contrary to the 
hypothesis that P is a limiting point of S. It follows that <f> (P) z a, ard 
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thus that the set of points for which <j> (x) ^ a contains ah its limiting 
points, and is therefore a closed set. 

It can be shewn, in a similar manner, or it can be deduced from the 
above theorem, that the set of points for which iff (x) a , is a closed set; 
where ift (x) is any lower semi-continuous function. 

If we apply the theorem proved above to the saltus-f unction to (x), 
of any function / ( x ), we obtain the following theorem : 

Having given any function f (x) defined, for a continuous domain , the 
saltus-f unction to ( x) is such that the set of points for which to (x) > a, forms 
a closed set . 


231. In an open interval, f (x) and the minimal function m (x) have the 
same lower boundary . 

For, if there are points at which / (x) is between A — e, A + e; at any 
such point m (#) £ A -f e. If ^4 is the lower boundary of / (x), we see that 
the lower boundary of m (x) is ^ A, since e is arbitrarily small. Again if 
there are points at which m (x) is between A' — e, A ' -f e, there must be 
a point at which / (x) is also between these numbers; hence if A' is the 
lower boundary of m (x), the lower boundary of f (x) is < A' -f e, and is 
therefore £ A'. The two inequalities shew that / (x) and m (x) have the 
same lower boundaries. 

If f (x) is an upper semi-continuous function, defined in the interval, 
(a, b), the set of jfoints at which to (x) & a is non-dense in the interval (a, b). 

We have in this case M (x) — <f (x), and thus to (x) — </> (#) — m (x). 
If possible, let there be an open interval in which at every point 

tf) (x) — m (x) cr; 

and thus m (x) s <f> (x) — a. If a be a point of the interval, and e (< a) 
be arbitrarily chosen, a point x x in an arbitrarily small neighbourhood of a , 
contained in the interval, exists such that 

<f> (tfj) < m (a) -f € < tf> (a) — (a — e). 

In an arbitrarily small neighbourhood of x x , also contained in the open 
interval, a point x 2 can be determined such that tf> (x 2 ) < cf> (x x ) — (cr — e) f 

° r <f> (x 2 ) < tf> (a) - 2 (a - e). 

Proceeding in this manner, a point x n , arbitrarily near to a, can be so 
determined that <f> (x n ) < <f> (a) — n (a — e), for any integer n; and thus 
<f> (# n ) is an arbitrarily great negative number. It follows that there exists 
an everywhere dense set of points at which <f> (x) < — A, where A is an 
assigned positive number. For, if B be a point at which <f> (x) < — A', 
where A ' > A, a neighbourhood of exists in which 
<f> (x) < — A' + € < — Ay 
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if € be chosen < A' — A. This neighbourhood contains no points of the 
set for which <f> (x) & — A. Since the points are everywhere dense, we 
see that, in any interval, a sub-interval exists which contains no points at 
which <f> (x) ^ — A, and thus the set of such points is non-dense. Giving 
to A the values in a divergent sequence of increasing numbers, and re¬ 
membering that <f> ( x ) is finite at every point, we see that the points of 
the continuum (a, b) are divided into the sum of sets of a sequence of 
non-dense sets. But this is impossible, since the continuum does not form 
a set of the first category. It follows that there cannot exist an every¬ 
where dense set of points, corresponding to each positive number A , such 
that cf> (x) < — A. Thus no open interval exists in which cu (x) ^ cr, and 
therefore the set of points at which oj (x) ^ a is non-dense. We have now 
the equivalent form of the above theorem: 

The points of discontinuity of an upper semi-continuous function form a 
set of the first category. 

232. If </> (x) be an upper semi-continuous function, and if , at every point 
of the interval (a, b), the minimum of </> (x) be zero , then there exists a set of 
points , everywhere dense in the interval , at which </> (x) is itself zero. 

To the interval (a, b), the field of the variable x, let there be fitted on 
a system of nets with closed meshes. Let P be the centre of a mesh d n , 
of D n ; then since the minimum of <f> (x) at P is zero, a mesh d Ul (n x > n), 
of the sequence of meshes which defines P, must be such that in every 
point of d ni we have 0 ^ <f> (x) < c/2. Another mesh d tl2 , contained in d ni , 
can be so determined that, for all points of it, 0 </> (x) < e/2 2 ; and so on. 

We have then a sequence of meshes d ni , d 1H , ... each of which contains 
the next, and such that, in d nt , we have 0 ^ </> (x) < e/2 r . This sequence 
defines a point P (.r), in all of them, for which 0 ^ ( x) < e/2 r , for every 

value of r. Therefore cf> (x) =-• 0; and every mesh d n contains a point such 
as x. It is thus seen that <f> (x) vanishes at the points of an everywhere 
dense set. 

In particular, we soe that, if tu (x) be the saltus-function of any given 
function f ( x ), and if a > (x) has its minimum equal to zero at every point of 
the domain of x, then oj (x) vanishes at an everywhere dense set of points. 
The points of this set are the points of continuity of f {x). 


233. If infinite values of a function are admitted, the function / (x) 
may still be regarded as semi-continuous, when it is infinite. 

Consider a function/ (x) which has infinite values; the function 


is bounded, and the values 
tively, of f (x). 


f (*) 

i +1/(*) 


1,-1 correspond to values -f oo, — oo respec- 
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We have / (x') — / (x) = 


(*') - <f> (x) 


provided <f> (x), <f> (x') have the same sign; hence if <f> (x') — <f> (x) < e, then 


/ (*') -/(*)< 


< € 


U - I 4 > (*) I) 2 

Therefore, if (j> (x) is upper semi-continuous at x, so also is f (x). In case 
<f> (x) is positive, and <f> (x') negative, </> ( x') < <f> (a:), and then f (x') <f(x). 

If / (x) — oo, we have <f> (x) = 1 , and at all other points <f> (x') ^ <M*); 
therefore a point where / (x) =■= 4- oo is a point where <f> (x) is upper semi- 
continuous; and similarly where / (x) = — 00 , <f> (x) is lower semi-con¬ 
tinuous. 


It is therefore convenient to regard / (a;) as upper semi-continuous, 
where it has the value 4 - 00 , and lower semi-continuous where it is — 00 ; 
the correspondence with (f> (x) being thus preserved. 


234. The following theorem is a generalization* of the theorem of § 217, 
that a continuous function is uniformly continuous in its closed domain. 

If f (a;) is bounded in the interval (a , b), and, if k be a number greater than 
the upper boundary of oj ( x ) in (a, b), there exists a positive number a, such 
that , in every closed sub-interval in (a, b) of length not exceeding a, the 
fluctuation of f (x) is < k . 

It is convenient to assume that, for values of x that are < a, f {x) is 
defined to be equal to/ (a), and that, for values of x that are > bj(x) =f(b). 

If x be any point in (a, b), the fluctuation of f (x) in the interval 
(x — h, x 4- h) is < k, provided h be sufficiently small. The upper boundary 
of all values of h for which this is the case may be denoted by h. If 6 be 
a fixed number such that 0 < 0 < 1, the fluctuation of the function in the 
closed interval (x — Oh, x 4- Oh) is < k. Corresponding to each point x in 
(a, 6), a definite interval may be assigned, viz. the interval (x — Oh, x -f Oh), 
or the part of it which is interior to (a, b). By the Heine-Borel theorem, 
a finite set of these intervals exists, such that every point of (a, b) is 
interior to at least one of these intervals. The end-points of the intervals 
of this finite set A form a finite set of points in (a, b) ; let a be the smallest 
of the distances between consecutive points of this set. Any closed interval 
whatever, in (a, b), of length not exceeding a is in one of the intervals of 
the finite set A. Therefore, in such an interval, the fluctuation of / (a;) 
is < k. 

The corresponding theorem for the case of a bounded function defined 
in a cell of two or more dimensions may be stated as follows: 

If a function be bounded in a cell (a, b) of any number of dimensions, and 
if k be a number greater than the upper boundary of the saltus-function in the 

* See Baire, loc. cit. p. 15. 
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whole cell , there exists a positive number a such that , in every closed cell con¬ 
tained in (a, b), of span not exceeding a , the fluctuation of the function is < k. 

The proof is a modification of that given above for the case of a linear 
interval. There is shewn to exist a finite set of closed cells, such that in 
each of them the fluctuation of the function is < k , and such that each 
point in the cell (a, 6) is interior to one of the cells of the finite set. If we 
project on the edges of (a, b) all the edges of the cells of this finite set, 
we have on each edge of (a, b) a finite set of points. The number a can 
then be taken to be the least distance between consecutive points of these 
finite sets, when all the points on all the edges of (a, b) are taken into 
account. Any cell whatever, in (a, b ), of span not exceeding a will bo in 
one of the cells of the finite set, and therefore the fluctuation of the function 
in such a cell is < k. 

The definitions of semi-continuity given above, and the theorems 
established, are applicable when the domain of the variable x is not an 
interval (a, b), but any closed set of points. In that case, we regard func¬ 
tional values in any interval as only existing at those points in the interval 
which belong to the domain of x. 

Moreover, the definitions and theorems are applicable to the case of 
functions of a number p of variables. In this case, instead of an interval 
(x — h, x -f h) used in defining semi-continuous functions and the saltus- 
function, we may employ the “sphere” 

(P> - x (1 >) 2 + (f< 2 > - x (2) ) 2 + ... b (£<*» - x^Y < h 2 , or £ h 2 , 
according as the neighbourhood is open or closed, or else a cell may be 
employed. 

EXAMPLE 

If* / ( x) be any function, <f> (x) its maximum, and <p (x) its minimum at the point x, 
and g (x) be any continuous function, then <f> (x) +g (x), (x) +<7 (x) are the maxima and 

minima at x, of/(x) 4 g (x). Also the functions 4> (x) ~/(x), / (x) - ift (x) have each, in any 
domain, the minimum zero. 


APPROXIMATE CONTINUITY 

235. A function / (x) is saidf to be approximately continuous at a point 
a of the domain (a, b) of the variable x , when, corresponding to each 
arbitrarily chosen number e, the set of points x for which \f(x) — / (a) | < e 
has the metric density unity at the point a. 

The necessary and sufficient condition that the function f (x) should be 
approximately continuous at the point a is that it should be continuous , at a, 
relatively to a set of points 0 which has the metric density unity at the point a. 

* Baire, loc . cit. p. 9. 

f Den joy. Bull, de la soc. math, de France, vol. xuii (1915), p. 165. See also Looman, Funda- 
menta Math. vol. v (1924), p. 105. 
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The condition in the theorem is sufficient; for there then exists a 
number h , such that, in the interval (a — h , a + h), the condition 

i/(*) -/(«) i <« 

is satisfied for every point x that belongs to G. This part of G has the 
same metric density at a as G itself. 

To shew that the condition is necessary, let it be assumed that the 
set G (a, €) of points at which \ f (x) — f (a) \ < e has the metric density 1 
at the point a, whatever value e may have. 

Let {€„} be a sequence of decreasing values of e that converges to zero; 
for each e n a number h n (< h n _ x ) can be so determined that the measure of 
the part of G (a, e n ) in the interval (a — h n , a + h n ) is > 2h n (1 — e n ). The 
numbers Ii n may, if necessary, be so altered that h n+1 /h n < l/e n , for every 
value of n. 

Let F n be the part of G (a, e n ) that is in the two intervals 
(a ( e n h nn , a + h n ), (a - h n , a - e n h n+l ); 
then F n+m is a part of G (a, e n+wl ), and therefore of G (a, e n ). If G be the 
set M (F x , F 2 , ... F n , ...), then the set G has the metric density 1 at the 
point a, since m (F n ) > 2h n (1 — 2e n ). Let x l9 z 2 > ••• x m> be a sequence 
of numbers converging (increasing, or decreasing) to a , and such that all 
of them are points of G. Each one of them x m belongs to one of the sets 
F l9 F 2 , ... F n , ...; say to F^. It is clear that the numbers in increase 
as m does so, and that m ~ oo as m ~ oo. 

We have then | / (x m ) — / (a) | < e^; and therefore the sequence 
converges to f (a). Hence f (x) is continuous relatively to G ; 
also G has the metric density 1 at the point a. 

THE CLASSIFICATION OF DISCONTINUOUS FUNCTIONS 

236. Let us suppose a function to be defined for all points in a con¬ 
tinuous interval (a, b) ; at each point x , the saltus of the function has a 
finite value, or is indefinitely great, its value being zero at a point of 
continuity. With a view to the classification of functions, in accordance 
with the distribution of the points of continuity, and of discontinuity, in 
the interval (a, 6), the question arises, what is the most general distri¬ 
bution of the points of continuity ? 

The answer to this question is contained in the theorem: 

The points of continuity , if they exist , of a function defined for a con¬ 
tinuous interval , form an ordinary inner limiting set. 

To prove this theorem, let e be a fixed positive number, and enclose 
each point P, of continuity of a function / ( x ), in an interval so chosen 
that the fluctuation of / (#) therein is < e; all the points of continuity are 
then enclosed in a set of intervals which in general overlap; let these 
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intervals be replaced by the equivalent set A«, of non-overlapping intervals. 
Imagine these sets A« constructed, corresponding to a sequence of dimin¬ 
ishing values of € which converges to zero; there exists then a set of points 
which are interior to intervals of all these sets of intervals, since this set 
of points includes all the points of continuity of f (x). If Q be any point 
which belongs to the inner limiting set so defined, Q must be a point of 
continuity of / (x) ; for, corresponding to any arbitrarily small number e w , 
Q is in the interior of some interval in which the fluctuation of the function 
is < e n , and thus Q is a point of continuity of the function. 

In accordance with the theorems which have been obtained in § 97, 
relating to ordinary inner limiting sets, the points of continuity of a 
function may form an enumerable set which contains no component dense 
in itself, or else they form a set of the cardinal number of the continuum. 
In the latter case the set is a residual set, provided it be everywhere dense. 
For the closed set at which a> (x) £ e, will, for every value of €, be non- 
dense; and the points of discontinuity accordingly form a set of the first 
category. 

These results lead to the following classification* of functions: 

(1) . A function may have no points of continuity; it is then said to 
be totally discontinuous. 

(2) . The points of continuity may form an enumerable set which has 
no component dense in itself. 

(3) . The set of points of continuity may be of the cardinal number of 
the continuum, and be 

(а) , non-dense; 

(б) , everywhere dense and unclosed, in which case the function is 
said to be a point-wise discontinuous function ; 

(c) , everywhere dense and closed, in which case the function is 
continuous; 

( d ) , everywhere dense in each interval of a set, and non-dense in 
each interval of another set external to the former one. 

This last case (d) is not essentially distinct from the former ones. 

By Hankel and others the term “totally discontinuous” has been 
applied to all functions which are neither continuous nor point-wise dis¬ 
continuous. 

237. It has been shewn by W. H. Young that a function can be con¬ 
structed which is continuous at every point of any given ordinary inner 
limiting set of points, and is discontinuous at every other point of the 
interval. 

* See a paper by W. H. Young, Wiener Sitzungsber . vol. cxn, Abt. n a (1903), p. 1307. 
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Let E denote an ordinary inner limiting set, and let the function 
/ (x) be defined as follows : 

(1) . At every point x, of E, let / (x) x. 

(2) . It has been shewn that a sequence of sets of non-overlapping open 
intervals can be constructed such that the only points, each of which is 
in an interval of every set, are the points of E. Let Q be a limiting point 
of E which does not belong to E\ then a number n exists such that Q is 
in an interval of the (n — 1 ) th set, but not in one of the intervals of the n th set. 
Let this interval of the (n — l) th set be of length d Q \ and at the point Q 
let / (x) = x Q -f e n d Q , where e is a fixed positive number less than unity; 
in the case n = 1, we put d Q ^ e. 

(3) . If R be a point which does not belong* either to E or to its de¬ 
rivative, it must lie between two definite points A, B both of which 
belong to E or to E', and such that no point of E or of E' lies between 
A and B. If x R be a rational number, let / ( x R ) = x A or x Bi according as R 
is nearer to A or to P; when x R is irrational, let / (x R ) — x R ; and if R be 
the middle point of AB , let f (x R ) x R . 

It is clear that the function so defined is discontinuous at every internal 
point of the interval AB , and at the end-point A it is continuous, or dis¬ 
continuous, on the right, according as A does, or does not, belong to E\ 
a similar result holds for B. It has thus been shewn that the function is 
discontinuous at every point which does not belong to E. 

To shew that, at any point P, of E, the function is continuous, con¬ 
sider those intervals, one of each set in the sequence, which contain the 
point P : the lengths of these intervals wdll have a lower limit rf, which 
may be zero. In every interior point of d we have / (x) — x\ and thus, if 
P be interior to d, P is a point of continuity of the function. If P be an 
end-point of d , it is certainly continuous on the side towards the interval; 
and we have to shew that it is also continuous on the other side. Choose 
an arbitrarily small number or, and an arbitrarily large integer m; then 
a number n 1 > m can be found such that the nj th interval, and all subse¬ 
quent intervals of the sequence which contain P, are of length between 
d and d 4- a. The piece of one of these intervals which is not a portion 
of d is of length < a ; suppose that Q is a point in this piece which belongs 
to E' but not to E : then 

I / ( x p) - / ( x q) | = | Xp- Xq- e n d Q I < I cr + e m (d + <x) | 
since n 1 > m, and d Q < d -f a. From this, there follows 
I / ( x p) “ / ( x q) | < 2cj + e m d < 3a, 

if m be chosen sufficiently great. If R be an interior point of an interval 
AB which contains in its interior no point of E\ or of E, and if the points 
A y B be both so near to P that their distances from it are less than a, we 
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have | f (x F ) — / (%) | < cr, in virtue of the definition of / (x R ) ; also if only 
one of the ends A, B be within the interval of length < d -f or, which has 
been chosen, then an interval further on in the sequence can always be 
found such that the middle point of AB is exterior to it, and thus the 
inequality | / (x P ) — / (^ p ) | < cr holds as in the former case. It has now 
been shewn that, for any arbitrarily chosen a, a neighbourhood of P can 
be found such that, for all points x in it, | / ( x P ) — / (x) | < 3ct; therefore 
P is a point of continuity of the function. The case in which d 0 does 
not require separate treatment. 


EXAMPLES 

].* Let (J denote a non-dense perfect- set of points in the segment (0, 1), such that 
the end-points of the complementary intervals are rational points. Let f (x) be defined 
thus:—At every irrational point inside an interval complementary to G , let / (x)~x; at 
every rational point of such interval, let f (x) be equal to the value of x at the'middle point 
of the interval; and at every point external to a complementary interval, let/(x)=l. 
This function is discontinuous except at the middle points of the intervals complementary 
to G; thus the set of points of continuity is an enumerable set which contains no com¬ 
ponent dense in itself. 

2* With the same non-dense perfect set as in Ex. 1, let AB be a complementary 
interval of G , and M its middle point. At every rational point of AM except M, let 
/ (x) ~x A , and at every rational point of MB except M, let f (x) -x B ; also at all points of 
(0, 1), except those for which the functional value has been already specified, let / (x)=x. 
In this case the points of continuity are non-dense, and of the power of the continuum. 


POINT-WISE DISCONTINUOUS FUNCTIONS 

238. A function being defined for the continuous domain (a, b), it has 
been shewn in § 230 that, if k be any fixed positive number, those points, at 
which the saltus of the function is k, form a closed set. 

This theorem follows immediately from the property of semi-con¬ 
tinuous functions established in § 230, by considering the saltus-function. 
It may also be proved directly as follows: 

If P be a limiting point of the set for which the saltus is 2 ? k, then in 
any arbitrarily small neighbourhood of P there are points of the set; 
hence the fluctuation of the function in this neighbourhood is ^ k, and 
therefore the saltus at P is ^ k. 

Moreover, such a limiting point P, of the set of points at which the 
saltus is ^ k , must be a point of discontinuity of the second kind, at least 
on one side of P. If, at P, the function have a limit on the right, a neigh¬ 
bourhood PQ can be found such that the inner fluctuation in PQ is < k ; 
hence inside PQ there can be no point at which the saltus is ^ k ; and 
therefore P is not a limiting point on the right, of the set for which the 
saltus is ^ k. A similar remark applies to the left of P. 

* See W. H. Young, loc. cit. 
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237, 238] 

We have already, in § 236, defined a point-wise discontinuous function 
as one of which the points of continuity are everywhere dense and un¬ 
closed in the domain of the function; this definition is that given by Dini*, 
and is equivalent to the following definition given by Hankelf: 

A point-wise discontinuous function is one for which those points at which 
the saltus is ^ h , an arbitrarily chosen positive number, form a non-dense set K, 
whatever value k may have. 

That this set is closed has been shewn above. 

To prove the equivalence of the two definitions, let it be assumed that 
in any arbitrarily chosen sub-interval (a, /?), a point of continuity x x can 
be found. A neighbourhood can be found for x x , internal to (a, /?), in which 
the fluctuation of the function is < k , and this neighbourhood can contain 
no point at which the saltus is ^ k; hence the points at which the saltus 
is ^ k form a non-dense set K , since, interior to any sub-interval, a sub¬ 
interval can be found which contains no point of the set K. 

Conversely, choose a descending sequence of values of k , say 

k\ , k 2 , k 3 , ... 

which converges to zero, and let K 1 ,K 2 ,K 3f ... be the corresponding 
non-dense closed sets, each of which necessarily contains the preceding 
one; then the set M (K l9 K 2 , K 3) ...) is the set of all the discontinuities 
of the function. 

In accordance with § 93, this set is of the first category, and the com¬ 
plementary set, which is the set of points of continuity of the function, 
is everywhere dense, and has the cardinal number of the continuum, being 
a set of the second category, of the kind that is called a residual set. 

It will be observed that the set of all the points of discontinuity may 
be either everywhere dense, or non-dense, in the whole or part of the 
domain of the variable. This set may be finite, enumerably infinite, or of 
the power of the continuum. 

In accordance with the theorem proved in § 231, an upper semi-con¬ 
tinuous function can only be point-wise discontinuous. 

The set K , although non-dense, is not necessarily of content zero. By 
HamackJ, the term “point-wise discontinuous function'’ was only used for 
such functions as possess the property that the set K, for each value of k , 
has content zero. It will be seen that this latter case is of spedial im¬ 
portance in connection with the theory of Riemann integration. 

* See Orundlagen , p. 81. 

f Math. Annalen , vol. xx (1882), p. 90. This is a reproduction of Hankel’s Univ. Programm, 
Tubingen, 1870, entitled “ Untersuchungen uber die unendlich oft oscillierenden und unstetigen 
Functionen.” 

x Math. Annalen , vol. xix (1882), p. 242, and vol. xxrv (1884), p. 218. 
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It has already been shewn, in § 236, that the points of continuity of 
the point-wise discontinuous function form an ordinary inner limiting set; 

andif {M, {S 2 },... {§„},... 

be the sets of intervals complementary to the closed sets 

K Xi K 2 , ... K n , ..., 

they form a sequence of sets of non-overlapping intervals which define 
the set of points of continuity as their inner limiting set. 

The whole theory of point-wise discontinuous functions is applicable 
to the case in which the domain of the variable is not a continuum, but 
is any perfect set. In this case also, the points of continuity of a point- 
wise discontinuous function are everywhere dense relatively to the perfect 
domain, and the points at which the measure of discontinuity is ^ k form 
a closed set, non-dense relatively to the domain of the variable. That 
this is the case may be shewn by making the points of the perfect set 
correspond in order to the points of a continuous interval, as explained 
in § 96. The points of discontinuity, and those of continuity, relatively 
to the perfect domain, are sets of the first and the second category respec¬ 
tively, relative to that domain. 

239. An important class of functions is that in which each function 
has ordinary discontinuities only; the domain of the functions being either 
a continuous interval, or a perfect set of points. 

A function which has only ordinary discontinuities is point-wise dis¬ 
continuous. 

The set of points at which the saltus is £ k must be finite. For if it 
were not finite it would have a limiting point which would be a point of 
discontinuity of the second kind; and this does not exist. It thus follows 
that the set of points of discontinuity is a non-dense enumerable set. 

A monotone function is one such that, for every pair of values x x , x 2 of 
the variable , such that x 2 > x lf the condition f (x 2 ) ^f(x x ) is satisfied; or else , 
for every such pair the condition f (x 2 ) £ f (x x ) is satisfied. 

In the former case the function is said to be monotone non-diminishing, 
and in the latter case monotone non-increasing. Since a monotone function 
has no oscillations in the neighbourhood of any point, every discontinuity 
is an ordinary one. It follows that: 

A monotone function is point-wise discontinuous (or continuous). Its 
points of discontinuity form an enumerable set. The points at which the 
saltus is ^ Jc (> 0) form a finite set. 

The domain of the variable being either a continuous interval or any 
perfect set, let us suppose that, at every point, the oscillation (see § 226) 
of the function, both on the right, and on the left, is < k\ there can then 
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only be a finite number of points at which the saltus is s k ; i.e. the set K 
is finite. 

For if K were not finite, it must contain a limiting point P, which has 
been shewn, in § 238, to be a point of discontinuity of the second kind. 
Any arbitrarily small neighbourhood of P, on one side at least, must 
therefore contain points at which the saltus is ^ k , and hence the oscilla¬ 
tion at P on this side could not be < k. 

The domain can therefore be divided into a finite number of parts 
within each of which there is no point at which the saltus is ^ k\ and it 
follows that the domain can be divided into a finite number of parts, 
within each of which the fluctuation of the function is < k. 

If , at each point of a set which is everywhere dense in the domain of the 
variable , there exist a limit of the function on one side at least , then the 
function is either point-wise discontinuous , or else it is continuous. 

In any interval, containing points of the domain, a point can be found 
which has a neighbourhood, on one side at least, in which the inner 
fluctuation is < k\ within such neighbourhood the saltus is everywhere 
< k ; hence the points of K are non-dense in the domain, and thus the 
function is either point-wise discontinuous, or else it is continuous. 

If a function be defined for a continuous interval , and all the points of 
discontinuity be ordinary ones , at least on one side , then the set K, of points 
at which the saltus is £ k, is a set of content zero. 

The set K can be resolved into a perfect set G and an enumerable set; 
the set G contains points which are limiting points on both sides, and at 
such points the oscillation both on the right and on the left must be £ k; 
it follows that the set G is non-existent, and that K is therefore an 
enumerable closed set, which has necessarily content zero. 

The theorem still holds if there be points of discontinuity of the second 
kind which form a closed set of content zero, for these points may be 
enclosed in a finite number of intervals whose sum is arbitrarily small; 
the theorem can then be applied to each of the remaining intervals of the 
domain. 

240. It has been shewn in § 222 that the aggregate of functional limits 
of a function, at any point, is a closed set. 

A point-wise discontinuous function* can be so constructed that , at a 
point Xq , the aggregate of functional limits is a prescribed dosed set G . 

To establish this theorem, we observe that, if G be unenumerable, it 
may be replaced by an enumerable set G 1 , everywhere dense in G. Let 

V l ,v t ,v 3> ... 

denote the set G l9 arranged in the order-type w. 

* This theorem was given by Bettazzi, see Rendiconti d% Palermo , vol. vi (1892), p. 173. 
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Next, choose an enumerable sequence x lf x 2 , ... x nf ... of values of x 
having the single limiting point x 0 ; and arrange the sequence {a; n } in the 
order-type a> 2 . The sequence {x n } may thus be split up into an enumerable 
set of sequences { x lr }, {; x 2r }, ... {x^}, ... each of which has the limit x 0 . The 
function / (x) may be defined by the specifications, that / (x) = 0, for all 
values of x which do not belong to the sets {x lr }, {x 2r } } ...; and that 
/ ( x sr) = v *r> where v 8ly v s2 , ... v ar , ... form a sequence chosen so as to con¬ 
verge to the hmit V B . The function/ (x) so defined is continuous at every 
point except x Q , x ly ... x n , ..., and it has the required property; since the 
points of G lt and therefore of G , are all values of the aggregate of func¬ 
tional limits at the point x 0 . 


EXAMPLES 

1 .* Iff (x), <f> (a:) be two point-wise discontinuous functions defined for the same interval, 
there is an everywhere dense set of points at each of which both functions are continuous. 
This theorem follows at once from the fact that the points common to two residual sets also 
form a residual set, and that this holds for every sub-interval contained in the given interval. 

2. Let (x) denote the positive or negative excess of x above the integer nearest to it, 
and if x be half-way between two successive integers, let ( x) =0. Let a functionf / ( x ) be 
defined for the interval (0, 1) as the limit of 

(x) (2x) (3x) (nx) 

14 9 n 2 ’ 

when n is indefinitely increased. The function / (x) is a point-wise discontinuous function, 
in which the set X, of points at which the saltus is ^ k 9 is finite for each positive value of k. 
It can be proved that, if x =m/2n, where m and 2 n are relative primes, then 

/(£-•)-/(£)♦!&• 

For values of x not of the above form, f (x) is continuous. The number of points of K is the 
number of irreducible proper fractions having even denominators 2 n, such that 7r 2 /8n 2 ^ k. 
The set of all the points of discontinuity is everywhere dense in the interval (0, 1). 

3. LetJ y — c, for all rational values of x; and y —d, for all irrational values of x. This 
function is totally discontinuous. 

4. Let§/(x) = 1, for all values of x in the interval (0, 1), except x = ^, (» = 1, 2, 3, ...), 

for which / (x) =0. At each of the points there is a saltus equal to unity. This funotion 
is point-wise discontinuous, and the content of K is zero, for every value of k. 

5. In§ the interval Q, 1^ of x, let /(x)= 1; in the interval Q 2 , let/(x)=|; and 

in general, in the interval , let f{x)~^. In this case the point-wise discon¬ 

tinuous function / (x) is such that the number of points at which the saltus is ^ k is finite 
for every value of k> 0. 

* See Volterra, Oiomale di Mat . vol. xrx (1881), p. 77. 
f See Riemann’8 Oes. Werkc f p. 242. 

X Dirichlet’s WerJce , p. 132. § Hankel, Math. Annalen , vol. xx (1882), p. 85. 
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6. The* points of a continuous interval (0, 1) may be put into correspondence with the 
points of a non-dense set of points, dense in itself, contained in an interval (a, b), in such 
a manner that the relative order of two points of the interval (0, 1) is the same as that of 
the corresponding points in (a, b). Such a correspondence is defined by a point-wise dis¬ 
continuous monotone function y—f ( x ). 

7. f Let the numbers of the interval (0, 1) be expressed as finite or infinite decimals 

x - -a x a 2 a 3 ... a n ..., and let/ (x) - +.... The function / ( x ) is monotone, and 

is discontinuous for every value of x represented by a finite decimal. The set of points K , 
for a given value of k, is finite. The function / (x) defined by / {x) - - 0 r/, 0 « 2 0«3 ... has similar 
properties. 

8. J Let the points of the interval (0, 1) be represented by decimals, and consider the 
set Gq of those points for which only the digits 0 and 1 occur in the decimal representation, 
excluding those points for which all the figures are 0, from and after some fixed place. 
The set G 0 is non-dense in the interval (0, 1), and has the cardinal number r. Any point 
x 0 , of G 0 , is represented by ■n 1 a 2 a. A ... a n ..., where a n is 0 or 1. Ltd. £ be a fixed point 
b x b 2 ... b n ... of G 0 , and let x$ denote x 0 ■+ 2^ze*c 1 c 2 ... c n ...; so that c n =a n + 2 b n . With £ 
fixed, let the set of all points X( be denoted by G%\ the points of G% are all different from 
those of G/ 0 , and for two values £, £' of the sets (/$, G^ f have no point in common. For 
the two numbers c x c 2 ... c n ..., ‘c x c 2 ... c n ' ... are identical only when c n =c n ', which holds 
only when a n —a n \ and b n ~b n '. If we read off in the dyad scale the decimal representation 
of £, we obtain, by giving £ all the values in G 0 , every point in (0, 1) except the point 0, 
and these once only; let the point which, by thus using the dyad scale, corresponds to £, 
be denoted by (£). Now let f (x) be defined by the rules f (x^) =(£), f (x 0 ) -0, and / {x) --0 
for all other values of x. The point-wise discontinuous function / (x) so defined for the 
interval {0, 1) is such that, at all the points of the enumerable sot G$ 9 the saltus is (£); the 
set of all the points of discontinuity is non-dense in (0, 1); and f (x) is constant, and =0, 
in an everywhere dense set of linear intervals. 

9. J; Let the points x, of the interval (0, 1), be expressed as radix-fractions in the scale 
of 3. Let 6r 0 be the set of points for which all the figures of the radix-fraction are 0 and 1, 
excepting those points for which all the figures are 0, after some fixed place. Let G n consist 
of all the points which contain the digit 2 in at most the first n places, but are also such 
that the nth figure is 2; then G n is non-dense, and of cardinal number c. There are left 
only those points for which the radix-fractions contain the digit 2 an infinite number of 
times; and these points belong to a set //, for which the radix-fractions contain an infinite 
number of digits other than 2, or to a set G, for each point of which every digit is 2, from 
and after some fixed one. Each point of G can be represented by a terminating ladix- 
fraction which contains only a finite number of 2’s, and can be added to a G n . Let 
G 0 ,G lt G 2i ...» when so increased, become U 0 , G x , G if ...; and take a sequence of decreasing 
numbers g Q ,g x ,gz, •••• Let the function / (x) be defined by the rules f(x)~g n , if x is a 
point of G nJ and f (x)~ 0 for all points of H. The point-wise discontinuous function / {x) 
is continuous at all the points of //, and the points of discontinuity are everywhere dense 
in (0, 1), and of cardinal number c. 

* Hamack, Math . Annalen, vol. xxm (1884), p. 285. 

f Peano, Riv. di Mai . vol. 1 . 

X Schoenflies, Gottinger Nachrichten, 1899. 
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DEFINITION OF POINT-WISE DISCONTINUOUS FUNCTIONS BY EXTENSION 

241. Let us suppose a function / (x) to be defined for a domain which 
consists of a set of points which is dense in itself, but not closed; and 
further let us assume that / (x) is continuous in this domain. The new 
domain obtained by adding to the original domain those of its limiting 
points which do not belong to it may be spoken of as the extended domain. 
It has been pointed out in § 222 that, at a point a, of the extended domain, 
which does not belong to the original domain, there is an aggregate of 
functional limits which is certainly a closed set, and may consist of a 
finite, or an infinite, set of numbers. 

Let us now define a function </> (x), for the extended domain, in the 
following manner:—At each point of the original domain, which may be 
called a primary point , let </> (x) = / (x); at each point a, which may be 
called a secondary point , and which does not belong to the original domain, 
attribute to (x) the values contained in the aggregate of functional 
limits of / (x) at a; this function </> (x) may then be multiple-valued at any 
secondary point. The new function cf> (x), defined for the extended domain, 
may be spoken of as the function obtained by extension of f (x); and those 
points for which f> (x) is multiple-valued are regarded as points of dis¬ 
continuity at which the measure of discontinuity is the excess of the 
greatest over the least value of the function at that point. 

It will be shewn that the extended function f> (x) is point-wise discon¬ 
tinuous in the extended domain , unless it be continuous. This gives rise to 
a method of constructing point-wise discontinuous functions which has 
been employed by Broden in various special cases. Since we may so choose 
the original domain that it shall consist of an enumerable set of points, 
the method includes one for the construction of a point-wise discontinuous 
function from an enumerable set of specifications. 

To prove that the extended function <f> (x) is at most point-wise dis¬ 
continuous, it is sufficient to shew that f> (x) is continuous at all points of 
the original domain G , which is a set that is everywhere dense in the 
extended domain O'. 

Let x be a point of G y and let it be the limiting point of a convergent 
sequence (x/, x 2 ', x 3 ', ...), of which all the points belong to G'. Consider 
the aggregate {<f> (x/), <f> (x 2 '), ...}, where <f> (x 1 / ), <f> (x 2 '), ... have any of the 
values which belong to the points x/, x 2 ', x 3 ', .... Now a point x n , of G , 
can be found such that | x n ' — x n | < rj n , and | <f> (x n ') — / (x n ) | < e n , 
where j ] n , € w are independent arbitrarily small numbers. If we take a se¬ 
quence of values of 77 , such that rj 1 > rj 2 > tj 3 > ..., with zero as its limit, and 
also a similar sequence of the e numbers, then the sequence (x l9 x 2 , x 8 , ...) 
has the same limit x as the sequence (x/, x 2 ', x 3 ', ...); and the aggregate 



323 


24i, 242] Point- Wise Discontinuous Functions 

{<f> (x/), <f> (x 2 '), ...} has the same limit as the convergent aggregate 
{/(Xi),/(x 2 ), viz./( x) or (j> (x); and thus the theorem is established. 

It will be observed that the values of <f> (x) at all the secondary points 
in an arbitrarily small neighbourhood of a secondary point a depend only 
on the values of / (x) in that same neighbourhood; it follows therefore 
that a is a point of continuity or of discontinuity of (f> (x), according as 
the aggregate of functional limits of / (x) at a consists of one number or 
of more. In the latter case the measure of discontinuity of <f> (x) at a is 
the excess of the greatest over the least of the numbers belonging to the 
values of <f> (x) at the point. 

It has been shewn (§ 240) that a point-wise discontinuous function 
can be so constructed that, at a given secondary point, the values of the 
function may be an arbitrarily assigned closed set. 

242. Although a class of point-wise discontinuous functions may be 
obtained by extension of a continuous function defined for a primary 
domain, dense in itself but unclosed, yet not every point-wise discon¬ 
tinuous function can be generated in this manner. 

Let / (x) be a point-wise discontinuous function in a domain which is 
either a continuum or a perfect set of points. 

Consider the function c f> (x), obtained by taking the values of / (x) as 
given only at its points of continuity, and extending this function to the 
complete domain, in the manner explained above. 

At each point of discontinuity of / (x) there is a saltus k f , and at that 
point the function </> (x), obtained by extending the set of values of / (x) 
at its point of continuity, has a measure of discontinuity which will 
be zero in case </> (x) be continuous at the point; but in any case the 
condition k^ & k f is satisfied, since, within any neighbourhood of the point, 
the fluctuation of <f> (x) cannot be greater than that of / (x). 

If 4 = 0 at any point of discontinuity of / (x), that point may be said 
to be a point of unessential discontinuity of the function / (x); and if 
kj, > 0, the point is one of essential discontinuity. 

Let now a function x (#) be defined for the whole domain as follows:— 
At every point of continuity of / (x), and at every point of discontinuity 
at which k f — jfc#, let ^ (a:) = 0; at each point at which k f >k^, let 

X (*) = */ - h- 

The function x (#) is not necessarily continuous at every point at which 
it is zero. At a point x x at which <f> (x) is continuous, the measure of dis¬ 
continuity of x ( x ) is , or x (#i); but this is not necessarily the case if 
<f> (x) be not continuous at x x . This function x ( x ) may be called a ^point- 
wise discontinuous null-function. 
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By subtracting from / (x) a function fa (x) which never exceeds, at 
any point x , in absolute value, the value of y (x)> we obtain a function 
fa (x), of which the measure of discontinuity is everywhere = k$ (x). 

The function* fa (x) may be spoken of as the most nearly continuous 
function associated withf(x). 

It thus appears that a point-wise discontinuous function can always 
be expressed as the sum of a point-wise discontinuous null-function and 
the most nearly continuous function associated with the given function. 

The latter function fa (x) has only those discontinuities which neces¬ 
sarily arise from the values of the given function at its points of continuity, 
and is independent of the parts of the discontinuities which arise out of 
the functional values of / (x) at the points of discontinuity. The null- 
function depends upon the unessential parts of the discontinuity of / (x). 

EXAMPLES 

1. f Let f (x) = 0, for x=0 , '» o~ » anf l tor all other positive and negative values 

7 T ZtT 077 

of x , let / (x) - cos ^ . The function <f> (x), associated with f (x)> agrees with cos ^ at every 
point except x—0 , where <j> (x) is represented by ( -1, +1). The measure of discontinuity 
k, is zero except at ■?-, ™ , ~ y ...» where k f — 1 , and at x =0, where k f — 2 ; the measure 
vanishes everywhere except at ;r=0, where k+= 2. The function x (z) vanishes except at 
-, ..., where it is 1 ; it vanishes at x — 0 , but is discontinuous at that point. 

77 ZtT 

2. f Let f (x) vanish except at the points ... ~ y ..., where f(x)~ 1 . The 

function <f» (x) is everywhere zero, and thus k$ is everywhere zero. The function fa {x) is 
everywhere zero, and k f is zero except at 0 , ... ^, ..., where k f - 1. In this case 

f(x)-fa(x)=f(x). 

3. | A point-wise discontinuous function / (x) can be constructed}: such that the function 
<f> (a:) may have, at a point x 0 , the values belonging to a prescribed closed set G y in accordance 
with Bettazzi’s theorem (§ 240). If G be unenumerable, choose an enumerable set O x , dense 
in G y and let g 1 , g %y g 3y ... be the points of G x . Take a set of intervals { 8 n }, where 8 n is 

(*o+^. x o + 2 n-l)> 

and define f (x) as follows:—in 8 j, 8 S , 8 5 , 8 ,, ... let /(x) =g 1 ; in 8 a , 8 e , 8 ^,... let f (x) =g % ; 
in 8 4 , Sjj, 8 * 0 , ... let /(a:) =g 3 ; in general f (x) ~g n , in the first free interval, and in every 
second of the following free intervals; further let / {x) —g lf for x & x 0 . The function / (x) 

* This definition is not in complete agreement with that of Schoenflies, see Bericht y vol. i, 
p. 134, to whom the term is due. Some erroneous statements of Schoenflies, in this connection, 
were pointed out and corrected by Hahn; see Monaishefte /. Math. vol. xvr (1905), p. 312. 

f See Hahn, loc. cit. 

| This is contrary to a statement of Schoenflies, see Bertcht y vol. I, p. 135. 



242, 243] Point- Wise Discontinuous Functions 


325 


is point-wise discontinuous, the points of discontinuity being x 0 and the points x 0 + ™. The 

function 4> (x) has two values at the points x 0 + ; and at x 0 it has all the values of G x > 

and therefore all those of G. 


FUNCTIONS OF BOUNDED VARIATION 

243. Let us suppose that the interval (a, b ), for which a function / (x) 
is defined, is divided into a number of parts 

(x 0 , xj (x l , x 2 ) ... (av-i. *r) ••• (*«-!,*«), where x 0 - a, x„ = b; 
these parts forming a net with closed meshes. 

Consider the sum 

l/(*i) ~fM | + |/(* 2 ) - /(*i) | + ••• + |/(*«) -/(*«-!) |. 
or 2 |/(a; r ) -/ |. 

r =-1 

r - 7 i 

// function be such that the sum 2 | / (# r ) — / (#,._!) | Zess 

r = 1 

<some fixed positive number, for all possible nets , the function f (x) is said to 
be of bounded * variation (a variation bornee) in the interval (a, b). 

The function / (rr) being of bounded variation, there must exist a 
number which may be denoted by V a b f (#), such that, for every net, 

2 | f(x r ) -/(a? r _i) | s V a b f(x), 

r -1 

and such that, if e be an arbitrarily chosen positive number, a net 

(«b, x i), ( x i> * 2 ), ••• (*»-!> X n)> 

where x 0 = a, x n = 6, can be so determined that 

2 |/(*r) ~/(*r-1) | > ~ e. 

r -1 

The number V a b f(x) which satisfies these conditions is defined to be 
the total variation of / (x) in the interval (a, b). 

In case the function is not of bounded variation in (a, b), V a b f (x) may 
be regarded as infinite, and it will be possible to determine a net, for which 

r S n |/(* r ) -/(*„-,) \>N, 

r -1 

where N is an arbitrarily chosen positive number. 

A function of bounded variation has only ordinary points of discontinuity . 

Accordingly the set of points of discontinuity of such a function is 
enumerable. 

♦ See Jordan’s Cours <TAnalyse, vol. i, p. 55. 
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To see that this statement is correct, let us suppose that, at a point f, 
in (a, 6 ), the upper and lower limits of the function on one side, say the 
right, have different values; thus let /(f 4 - 0 ) be >/(£ 4 - 0 ). If rj be an 
arbitrarily chosen positive number, in the neighbourhood of £ there is an 
indefinitely great number of points at which / (x) > f (f 4 - 0 ) — 77 , and 
also such a set at which f (x) < f (£ 4 - 0 ) 4 - 77 . It follows that an inde¬ 
finitely great number of non-overlapping intervals exists, in each of which 
the absolute value of the difference of the functional values at the end¬ 


points is 


>/tf+ 0)-/tf+ 0)-2iy. 


which is a positive number k , provided 77 be chosen to be less than 

H/rr+0) -/(£ 4- 0)}. 


A set of m intervals can be so chosen that, if their end-points belong 

r n 

to those of a net, their contribution to 2 | / (x r ) — f | is > tyik. 

r~ 1 

Since m may be taken as large as we please, it is clear that the function 
cannot have bounded variation. 

It is obvious that / (b) — / (a) = 2 {/ (x r ) — / ( 24 _i)}; now let the 

r -1 

terms in the sum on the right-hand side be divided into two parts, those 
for which / (x r ) — f (x^) is positive, and those for which it is negative. 

ThUS /(&)-/ («) = s, { / (*,) - / + 2, { / (x r ) - f (*,_,)} 

where, in 2 1? those values of r are taken for which f (x r ) — f (x r _ x ) is 
positive, and in 2 2 , those terms for which it is negative. 

If the function is of bounded variation, the value of 


S, {/ (X r ) ~ f - S 2 {/ (*,) ~ / (*,_*)} 

cannot exceed F 0 h /(x). Thus 

Sx { / (*r) ~ / (*r-l» * \ { Va* f (x) + /(*)-/ («)}, 

and - Z 2 {/ (av) -/ (*,.,» * J {F 0 -/ (*) -/ (6) + / (a)}. 
It follows that the two sums 


^ { / (x r ) - / (Zr-Jh 2 2 { / (av) - / (Zr-J] 
are both bounded, for all nets fitted on to (a, b). 

Choosing a net fitted on to (a, 6 ), for which 


r 2\f(Xr)-f(Xr- l )\> F.V (*)-€, 

we see that r ~ 1 

{/ (* r ) - / > i {Vaf (*) + /(&)“/ («) ~ 

- 2, {/ (x r ) - / (*_!» > i [V*f <*) -/ (6) + /(a) - «}. 
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It follows that the numbers 

i{V a b f (*) + / (b) - / (a)h i {V b f(x) - / (b) + / (a)} 
are the upper boundaries of 

{ / M “ / - S 2 { / (x r ) - / (X^)} 

for all possible nets fitted on to the interval (a, b). 

Denoting these upper boundaries by P b f ( x ), N a b f (x) respectively, 

we have Va / <*) = /V / (*) + N.* f (X), 

f(b)-f(a)^P a »f(x)-N a »f(x). 

The numbers P b f{x), ~-N a b f(x) may be called the total positive 
variation, and the total negative variation , of/ (x) in (a, b). 

The following definition* of a function of bounded variation is equi¬ 
valent to the one given above: 

If any set of non-overlapping intervals , finite, or indefinitely great , m 
number , 6e defined in (a, b), and the sum , or limiting sum, of \ f (ft) — f (a) |, 
/or all the intervals (a, ft) of the set , be less than some fixed number , inde- 
pendent of the particular set of intervals , the function f (x) is said to have 
bounded variation in (a, b). 

In the first place, we see that a function which satisfies this definition 
must satisfy the definition employed above. For we may take the set of 
intervals to be those of any net fitted on to (a, b). Conversely, if the first 
definition is satisfied by / (x), we may arrange any infinite set of non¬ 
overlapping intervals in order of descending magnitude of | f (ft) — f ( a ) | 
and any finite part of the set containing the first m of the intervals is part 
of a net in (a, b ); and thus 2 | f {ft) — f (a) |, for these m intervals, is less 
than V b f (x). As this holds whatever value m may have, we see that the 
second definition is satisfied by / ( x) % 

243 1 . If the function f (x) be not of bounded, variation (a, b), a system of 

r in 

nets can be fitted on to (a, b) such that the sum S ]} = 2 | / (x r ) — f (x r _ x ) |, 

r 1 

for a net (a, x 1} x 2 , ..., b), increases indefinitely , as D consists successively 
of the nets I) 1 ,1> 2 , • •• D„, ..., of the system of nets . 

If {A n } is a divergent sequence of increasing positive numbers, a net 
D x can be so determined that Sn x is > A 1 ; and a net A t can be so deter¬ 
mined that S Sl > A 2 ; taking D 2 to be the net (D ly A*) obtained by super¬ 
imposing the two nets D 1} A x , we have Sn 3 ^ > A 2 . We proceed to 

form a net Z) 3 or (Z) 2 , A 2 ), where A 2 is such that > A s ; then S Di > A 3 ; 
and so on. Thus the required system of nets D ± , I) 2 , D 3 , ... is determinate. 

If f (x) is not of bounded, variation in (a, b), and if a net D be fitted on 
to (a, b) y in one at least of the meshes of I) the function f (x) is not of bounded 
variation. 

* See W. H. Young, Quarterly Journal , vol. xm (1911), p. 57. 
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Consider the net D n , for which Sn n > A n , employed in the last theorem. 
The net (Z), Z) n ), obtained by superimposing l) n and Z), is such that 
S(D,D n )> A n . Now (Z), I) n ) consists of a sum of nets fitted on to the 
meshes of the net Z); hence there must be at least one of the meshes of Z), 

A 

for which the total variation of / ( x) is ^ n , where m is the number of 

J v 7 m 

meshes in I). This is applicable for each value of n ; hence there is at least 
one mesh of D such that, for an infinite set of values of n , the total varia- 

tion of f (x) over that mesh is ^ n . In such mesh of I) the variation of 

f (x) is unbounded. 

Every absolutely continuous function is of bounded variation. 

Let f> (x) be a function which is absolutely continuous in (a, b) (see 
§ 218). If (f> (x) is not of bounded variation in (a, b) y a system of nets 
with closed meshes can be fitted on to (a, b) such that the sum 

r i 

r 1 

for the net D n increases indefinitely, as n ~ oo. Since </> (x) is not of bounded 
variation in (a, b), there must be at least one mesh of D 1 , say d Pl , in which 
(f> (x) is not of bounded variation; this follows from the preceding theorem. 
Further, d Pl must contain at least one mesh d v? , of Z) 2 , which has the same 
property; and so on. For a large enough value of n y d Vn has a measure 
less than an arbitrarily chosen positive number. In d Pn , for each value 
of n there is a set of intervals for which the sum of the absolute differences 
of <j> (x) at the ends of an interval is > A, an arbitrarily chosen positive 
number. Hence a set of non-overlapping intervals, of total measure arbi¬ 
trarily small, exists such that the sum of the absolute values of the 
differences of <f> (x) at the ends of an interval is > A ; and this is contrary 
to the hypothesis that cf> (x) is absolutely continuous. Therefore <f (a;) 
must be of bounded variation in (a, b). 

If a function / (x) is not of bounded variation in (a, b), there must be 
at least one point £, in (a, b), such that, in no neighbourhood of £, is f (x) 
of bounded variation. For, consider a system of nets {Z> n }; in D 1 , there is 
at least one mesh d Pl in which / (x) has unbounded variation. In at least 
one mesh d P% of I ) 2 , contained in d Pl , the variation of / (a:) is unbounded. 
In this manner a sequence d Pl , d Pi , ... of meshes, each of which is con¬ 
tained in the preceding one, is determined such that, in each of them, 
the variation is unbounded. This sequence defines a point £ which has the 
required property; for any neighbourhood of £ contains d Pn , if n is suffi¬ 
ciently large. The point £ may be said to be a point of unbounded variation 
oif(x). 
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It is easily seen that: 

For any function f ( x ), of unbounded variation in (a, 6), there exists a 
set of points of unbounded variation which is closed , and may be finite. 

For a limiting point of the set of points of unbounded variation is such 
that any neighbourhood of it contains points of the set, and therefore 
contains intervals in which the function is of unbounded variation. 

FUNCTION OF BOUNDED VARIATION EXPRESSED AS THE DIFFERENCE 
OF TWO MONOTONE FUNCTIONS 

244. It is clear that, if x be any point in (a, b) y a function / (x) which 
has bounded variation in (a, b) has also bounded variation in (a, x); and 
consequently the positive, and the negative, variations of the function in 
(a, x ) are also bounded. 

The three numbers which represent these total variations may be 
denoted by V a x f (x), P a x f (x ), — N a x f(x); and they may be regarded as 
functions of x. 

They satisfy the relations 

V a 'f(x)^P a *f(z) + N a *f(x), 

/ (x) - / (a) - iV7 (x) - N'f (x). 

It is clear that, if x' > x , we have 

/V7W - Pa x f (*), N a *'f(x) * N a x f (x); 
and thus the two functions P a x f ( x ), N a x f(x) are monotone non-dimin¬ 
ishing functions, in (a, b). For simplicity, we denote these functions by 
P (x) y N (x) respectively. 

Since / (x) = {P (#) -f / (a) + k} — {N (x) -f k } 

and / (x) = {k’ - N (x)} - {k’ - P (x) - / (a)}, 

where k, k' are arbitrarily chosen numbers, we see that: 

Every function of bounded variation can be expressed , in an indefinitely 
great number of ways , as the difference of two bounded monotone functions , 
both of which are non-diminishing , or both of which are non-increasing . 

This theorem expresses the cardinal property of functions of bounded 
variation. It is of great value in Analysis, as, by means of it, properties 
of monotone functions can be immediately extended to the wide class of 
functions of bounded variation. 

The converse theorem holds that: 

The difference of two bounded monotone functions , both non-diminishing , 
or both non-increasing , is a function of bounded variation. 

For, consider such a function / (x) = P (x) — N (x), where P (x), N (x) 
are non-diminishing; then, in any interval (a, p)> we have 

|/08) -f(a) \*{P(fi-P (a)} + {N (0) - N (a)}. 
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Hence, in any net fitted on to (a, b ), the sum of the absolute values of 
the differences of the values of / (x) at the ends of the meshes (a, ft) is 
* £ {P 08) - P (a)} +X{N(f$)-N (a)} 
s P (b) - P (a) + N (b) - N (a); 
and thus the theorem is established. 

It has now been shewn that it is necessary and sufficient, in order that 
a function may be of bounded variation in the interval for which it is 
defined, that it can be expressed as the difference of two bounded monotone 
non-diminishing (or non-increasing) functions. 

The sum, difference, or product, of two functions of bounded variation is 
also of bounded variation . 

If f x (a?) — P x ( x ) — N x (x), f 2 (x) — P 2 (x) — N 2 (x), the theorem follows 
at once from the relations 

fl (*) +ft ( X ) = {Pl (*) + Pi (*)} - { N 1 (x) + N 2 (*)}, 
fi (x) -/* (*) = {Pi (x) + A 7 2 (a;)} - {Pi (X) + A 7 ! (x)}, 
fi (x)f t (x) = {P, (x) P 2 (x) + N, (x) N 2 (x)} - {P, (x) N 2 (x) + P 2 (x) N, (x)}. 

244 1 . If the interval (a, b) be divided into two or more parts, a function 
which is of bounded variation in each part is of bounded variation in (a, b). 
The total variation in (a, b) is the sum of the total variations in the parts. 

It is clearly sufficient to consider the case in which (a, b) is divided 
into two parts (a, c), (c, b). 

If / (x) is of bounded variation in (a, c) it is expressible in that interval 
as <f) X (x) — tp x (x), where <f> x (x), ip x (x) are the monotone non-diminishing 
functions / (a) -f P a x f (x), N a x f (x). In the interval (c, b) it can be expressed 
similarly in the form <f> 2 (x) - ip 2 (x), where (f> 2 (x), tp 2 (x) denote 
A+/(c) + P c *f(x), \ + N*f(x), 

respectively. We have/(c) ~ cf> 1 (c) — ifj x (c) = <f> 2 (c) — ip 2 (c); and we may 
choose A so that if/ x (c) = ip 2 (c). It then follows that <f> x (c) = <f> 2 (c); and 
thus <f> x (x) in (a, c), <f> 2 (x) in (c, b) define a single monotone function <f> (x) 
in (a, b) ; similarly the monotone function ip (x) is defined to be equal to 
ip x (x) in (a, c) and ip 2 (x) in (c, b). Since / (x) is, in (a, b), the difference of 
two monotone non-diminishing functions, it is of bounded variation in 
(a, b). 

The total variation in (a, c) is {<f> x (c) — <p x (a)} -f {ip x (c) — \p x (a)}; that 
in (c, 6) is {<f> 2 (b) — <p 2 (c)} -f {</r 2 (b) — tp 2 (c)}, and the sum of the two is 
{f> 2 (b) — <f> x (a)} -f {ip 2 {b) — (a)}, which is the total variation in (a, 6). 
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FUNCTIONS OF BOUNDED TOTAL FLUCTUATION 

245. If, in the net (# 0 , xf), (x lf x 2 ),... (x n _ x , x n ), where x 0 — a, x n = b, 
we take the sum of the fluctuations in the closed meshes of the net, viz. 

r n 

2 (U r — L r ), where TJ r , L r are the upper and lower boundaries of f (x) 

r 1 

in the closed mesh (x r _ 1 , x r ), then if this sum does not exceed some fixed 
positive number, independent of the particular net, the function / (x) is 
said to have bounded total fluctuation in the interval (a, b). If F a b f (x) be 
the number such that, for every net, 

r i n (U r -L r )^F a b f(x), 

r - 1 

and such that a net can be fitted on to ( a , b) for which 
r Z(U r ~ L r ) > F a b f (x) — €, 

r - 1 

where e is an arbitrarily chosen positive number, then F a b f (a;) is termed* 
the total fluctuation of / (x) in (a, b ); and it is finite for a function of bounded 
total fluctuation. Every discontinuity of / (x) is then ordinary (see § 243). 

It will appear that the class of functions of bounded total fluctuation 
is identical with that of functions of bounded variation. 

In order that a, function may have bounded total fluctuation it is sufficient 
that 2 (U r — L r ), when taken over the successive nets of a given system of nets, 
should have a definite limit. Moreover the limit for the system of nets is 
£ F a b f(x), and k \F a b f{x). 

This limit may be called the total fluctuation of / (x) for the particular 
system of nets. 

It is clear that, if an interval ( a , ft) be divided into two parts (a, y), (y , ft), 
the sum of the fluctuations in (a, y), (y, ft) cannot be less than the fluc¬ 
tuation in (a, ft). Hence, if we consider the successive nets D x , D % , ..., of 
a system of nets, the sum of the fluctuations in the meshes of D m is £ the 
sum in the meshes of D m+1 . Thus, as we proceed through the successive 
nets of the system, the sums of the fluctuations in the meshes of a net 
form a sequence of non-diminishing numbers which have a finite upper 
limit, unless they increase indefinitely. Assume the former to be the case. 
Let us now consider a second system of nets D x ', Df, ... D m ', — Suppose 
7) to be a number less than the breadth of all the meshes of D m '; and let 
the two nets D r , D m ' be superimposed to form a net (Z> r , D m '). We may 
suppose r so large that D r contains no mesh of breadth > rj. A mesh d r , 

* See Study, Math. Annalen , vol. xlvu (1896), p. 298. It was shewn by Study that the 
definitions of functions of bounded total fluctuation, and of bounded variation, are equivalent to 
one another. 
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of D ry with fluctuation F ri will contain not more than two meshes of the 
net ( D rt D m '), with fluctuations F ri , F rt \ we have then 

F r £ F ri -f F r9 z 2F r . 

Now every mesh of DJ is made up of meshes of D r and of meshes of 
( D ri D m '). Therefore the sum of the fluctuations in the meshes of D m ' 
is £ the sum of the fluctuations in the undivided meshes of D r together 
with the sum of the fluctuations such as F rj , F ri . It follows that the sum 
of the fluctuations in the meshes of D m ' cannot exceed twice the sum of 
the fluctuations in the meshes of D r . Since a number r can be determined 
so as to correspond to each number m, we see that the limit of the sum 
of the fluctuations in D m ', as m ~ oo, cannot exceed twice the limit of 
the sum of the fluctuations in D r , as r — oo. If X' be the former limit, 
and X the latter, we have X 's 2X . Thus, if X is finite, for any system 
of nets, it is finite for any other system of nets. 

Since a net can be so chosen that the sum of the fluctuations in its 
meshes is less than F a b f (x) by less than an arbitrarily chosen number, a 
system of nets can be so chosen that the limit of the sums for the nets is 
F a b f(x). It thus follows that, for any other system of nets, the limit of 
the sum of the fluctuations is ^ £ F b f (x), and v, F b f (#). 

246. Since the absolute value of the difference of the functional values 
at the end-points of an interval (a, jS) cannot exceed the fluctuation in the 

r - n 

closed interval, it is clear that the sum E | f (x r ) —f (x r _ x ) | cannot exceed 

r-1 

the sum of the fluctuations in the intervals. It follows that, if a function 
is of bounded total fluctuation, it is also of bounded variation. 

If a function have at no point an external saltus , the total fluctuation , and, 
the total variation , have one and the same value , when estimated as the 
limiting values over a system of nets ; and that value is independent of the 
particular system of nets. 

In accordance with this theorem, V b f (x) and F b f{x) are equal to 
one another, and are the limits of the sums 

2 |/(*r) -/(*r-i) |, 2 (ff r - Lr), 

over a net D n , of a system of nets, as n ~ oo. 

To prove the theorem, let F (r) , F {r) denote | f (x r ) —f(x r _ t ) |, (U r — L r ) 
for a mesh d r , of a net D r . We have clearly F (r) s F {r) . It will be shewn 
that for the net D r+S , where s is sufficiently large, EF (r+,) £ 2^ (r) , the 
summation referring to all the meshes of D r+S that are in the mesh d r , of 
the net D r . In the case in which U r , L r are the functional values at the 
ends of d r , we have U r — L r = | / (x r ) — / ( x |. When this is not the 
case, one at least of / ( x r ), / (^ r _x) is between U r and L r . If e be an arbi¬ 
trarily chosen positive number, there are two points of d rj say £ 2 , for 
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one of which the functional value is > U r — c, and for the other < L r -f c. 
There must exist a neighbourhood of on one side at least, at every 
point of which the functional value is > U r - 2c; for otherwise the func¬ 
tional limits at would be < U r — c, on both sides, and this cannot be 
the case if there is no external saltus at It follows that, if s be suffi¬ 
ciently large, there must be an end-point of a mesh of D r+S at which the 
functional value is > U r — 2c. Similarly we see that, if s be sufficiently 
large, there is an end-point of a mesh of D r + S , at which the functional 
value is < L r ~f 2c; and one at least of these end-points is not an end¬ 
point of the mesh d r . The number s can accordingly be so chosen that, 
for a certain number of meshes of the net D r+S which are in the mesh d r of 
D r , the sum of the absolute values of the differences of the functional values 
is > U r — L r — 4c. Thus the number «s* can be so chosen that the sum of 
the absolute values of the differences of the functions for all those meshes 
of l) r+a that are in the mesh d r is > U r — L r — 4c -f W ; where W is the 
absolute difference between the functional value at one of the end-points 
of d r and the functional value at one of those end-points of a mesh of 
D r \ s at which it is > U r — 2c or < L r -f 2c. If c be so chosen that W > 4c, 
(which can be done, because the functional value at the end-point of d r 
in question is not equal to U r or L r ), we see that the sum of the absolute 
differences of the functional values in those meshes of D r+S that are in 
d T is > U r —L r . Moreover s can be so chosen that this holds good for each 
mesh d T , of the net D r \ and thus, for such value of s, we have F (r+S) > F ir) . 
This inequality, combined with the inequality F (r) £ F {r \ shews that 
lim V ir) — lim F (rfs) ^ lim F (r) \ lim F (r) £ lim F lr \ 

r~oo go oo r~ao r~oo 

and thus that the two limits lim P (r) , lim F {r) have the same value, finite 

r~cc r~oo 

or infinite. 

In order to complete the proof of the theorem it must be shewn that 
the limit of the sum of the fluctuations in the meshes of a net belonging 
to a system of nets is the same for all systems of nets; there being as 
before no external saltus at any point. 

If possible, let the limit for a particular system of nets have a value 6r, 
less than F a b f (x). A net can be defined, for which the sum of the fluctua¬ 
tions in the m meshes is > 0 ; say G + a. Let this net U be superimposed 
on the net D r , of the system of nets; we may suppose r so great th$t not 
more than one of the end-points of the meshes of D' is in any one of the 
meshes of D r . Let us suppose that one of these end-points is x, and that 
it falls in the mesh 8, of Z> r , dividing it into two parts 8' and 8"; let F (8) 
be the fluctuation in 8. Since there is no external saltus, if 8, and conse¬ 
quently 8' and 8", are sufficiently small, we have 

**( 8 )- !/(* + <>)-/(* — 0 )| +G 19 
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where G x is less than the arbitrarily chosen number 77. Moreover, 8 ' and 8 " 
being sufficiently small, 

F (S') = | / (x) — / (x — 0) | + G 2 , where G 2 < 77, 

F { 8 ") = | / (x) - f (x + 0) | 4- £3, where G 3 < 77. 

Now r may be chosen so great, and consequently all the meshes of D r 
so small, that these conditions are satisfied for all the meshes of D'. We 
have then F ( 8 ') -f F ( 8 ") - F ( 8 ) — f- G 3 - since f (x) is between 
f (x + 0) and f (x — 0). Hence the sum 

XF (S') + XF (S") - XF (S) < 2 (m - 1) 77, 
the summation being taken for all those meshes which contain one of the 
m — 1 points x; moreover HF (S') -f Ej F (S") — EjP (S) £ 0. Therefore the 
sum of the fluctuations in the net obtained by superimposing I)' and D r 
is < G + 2 (m — 1)77. But the sum of these fluctuations is certainly 
+ and since 77 is arbitrarily small and independent of m, these two 
conditions are incompatible with one another; thus a must be zero. It 
follows that G cannot be less than F a b f (x). Thus, in every system of nets, 
the limit of the sum of the fluctuations in the meshes of a net is F a b f (x). 
It then follows, from the first part of the theorem, that the limit of the 
sum of the absolute differences of the functional values for the meshes of 
any net is V a b f (x), or F a b f (x). 

In case there be points at which there is an external saltus, the pre¬ 
ceding proof can still be applied to shew that the limit of the sums of the 
fluctuations in two systems of nets has the same value for the two systems, 
provided no point at which there is an external saltus be an end-point 
of a mesh in either system. Moreover, in case there be an external saltus 
at the point x, the value of F ( 8 ) is either | / (x) — / (x 4- 0) | -f G x or 
I / ( x ) ~~ f ( x ~~ 0 ) | + G x , and we see that 

F ($') + F (§") - F ( 8 ) « G 2 + G s - G x + « (z), 

where s (x) denotes the external saltus at x , and is equal to the smaller 
of the two numbers | / (x) — f (x + 0) |, | / (x) — / (x — 0) |. 

It is clear that, for a function with bounded total fluctuation, the sums 
of the saltuses on the right, and on the left, 

Z|/(* + 0)-/(*)|, Z|/<*-0)-/(*)|, 

for all the points of discontinuity, are finite, and there can therefore be 
only a finite number of points at which the external saltus exceeds an 
arbitrarily chosen number p. If we take these points, say n — 1 in number, 
to be the points x above, of D', we see that the sum of the fluctuations in 
the meshes of the net obtained by superimposing D ' and D r is 

XF ( 8 ) + S fi + y y 

where Sp is the sum of those external saltuses, all of which are greater 
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than ft; and y is the sum of the n — 1 numbers 0 2 -} 6r 3 — G lf and is 
therefore arbitrarily small. It thus appears that the total fluctuation for 
a system of nets such that no end-point of any of the nets is a point at 
which there is an external saltus is the number p (s \ F a b f (x)), the lower 
boundary of the limiting sum for all systems of nets. If those points at 
which the external saltus is > ft be end-points of meshes of nets of the 
system, the limit of the sum of the fluctuations in a net of the system is 
p + Sp. If a sequence of descending values be given to ft, the sum S$ 
converges to a fixed number, which is the sum, or the limiting sum, of all 
the external saltuses. Thus we have the following theorem: 

If a function with bounded total fluctuation be such that there are points 
at which the function has an external saltus , then the difference F a b f (x) — p 
between the upper and lower bou ndaries of the limiting sum of the fluctuations 
in the nets of a system is equal to the sum of the external saltuses. If a, system 
of nets be such that no point at which there 4 s an external saltus is an end¬ 
point of any mesh of any of the nets , for such a system the limiting sum of 
the fluctuations in the meshes of a net is p. If however every point at which 
there is an external saltus is an end-point of a mesh of nets of the system , the 
limiting sum is then F a b f(x). 

247 . It has now been shewn that, for a function without points at 
which there is an external saltus, the total variation and the total fluctua¬ 
tion over any system of nets are identical, being both finite, or both 
infinite, and that they are independent of the particular system of nets 
employed. 

If / (x) have an external saltus at each point of some set S, the total 
variation of / (x), when extended over a system of nets such that no point 
of S is an end-point of any mesh, is identical with the total fluctuation of 
that function <f> (x) which differs from / (x) only in having the functional 
values at the points of S so altered that the external saltus is at every 
point of 8 removed. The total fluctuation of <f> (x) is obtained by removing 
from p the total fluctuation of f (x) over such a system of nets, that is 
the number F a b f (x) — p which is the sum of the external saltuses. Thus 
the total fluctuation of <f> (x) is 2 p — F b f (x). If, on the other hand, we 
employ a system of nets such that every point of S is an end-point of 
meshes of the nets, the total variation and the total fluctuation of / (z) 
have the common value F a b f ( x). 

It thus appears that, for a function of bounded total fluctuation , which 
has points with an external saltus , the total variation over a system of nets is 
F a b f(x), or 2 p — F a b f(x ), or has some value between these two numbers , 
according to the particular system of nets employed. 

The necessary and sufficient conditions that a function / (x) may have 
bounded variation and bounded fluctuation in (a, b) are that: 
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(1) . The points of discontinuity must all be of the first species, i.e. 
f {x -f 0), / (x — 0) must everywhere exist. 

(2) . The sum of the absolute values of the external saltuses must be 
finite. 

( 3 ) . A system of nets must exist for which the sum of the absolute 
values of the differences of the functional values at the end-points of the 
meshes of a net must be less than some fixed number, for all the nets of 
the system. 

248 . The following theorem will now be established : 

The necessary and sufficient condition that a function f (x ) is of bounded 
total fluctuation is that it can be expressed as the difference of two bounded 
monotone functions , which are either both non-increasing or both nan- 
diminishing. 

A comparison of this theorem with that in § 244 , for functions of 
bounded variation, gives a second proof that the two classes of functions 
are identical. 

To prove the theorem; let F a x be the upper boundary of the total 
fluctuation in the interval (a, x). We then see that 

/ (x + h) - / (x) i F x +* ± F m *+* - F a x ; 
it follows that F a x — f (x) is a monotone non-diminishing function. The 
function F a x + / (x) has the same property, since 

/ (x + h) — / (x) 2 - F x x +» ^ F* - F a x +K 
Therefore, if <f> x (x) = J {F* + /(#)}, <f > 2 (x) - \ {F x -f(x)}, the function 
/ ( x ) can be expressed as the difference of the two non-diminishing mono¬ 
tone functions <f> x (x), cf > 2 (x). 

Conversely, let / (x) = <f> x (x) - <f > 2 (#); where <f> x (x), (f > 2 (x) are bounded 
non-diminishing monotone functions. In any interval, the fluctuation of 
/ (x) cannot exceed the sum of the fluctuations of <f> x (x) and <f > 2 (x). It then 
follows that the total fluctuation of / (x) in (a, 6) cannot exceed 
[<t>i ( b) - ^ (a)] + [<£ 2 (b) - fa (a)]. 


EXAMPLES 


1.* The function defined by / ( x) —x sin / (0) -0, is not of bounded total fluctuation 

in the interval (0, 1 /tt), although it is continuous in the interval. For, in the interval 

( 7T-- , - V sin * attains the value ( -l) r , and thus the fluctuation in this interval is 
r + lir rnj x 


at least equal to l/(r+I)7r. The total fluctuation in the interval (—, ~) is at least 

\8lT ITJ 

—7 +s ™7 +••• + —!—,4 or + and it is well known that this 

7r(l+I 2+J a-l+u 7r\3 5 2s -1/ 

increases without limit when 8 is indefinitely increased; therefore the total fluctuation in 
(0, 1 /it) is not finite. 


Lebesgue, Lemons sur Vintegration (1904), p. 56. 
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2. * The function defined by f (x) ~x 2 sin *.,,/(<)) -0, is continuous in any interval con¬ 
taining ^~0, and is everywhere differentiable, hut is not of bounded total fluctuation. 

3. f The function defined by J {x) x 2 sin (.r •'),/(0)=0, is of bounded total fluctuation 

in the interval (0, \frA). In the interval ( --- * V the function has a single nmxi- 

V -r Irrj'i (,n)V 

mum, or oInc a single minimum, and the absolute value of the function at thiH point is at 

, » l 

most l/(m)t. The total fluctuation in (0, I/Vt) cannot exceed 2S ... which is finite. 

' 1 fr7t) : i 


RESOLUTION OF A FUNCTION OF BOUNDED VARIATION 

249 . If j (x) have bounded variation in the interval ( a, b ), it can be 
expressed by /(«) -1 1 J (x) N (x), where P (x), -- N (x) are the positive, 
and the negative, variation respectively of / (x) in the interval (a, x). Let 
, f 2 > ••• be the points of discontinuity of P ( x ), and let 6* (x) denote 

thc Mu,n s {P a -\ o) -eg- o)}, 

X 

where the summation is taken for those points f that are in the closed 
interval (ci,x). Thus s (x) is a bounded non-diminishing monotone func¬ 
tion. Since the sum L {P (f j- 0) — P (£ 0)} is convergent, the sum can 

b 

be divided into two parts, the one a finite sum 
X {P (£ + 0) P (f - 0)}, and the other L {P (£ 0) - P (£ — 0)}, 

b, m. b , m ! I 

where m is chosen so great as to include the indices of all those points of 
the finite set at which P (f f 0) P ($ — 0 ) *> c and is also so great that 
the second sum is < t; where € is an arbitrarily chosen positive number. 
We also have s (x) — L {P (£ ■{■ 0) - P - 0)} -f rj (x); where, in the 

x. m 

first sum, only those of the set £ Y , £ 2 , ... occur which are in the closed 
interval (a, x) and where r) (x) y rj (b) < e. Let us consider the function 
P, (x) — s (:r), or P (a:) — 2 {P (f -f 0) — P (£ - 0)} — rj (x). At a point x> 

x, m 

at which P (x) has a saltus < e, the salt us of P (x) — 6* (#) is < 2 e; for the 
saltus of rj (x) cannot exceed e, and the function 2 {P (£ f- 0) — P (f — 0)} 

x, rn 

is continuous. At a point of the finite set at which P (x) has a saltus e, 
P (x) — D [P (£ -f 0) — P (£ — 0)] is continuous. Therefore the saltus of 

x, m 

P (#) — s ( x ) is everywhere < 2e. Since e is arbitrary, it follows that 
P (x) — s (x) is continuous in the closed interval (a, b). A similar result 
holds for the function N (x). Therefore / (x) is the sum of a continuous 
function <f> (x) } of bounded variation in (a, b), and of a function s (a;) — s' (x), 

* Lebesgue, Annali di Mat. (3 a), vol. vii (1902), p. 270. 
f Lebesgue, Lemons sur Vintegration (1904), p. 56. 



338 Functions of a Real Variable [oh. v 

which may be denoted by x (*)• The function \ (x) is of bounded variation, 
and has the same points of discontinuity as f (x). We have 

f( x ) = <f> (*) + X ( x )> 

where x ( x ) denotes 

2 [P (i + 0) — P (f - 0)] — 2 [N (( + 0) — N (i — 0)J. 

X X 

The result obtained may be stated as follows: 

A function of bounded variation in the interval (a, b) is the sum of a 
continuous function of bounded variation and of a, function of bounded varia¬ 
tion with the same discontinuities as the given function , of which the total 
variation is the sum of the saltuses of the given function at its points of dis¬ 
continuity. 


RECTIFIABLE CURVES 

250. Let f be a variable defined for all values in a continuous interval 
(t Q , t x ), and let f x (t),f 2 (t) be two single-valued and bounded functions of t , 
defined in the interval ( t 0 , t x ); the equations x ~ f x (t), y = f 2 (t) may be 
said to define an arc of a plane curve, in a generalized sense of the term. 
Let a net r 0 , r x , r 2 , ... r n , where r 0 = t, 0 , r w = t x , be fitted on to the interval 
(t 0i ^), then the points on the curve corresponding to the end-points of the 
meshes of the net may be denoted by P 0 , P x , ... P n ; and we consider the 
unclosed polygon of which the sides are P 0 P X , P X P 2 , ... P n - x P n , inscribed 
in the arc of the curve. The length of the polygonal line P 0 P X P 2 ... P n , 
measured by r « n 

2 {(x r - x r _ x f + (y r - y r ~i) 2 r y 

r- 1 

depends upon the particular net that has been fitted on to ( t 0 , t x ), and may 
be described as the length of a polygonal arc inscribed in the arc of the 
curve. 

If the arc be such that the lengths of all the inscribed polygonal arcs have 
a finite upper boundary , the curve is said to be rectifiable , and the length of 
the arc is defined to be the value of that upper boundary. Otherwise the length 
of the arc is regarded as infinite. 

This general definition is due to Peano*. An earlier definition given 
by Jordan!, and in the case of an arc defined by an equation y = / (x)> by 
ScheefferJ, is of a less general character, as it requires that the lengths of 
any sequence of polygonal arcs should converge to a fixed limit, inde¬ 
pendent of the particular sequence, provided the polygonal arcs correspond 
to a system of nets fitted on to (£ 0 , ^). It will be shewn that when the arc 

* Rend. Lincei (4), vol. vi, 1 (1890), p. 54. See also Lebesgue, Ann. di Mat. (3 a), vol. vn (1902), 
p. 282, where a more general definition is introduced. 

t Comptea Rendus , vol xon (1881), p. 228; and Cours d?Analyse, vol. m, p. 594. 

X Acta Math. vol. v (1884), p. 49. See also Study, Math. Annalen , voL xlvh (1896), p. 298. 
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is continuous, the definition of Peano is equivalent to that of Jordan and 
Scheeffer. 

The following theorem will be established: 

The necessary and sufficient condition that the arc defined by x = f x (t), 
y =f t (t) } for t Q &tst x , may be rectifiable is that the functions f x ( t ), / 2 (£) 
should be of bounded variation in the interval (t 0 , t x ). 

We see that 

x r-l | + | Vr ~ Vr-S |} S [{.X r - ^r-l) 2 + (Vr ~ Vr-lY^ 

£{\x r - X r . x | + | y t - Vr—l |}; 

by taking the sums of all the expressions for r ~ 1 , 2, 3, ... n, it is clear 
that the necessary and sufficient condition that the perimeter of the un¬ 
closed polygon should be less than some fixed positive number is that this 

r*n r«-n 

should also hold as regards S | x r — x r _ x |, and £ | y r — y r _ x |; and this 

r ~ 1 r I 

is equivalent to the condition stated in the theorem. 

From the known character of functions of bounded variation, it is seen 
that a curve that is rectifiable in the interval (t 0 , t x ) is also rectifiable in 
any interval contained in (t 0 , t x ). 

251 . In case the functions f x (t),f 2 (t) are both continuous in the interval 
(t Q ,t L ), it will be shewn that the lengths of the polygonal arcs of any 
sequence, such that the greatest side of a polygon converges to zero, con¬ 
verge to the length l of the curve, as above defined, whenever the curve 
is rectifiable. To establish this result the following theorem will be proved : 

If the continuous arc defined by x = f x (t), y = / 2 (t) } t 0 ^ t ^ be recti¬ 
fiable, and of length l, then, corresponding to an arbitrarily chosen positive 
number e, a positive number d can be determined, so that the perimeter of any 
unclosed polygon inscribed in the arc is > l — e, provided all the sides of the 
polygon are < d. 

The assumption that the arc is rectifiable, and of length l, implies that 
a net JD 0 can be fitted on to the interval (£ 0 , h), such that the length of 
the corresponding polygonal arc inscribed in the curve is > l — Jc. Let 
D be any other net fitted on to the interval ( t 0 ,t 1 ). The net ( D 0 ,D) 
obtained by superimposing the two nets D 0 , D, is such that the corre¬ 
sponding polygonal arc has a length > l — Since the functions f x (i t), 
f 2 (t) are continuous, if rj be a prescribed positive number, a* number 8' 
can be determined so that, in any interval of t less than 8', the fluctuations 
of f x (t), f z (t) are both less than 77 . We shall suppose D so chosen that all 
its meshes are of breadth less than 8'. Let m be the number of meshes in 
D 0 ; we consider the excess of the length of the polygonal arc corresponding 
to ( D 0 , D) over that which corresponds to D. Consider an end-point t u of 
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the meshes of J ) 0 , that is contained in the interior of a mesh of D ; we can 
suppose the meshes of D so small that only one such point can be con¬ 
tained in a mesh of D , and we also suppose all the meshes of 1 ) to be less 
than 8'. If t r , t rn be that mesh of 1 ), we see that the excess of the sum of 
the chords joining t r , t u and {t u , / r . fl ) over that joining t r , t r+J is 

<\x r -x»\ -I I x rtl -x u \ - I a> - .r, +t | I- | y r - «/„ | 

■I 1 2/r-u - y u | - ^2 I ^ ~ S'*"'I I 

and this is less than 4 rj. 

We now see that the perimeter of the polygonal line which corresponds 
to 1 ) is > l — - 4 :?nrj . 

If rj is so chosen that 4 ?nrj < |e, the length of the polygonal arc that 
corresponds to I) is > l - e. In order that this may be the case, I) has 
only to satisfy the condition that the maximum breadth of its meshes is 
less than some fixed number § (. 8'), and this condition will be satisfied if 
the greatest side of the corresponding polygonal line is less than some fixed 
number d. 

If s denote the length of the arc corresponding to (/ 0 , /), where t 0 < t ^ t 2 , 
s is a continuous monotone function of t; and it is clear that x and y may 
be regarded as functions of s, say x — <f> (s), y — i/j (s). Since Ax/ As, Ay/ As 
cannot be numerically greater than 1, all the derivatives of c/> (.s), (// (s) are 
bounded, and in the interval ( 1,1). It will be shewn in § 298 that, for 

all values of s in the linear interval (0, /), with the possible exception of 

dx dy 

those belonging to a set of measure zero, the differential coefficients ^ 

exist, and are finite. The above theory is applicable, without essential 
change, to the case of a “curve” in three-dimensional space, defined by 
x ~ ii (t)-> V ~ /a (0> z “ /s (0* 

The case of an arc of a curve y —f (x), defined for the interval (a, b ), 
of x, may be considered as the particular case of the above which arises 
when t is identical with x. We see then that, the necessary ami sufficient 
condition that the arc defined by y — f (x), a^x^b, may be rectifiable is that 
the function f (x) is of bounded variation in the interval (a, 6). 

EXAMPLES 

1 2 r + 1 

1. In* the interval (0, 1), let f {x)~^ 2n , when x is any of the points , and let 

/ (a:) =0, at all other points. In accordance with Peano’s definition, the length of the arc is 
00 2 «-i 

1+2 2 - 2 =2. According to the definition of Jordan and Scheeffer, the arc is not rectifi- 


* See Schoenflies, Bericht , vol. n, p. 243. 
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able; for a net can be fitted on to the interval (0, 1), such that all its end-points are repre¬ 
sented by irrational values of x, except the points 0, 1; the length of the corresponding 
polygonal arc is 1. 

2. An arc of the curve y ~x sin ^ which contains the origin is not rectifiable; but a 
similar arc of the curve y ~x 2 sin (jf4) is rectifiable. 

THE VARIATION OF A FUNCTION OF BOUNDED VARIATION 
OVER A LINEAR SET OF POINTS 

252 . Let <f> (x) bo a bounded monotone non-diminishing function, de¬ 
fined for the interval (a, 6), of x. If £ — <f> (#), the segment (a, b) has a 
functional image on a segment of which the end-points are <^> (a), <f> (b) 
respectively, on which the points £ are represented. If <£ (x) be continuous, 
the functional image consists of all the points of the interval (</> (a), </> (6)), 
of £; but if (j> (x) be discontinuous at a point x\ there are no points of the 
functional image in the closed interval (</> ( x ' - 0), (f> (x' + 0)) except the 
single point <f> (x'). We may now suppose that, to each point x, at which 
<j> (x) is continuous, there corresponds the point f ~ <f> (x) on the -segment, 
but that, to each point x\ at which (x) is discontinuous, there corresponds 
the closed interval (</> (x' - 0 ), cf> (x' 4 - 0 )) on the ^-segment. If G ix) denote 
any set of points in the ^-segment (a, 6), there is a corresponding set G 1 ^ 
in the f-segment; to any point of G {x \ at which $ (x) is discontinuous, 
there corresponds a component of < 7 (f) , consisting of all the points of a 
closed interval. If G ix) consists of all the points of a closed interval, G U) 
consists of all the points of a closed interval. Similarly, if G (x) consists of 
the points of an open interval, < 7 (f) consists also of the points of an open 
interval. It follows that, if G (x) is an open set, consisting necessarily of 
a set of open intervals, the corresponding set 6r (f) is open. Consequently, 
if G U) is closed, so also is < 7 (f) . It now follows that, if G ix) be measurable 
(B), on the ^-segment, G (n is measurable (B), on the ^-segment. If G U) 
be any measurable set, it is contained in a set Gj {x) , and it contains a set 
G 2 {x \ such that G^ [x) and G.f x) are measurable (B), and that the set 
G x (x) ~ G 2 {x) has the measure zero. The two sets G^\ G 2 (f) are both 
measurable (B), but it is not necessarily the case that — GJ n has 
measure zero. Unless G\ {x) and G 2 lx) can be so chosen that the measure of 
(7 1 (f) — 6r 2 (f) is zero, the set 6r {f) is not necessarily measurable. Even if the 
function $ (x) be continuous in (a, b), it is not necessarily the case that 
there corresponds a measurable set G ( ^ to any measurable set G ix K Only 
when <f) (; x ) is such that, to every x-set of measure zero, there corresponds 
a £-set of measure zero, is 6 ?(f) necessarily measurable when G {x) is measur¬ 
able. 

In general, there corresponds to a measurable set G {x) a set 6r (f) , with 
exterior and interior measures m e (< 7 (f) ), ((r (f) ). 
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The exterior and interior measures of the set G ( -\ on the segment , 
which corresponds to the set G, on the x-segment , are taken to define the upper 
and lower variations respectively , of the function <f> (x) over the set G; these 
may be denoted by F ((?) <f> (x), V (G) <f> ( x) respectively. In case 

F< G > 4 > (x) - V (0) <f> (x), 

their common value V (G) <f> (x) is said to be the variation of the monotone 
function <f> (x) over the set of points G. 

It is clear from the definition that 

V <*> <j> (x) + Y C < Q > fi(x)^<f> (b) - <f> (a). 

If G (x) be measurable (B), since G U) is also measurable (B), it follows 
that f> (x) has a definite variation over the set G (X K The variation of <f> (x) 
over a closed interval (a, ft), contained in (a, b), is <f> (ft -f 0) — </> (a — 0); 
and the variation over the open interval (a, ft) is </> (/? — 0) -- cf> (a f 0). 
In case a — a, we take <f> (a — 0) = <f> (a); and in case f 3 — b, we take 
<f> (b + 0) cf> (b). 

If the monotone function <f> (x) is absolutely continuous in (a, b), the 
£-set which corresponds to an x-set of measure zero is also of measure zero. 
For the x-set may then be enclosed in a set of intervals (a r , p r ) such that 
the sum of their measures is arbitrarily small, and thus such that the 
sum S | <f) (f 3 r ) — (fy (a r ) | is arbitrarily small. Hence the corresponding 
£-set has measure zero. It follows that, to any measurable set G (x) on the 
x-segment, there corresponds a measurable set 6r ( ° on the f-segment, of 
which the measure is the variation of <f> (x) over G (x) . 

If cf> (x) be a function of bounded variation, defined for the interval 
(a, 6), (f> (x) — <f) (a) is expressible as the difference of two bounded mono¬ 
tone non-diminishing functions (f> x (x) and (f > 2 (x), where <£, (x) and — </> 2 (x) 
are the positive and the negative variation of (x) in {a, x), (see § 244). 

The sum of the variations of </>j (x), <f > 2 (x) over a set G ix) , for which these 
variations exist , may be defined to be the total variation of the function <f> (x), 
of bounded variation , over the set G (x) . 

If the absolutely continuous function <f> (x) be not monotone, since it 
is of bounded variation (see § 243 1 ) it is the difference ^ (x) — <f> 2 (x) of 
two monotone functions, each of which is absolutely continuous. Therefore 
the variation of <f> (x) over any x-set of measure zero is zero. If e be any 
measurable set of points in (a, b ), the total variation of <f> (x) over e is the 
sum of the variations of (x) and </> 2 (x) over e ; and it may be regarded 
as a function $ (e), of the measurable set e. This function O (e) is com¬ 
pletely additive, in virtue of the property of the measures of measurable 
sets of points. 



252, 253 ] Functions of Two Variables of Bounded Variation 343 


FUNCTIONS OF TWO VARIABLES THAT ARE OF BOUNDED VARIATION 


253 . The definition of a function of bounded variation, in an interval , 
can be extended to the case of a function of two or more variables, defined 
in a given cell. It will be sufficient to consider the case of a function 
/ (:r (1) , # (2) ), defined in a closed cell (a a> , a (2) ; 6 (1> , b i2) ). Let a set of n -f 1 
points, in the cell, two of which are the points (a m ,a (2) ), (6 (1) , 6 (2) ), be 
considered; they may be denoted by 

(* 0 (1 >, *0 <2) ), V”), (Zr ll} , *r (2 >), (*n a> , *n< 2> ) J 


where 


o< 2 > - a (2) , x™ = b' 1 ' x™ = b {2) . 


Further, let the points be such that, for each value of r (= 1, 2, 3, ... n), 


and consider the sum 


r < 2 >- 

L r —1 * > 


T | / (x r (l \ x r l2) ) — / (x r _j (1 >, # r _ 1 (2) ) |. 

r = 1 

If this sum does not exceed some fixed positive number, whatever 
value n may have, and however the points are chosen, subject to the con¬ 
ditions above stated, the function / (# (1) , x (2) ) is said* to have bounded 
variation in the cell (a (1) , a (2) \ b a) , b (2) ). The upper boundary of the above 
sum may then be denoted by (s (1) , a; (2> ). 

A function (f> (x a) , x (2) ) is said to be a monotone non-diminishing 
function in the cell, if <f> (x {1) , x i2) ) £ <f> ( x a) ' , x l2)f ) for every pair of points 
(x (l \ x (2) ), (x {1) \ x {2) ') such that :r (1) ^ x {1), t x {2) > x {2) '. If 

x®)£<f>{:VP*', xW), 

for ;r (1) ^ x (2) sr x < 2) ', we have the definition of a non-increasing monotone 

function. 


If we consider separately those terms of the sum 

r i n {f(x r ">, x r W) -f{x r _™, x r ^)} 

r- 1 

which are positive and those which are negative, and denote the two parts 
respectively by , S 2 , we have 
/(6 (1) , 6< 2 >) —/(a (1) , a< 2 >) - Sj {/(*,«>, x r w) ~f{x r ^\ *,.,<«)} 

4 - S 2 {/ (x r il) , x r <») x r _^)}; 

further 




fix Xr-l' 2 ’)}, 


Z 2 {/(*,»>, */*>)-/(* 


are such that their sum cannot exceed (x a) , x n) ). 

It follows that and — S 2 have upper boundaries 

* n) ) +.f(b (V , b"') - f(a «» o«»)l. 


and 


* ,2) ) -/(6 (1) . b ««) +/(»«» ««>)], 


* See Arzel&, Bologna Rend. vol. ix (1904-5). 
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which may be denoted respectively by 


*< 2) ), and JCH 5 ; “”}/(*«“, * <2) ) 


These numbers may be called the total positive variation and the total 
negative variation of f (# (1) , a? (2) ) in the cell (a (1> , a (2) ; 6 (1) , /> (2) ). 


If we take any point (.r m , x {2) ) in the given cell, we may denote by 
P (x (1) , P 2) ), N (x"\x i2) ) 

the positive and the negative total variation of the function in the cell 
(a (1 \ a i2) ; x <n , x (2) ). We have t hen 

vftn’fulfix'", x< 2 >) - P (.r"\ a - * 2 ') + A T (*<‘>, .r l2 >)> 

/(*«>, * (2> ) —/.(a u) , a* 2 ') - A* (.*">, .r (21 ) - A? (x (1) , a; l3 >), 
and therefore, since the two functions P (a; (1) , x (2) ), A 7 (P 1 *, x {2) ) are clearly 
monotone and non-diminishing, we see that / (x U) , x {2) ) can be expressed 
as the difference of two such functions, or also as the difference of two 
monotone non-increasing functions. 


It is seen at once that any function which is the difference of two 
bounded monotone functions is of bounded variation in the cell. It has 
thus been shewn that: 


The necessary and sufficient condition that a f unction f (# (1) , x {2) ) should 
he of bounded variation in the cell (a, b) is that it should be expressible as the 
difference of two bounded monotone functions. 

If U r , L r are the upper and lower boundaries of/ (# (1) , x {2) ) in the cell 

r w 

(^ r _ 1 (1) , £ r V 2) ; x r {1) , x r (2) ), we consider the sum E (U r — L r ), where the 

r I 

n -f 1 points satisfy the same conditions as before. If this sum is less than 
a fixed positive number, however such a system of points be chosen, the 
upper boundary of the sum is called the total fluctuation of the function 

in the cell (a (1) , a (2) ; b {l) , b {2) ), and it may be denoted by F% m’ aW)f (x il) , x {2) ). 
The function is then said to be of bounded total fluctuation in the cell. 


We see that 


fix'" + h,u\x™ 


4- A (2 >) ~f(x {1 \ 


x (2) ) - F 


(x fI > 1 x<2>H A<V) 


jp(x (1 \ * (2 >) 

X a <*>)> 


1 I x< 2) i h<V) 

ancl 4* x („<!>, a u>) — 2 ( rt m, a <2>) , 

and that the function F[ao\aw] f (x {l) , x {2) ) is monotone and non-diminish¬ 
ing. As in § 248 , it thus appears that the necessary and sufficient con¬ 
dition that / (x (1) , x (2) ) should be of bounded total fluctuation is that it 
should be the difference of two bounded monotone functions. Comparing 
this with the corresponding theorem for functions of bounded variation, 
we see that the classes of functions of bounded variation, and of bounded 
total fluctuation, are identical. 



2 r> 3 , 254 ] Functions of Two Variables of Bounded Variation 345 


254 . A definition of functions of bounded variation, differing from that 
given in § 253 , has been employed by Hardy* and by Krausef. 

Let us consider the number 

(**»,*«>) ^ -/os<«>, «<») -/(«»>, p«*>) +/(«<*>, 

related to the cell (« n) , « <2) ; / 3 n) , /I {2) ), where / 3 (1) a (1) , / 3 <2) > a (2) . It is 

seen at once that, if a net be fitted on to the cell (a (1) , a (2) ; 6 (1) , 6 (2) ), 


A WWW) f/ m 


.oo), p<*>) 


right-hand side is taken for all the meshes 


where the summation on the 
(a m , a (2) ; / 3 U) , / 3 (2) ) which make up the cell (n (1 \ a (2) ; /> (1) , 6 (2) ). 


If the sum 1 ] | A^o f (x ( 1 \x (2) ) | is less than some fixed positive 
number, for all possible nets that can be fitted on to the cell (<? (1) ,a (2) ; 
5 m , 5 (2) ), and if further the function / (r (1) , ;r (2) ) is, for each value of :r (1) , a 
function of bounded variation with respect to .r (2 >, and for each value of .r< 2 ) 
a function of bounded variation with respect to x (1) , then f (x {1 \ x {2) ) is said 
to be a function of bounded variation in the cell. 


It has been pointed outl; by W. H. Young that there is a certain re¬ 
dundancy in the last two conditions. It will in fact be shewn that it is 
sufficient that f(x {l hx (2) ) should be of bounded variation with respecting, 
for one fixed value of ,r fl) , and with respect to ;r (1 ' for one fixed value of xS 2) ; 
it being assumed that the first condition is satisfied. 

If the first condition is satisfied, let l%?)f (x a \ x (2) ) be the upper 

, .r (2) ) | for all nets that can be 


boundary of the sum D | A^o>* !u!j/ (x {] 
fitted on to the cell (a {1 \ a i2) ; b {l) , b {2) ). If the 


£ be divided into two 


parts, Hi and X 2 , where X, denotes the sum of those terms for which A is 
positive, and S 2 denotes the sum of those terms for which A is negative, 


we see that i^A j (x 
daries 

(o ll \ f/< 


and 




H 2 A f(x (]) ,x (2) ) have finite upper boun- 




The two functions P {x a) , x (2) ), X (x {X) , :r (2) ), defined as 


P(U<'\ (x { \ 


... . 

h (a (1 \ r/< 2 ') j (x { \ 


respectively, are monotone functions, in the sense that, if ;c (1) 
s< 2 > ^ .r< 2 >, then P (x {1 \ x (2) ) - P x {2) ), X (x"\x {2) ) X (>">, .r (2) ). 


We have 

/(x< 1) ,a?< 2) ) =-/(a (1 ) ,a (2) ) t f (x"\a {2) ) \f(a {l \x^) [ P (xX\ x X2) ) 

- X (x">, x X2) ). 

Assuming that / (x n \ a i2) ) is a function of a; (1) , of bounded variation in the 


* Quarterly Journal of Math . vol. xxxvii (190B), j). 57 cl scq. 

j- Leipziger Her. vol. i,v (1903), pp. 1 04, 239; sec. also Vergerio, ( hor , di Mat . vol. xu\ 
(1911), p. 181. 

t Proc. bond. Math. Hoc. (2), vol. xi (1913), p. 142. 



346 


Functions of a Real Variable [ch. v 


interval (a (1) , 6 (1) ), it is equal to p (£ (1) ) — n (:r (1) ), where p (x a) ), n (x (1) ) 
are monotone functions of x {1) . Similarly, assuming that/(a (1) , x i2) ) is of 
bounded variation in the interval (a (2) , 6 <2) ), of x i2) , it is representable as 
the difference £>' (x (2) ) — n' (x {2) ) of two monotone functions of x {2) . 

If P (x a \ x {2) ) - P (x"\ x (2) ) + P (X<V) + p' (*< 2 >), 

N (x™, x < 2 >) - N (x«\ x <*>) + n (x (1 >) + n' (x {2) ), 
we have / (x<», x < 2 >) = P (x il \ x (2) ) - N (x^\ x< 2 >) - f (a™, a< 2 >), 
where P, iY are monotone non-diminishing functions. It has now been 
shewn that a function / (x {1) ,x {2) ) of bounded variation, in accordance 
with the definition here given, is expressible as the difference of two bounded 
monotone functions. It is therefore of bounded variation, according to the 
definition in § 253. 

The definition here given of a function of bounded variation is less 
general than that of Arzela, given in § 253. An example has been con¬ 
structed* by Kiistermann, of a function which is of bounded variation 
in accordance with Arzela’s definition, but not in accordance with that 
here considered. 


The variation of f(x (1 \k {2) ) for any fixed value k (2 \ of x (2) , clearly 
does not exceed the sum of the variations of f (x a \ a <2) ), of P (# (1) , k {2) ), 
and of N (x (1) , k (2) ). It is therefore bounded, on the assumption that the 
variation of f(x (1) ,a (2) ) is bounded. Similarly, on the assumption that 
/ (a (1) , x (2) ) is bounded, that of / (k (1> , x <2) ) is bounded, for any fixed value 
k a) , of a; (1) . In each case the converse holds good. 


If a function </> (x a \ x i2) ), defined in the cell (a (1) , a (2) ; 6 (1) , b {2) ) be such 
that, for any finite, or infinite, set of non-overlapping cells (any two of 
which may however have portions of their boundaries in common), the sum 


2 I a! 


0M l >, £r< 2 >) 
(a r O), « r <*>) 


<f> (x (1) , x (2) ) I 


is less than an arbitrarily fixed positive number 77 , for all sets of such cells 
which satisfy the condition that the sum of their measures is less than 
some positive number the function </> (x (1) , x {2) ) is then said to be 
absolutely continuous in the cell (a {1) , a {2) ; b a) , b {2) ). The proofs of the 
theorems given in § 243 1 , for the case of linear sets, can be applied, if 
two-dimensional nets are employed instead of linear nets, to prove that 
an absolutely continuous function cf> (: x ll) , x {2) ) is of bounded variation, in 
accordance with the definition of Hardy and Krause, provided <£ (x a \a {2) ), 
<f> (a (1) , xW) are of bounded variation. It may be remarked that the whole 
theory given here is applicable by extension to the case of p-dimensions, 

where A(«(»))«<p!)/ (^ (1) , x {2) , ... x ip) ) is defined as 

... *<*>). 


A( a (p)) . Z\( 0 ( 1), a<*>...a<P-D)/ (X {1) , X {£) 


* Math. Annalen , vol. Lxxvn (1916), p. 474. 
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Thus, for example, 
e' 1 *,*' 

1/ OS' 1 *, p™) - / (jS' 1 *, «< 2 >, /3 (3) ) - / (a (1 >, >3‘ 2 >, /9< 3 >) 


ajs:!:s; : s:3 /<*<», *<» - as;. a$;S3/<*<”, *<» *»>) 


+ /(a«> > a«),j8(»> ) ] 

~ [/ (£ (1) , £ ,2) , a< 3) ) - / (p a) , a (2) , a (3) ) - / (a (1 >, £< 2 >, a< 3) ) 

+ /(a (1) , a <2) , a w )]. 


QUASI-MONOTONE FUNCTIONS 

255. A function / (# (1) , # (2) ) may satisfy the condition 

A(x<i)! r<«>j/ (# (1) , # <2) ) £ 0, for x {1) s ar ( V, 7; (2) £ # <2) , 
and yet / (x (l) , x {2) ) need not be monotone and non-diminishing with 
respect to x (2) , for each constant value of x n) \ nor need it be monotone 
and non-diminishing with respect to x a) , for each constant value of x {2) . 
For the expression may be written in either of the forms 

{/(X' (1) , jr<») -/(*<” 7 (2) )} - {/(*<«, x ' 2 >) -/(*<«, * (2 >)}, 

{/(7<i>, .r< 2 >) x' 2) )} - {/(*«>, .r< 2 >) -f(x {1 \ x' 2) )}. 

The expression in any one of the four brackets may have either sign, con¬ 
sistently with the condition stated, except that the first bracket in either 
case cannot be negative whilst the second is positive. 

We may consequently consider four types of functions / (x {1) , x {2) ) which 
satisfy the condition A^IJ’x<«>j/(# (1) , x {2) ) £ 0, for x {1) ^ x a) , x (2) £ x {2 \ viz.: 

(1) . Those for which f (x {1) , x {2) ) is monotone and non-diminishing with 
respect to x (1) , for every constant value of x (2) ; and is also monotone and 
non-diminishing with respect to x {2) , for every constant value of x (l) . 

(2) . Those for which /( x a \x (2) ) is monotone and non-diminishing with 
respect to x a) , for every constant value of x {2 \ and is monotone and non¬ 
increasing with respect to x i2 \ for every constant value of x ll K 

(3) . Those for which / (x {1) , x {2) ) is monotone and non-increasing with 
respect to a; (1) , and is monotone and non-diminishing with respect to x (2) . 

(4) . Those for which / (£ (1) , x {2) ) is monotone and non-increasing with 
respect to x {1 \ and is monotone and non-increasing with respect to x {2) . 

A function / (x {1) , x i2) ) which satisfies the condition A/ ( x a) , x {2) ) 0, 
for every pair of points, or else the condition A f (x n \ x {2) ) £ 0, for every 
pair of points, and also one of the four conditions given above, everywhere 
in its domain, may be said to be a quasi-monotone functian of one of the 
four types which correspond to the conditions (1), (2), (3), (4), respectively. 

It is easy to see that this definition can be extended* to the case of 
functions of any number of variables. 

* See W. H. Young, Proc, Lond. Math, Soc. (2), vol. xvr (1917), p. 2J77. The term “monotone” 
is there applied to a function described above as quasi-monotone. 
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Since 

f(x^,x^) = A/ (x il) , x {2) ) -f / (x {1) , a (2) ) +f(au\x { *>) - /(fl (1 >fl (2) ) 
it is seen that every quasi-monotone function is the sum of a monotone 
function of the two variables, and of monotone functions of the separate 
variables, such that any one of the three may be either non-increasing or 
non-diminishing. The converse does not however hold; in fact a function 
can be monotone without necessarily being quasi-monotone. 

For example, let the monotone function f(x il) , x i2) ) be defined in the 
cell ( -- 1, -- 1; l, L) as -f J) (xM -{- 1), or (r (1) -f- 2) (x^ + 2), according as 
x {1) -b x {2) is <0, or ^ 0. It is then seen that A f(x {1) , x {2) ) is positive over 
(0, 0; 1,1), and is negative over (0, — 1,1); thus the function is not 

quasi-monotone. 

THE MAXIMA, MINIMA, AND LINES OF INVARIABILITY OF CONTINUOUS 

FUNCTIONS 

256 . Consider a point x x within the interval (a, b), in which a con¬ 
tinuous function is defined; it may happen that a neighbourhood (x L — 8, 
Xi -f 8) of the point x x can be found, by taking 8 sufficiently small, which 
is such that f (x) has the same value at all points in the neighbourhood; 
then the point x x is called a point of linear invariability of the function. 
If the same holds for a neighbourhood of x v on the right only, or on the 
left only, then the point x l is called a limiting point of linear invariability. 

It can be shewn that, if a point x } of linear invariability exist, and the 
function be not constant in the whole interval (a, 6), then there exist two 
limiting points of linear invariability, one of which, however, may be at 
one of the ends of the interval (a, b). Suppose the function not to be 
constant throughout the interval (x l9 6); the points x of this interval may 
be divided into two classes, in one of which x is such that, in the interval 
(a? ls x ), the function has the constant value f (x 2 ), and in the other class 
x is such that (x x ,x) contains points at which the function has values 
differing from / (x x )\ a section is thus made of the interval (x j9 b). This 
section defines a point which is the required limiting point of the linear 
invariability. If the same argument be applied to the interval (a, x x ) we 
see that there is another limiting point in this interval, unless the function 
be throughout equal to/(aq). 

In the interval (a, b) there may be a finite set, or an indefinitely great, 
but enumerable, set of lines of invariability; each point within such a 
line is a point of invariability, and the ends of such lines are limiting points 
of invariability. 

If the point x x be not a point of invariability, it may happen that a 
neighbourhood (x x — e, x y -f- e') exists, such that, for every point in the 
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interior of this neighbourhood, not identical with x l , the condition 
/ (#) < / (x x ) is satisfied; in this case x l is said to be a point at which the 
function has a 'proper maximum. In case the neighbourhood be such that 
at every point x within it, except at x l} the condition f (x) > /(•*i) is 
satisfied, the point x x is said to be a point at- which the function has a 
proper mi nimum. 

It may happen that, when x l is not a proper maximum, a neighbour¬ 
hood (x x — 6, x 3 -f €■') exists which is such that at no point within it the 
condition / (x) > f (x x ) is satisfied, whilst not at every point is the condition 
/ (x) < f (Xi) satisfied. If this is the case for every neighbourhood interior 
to (x x — e, x x + e') then x x is said to be a point at which there is an im¬ 
proper maximum of the function. Jf the condition / (x) £ f (x x ) is satisfied, 
but the condition / (x) > f (x x ) is not everyw here satisfied, then x x is said 
to be a point at which there is an improper minimum of the function. 

A line of invariability, of which the end-points are a, f 3 , these being 
both interior to (a, b), is said to be a maximum of the function, if both 
«, fl be improper maxima, and it is said to be a minimum, if both «, p be 
improper minima. 

It is clear that, in any arbitrarily small neighbourhood of an improper 
maximum or minimum, there is an indefinitely great number of points at 
which the functional value is equal to that at the maximum or minimum. 

At any maximum or minimum there is a greatest neighbourhood 
(x ± - 8, x x -f S') at every interior point of which the condition 
/ (x) < f (x x ), f (x) < / (a:,), or / (x) > f (x t ), f (x) > f {x x ) 
is satisfied. At end-points of such greatest neighbourhood, it follows from 
the condition of continuity of the function, that the functional value is 
equal to / (aq), unless such end-point coincides with a or with b. 

It has been shewn in § 213 that there exists either one point, or a set 
of points, in (a, 6), such that the functional value at this point, or at all 
the points of the set, is greater than at all other points in the interval; 
and it is to be remarked that this set of points may contain lines of in¬ 
variability. Every such point, unless it be an end-point, is said to be a 
point of absolute maximum of the function in the interval (a, 6), and may 
be either a proper or an improper maximum. A similar definition applies 
to an absolute minimum. 

In case an extreme point of the continuous function (see § 209 ) be at a, 
or at b, such point is spoken of as an upper or lower extreme, but not 
always as a maximum or minimum of the function. If / (a) and / (b) be 
equal, and the function be not constant in (a, b), then there is at least one 
maximum or one minimum point, or one line of invariability, in the interior 
of (a, b). This is also true when (6), unless the function be monotone. 
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257 . If, within the interval (a, b), there be two points , or two lines of 
invariability , at which the continuous function is a maximum , proper or 
improper , then there is between them at least one point, or one line of in¬ 
variability , at which the function is a proper or improper minimum; thus 
maxima and minima occur alternately. 

Suppose that a, ft are two points at which the function is a maximum, 
and that ( a , ft) is not entirely a line of invariability ; also that no maximum 
occurs between a and ft. We know that, between a and ft, there is a point, 
or a set of points, at which the function is less than at all other points in 
the sub-interval; and since a and ft cannot belong to such set, there is 
therefore a minimum at a point, or at points on a line of invariability, 
between a and ft, and this minimum is less than either of the maxima at 
a and ft. 

Between a maximum and the next minimum of a continuous function the 
function is said to make an oscillation, the amplitude of which is the excess of 
the maximum over the minimum. 

If x x be a point in ( a , b), it may be possible to choose e so small that, 
within the interval (x x , x x -f- e), no maxima or minima occur, so that the 
function is monotone in this interval. It may however be the case that, 
however small e is taken, there still occur maxima and minima in 
(x x , x x + e). In this case the number of oscillations of the function must be 
indefinitely great, however small e may be chosen; for if there were a finite 
number only, a number e x could be found such that all the maxima and 
minima were in the interval (x x + € lf z x + e), and thus in (x x , x x -f €j) the 
function would be monotone, which is contrary to the hypothesis made. 

It thus appears that, in the neighbourhood of a particular point, a 
continuous function may have an indefinitely great number of oscillations. 
An improper maximum or minimum, not in a line of invariability, is 
certainly such a point. 

The proper maxima and minima of a continuous function form an 
enumerable, or a finite, set of points. 

Consider (x x — e, x x *f rj), the greatest neighbourhood of a point of 
proper maximum x x , which is such that, for all other points x within the 
neighbourhood, f (x) < f (x x ). There can, in a finite interval, be only a 
finite number of such points x x for which e > a, rj > a, where a is a fixed 
positive number; for if there were an infinite number of such points, they 
would have a limiting point and we could choose two points, xf, x x " 
of the set, such that the distance of each from f is less than \a ; then each 
of these points would lie within the neighbourhood belonging to the other, 
and thus we should have f (z x ) >f(x i "), and also /(#/') >f(x l ') i which 
is impossible; thus the set must be finite. Now choose a sequence of 
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descending values of a which converges to zero, say a l9 a^, ... a n , ...; the 
number m n of maxima x x such that for each e > a n , rj > a n being finite, 
we have m ly m 2 , ... m n , .... all finite; and hence the whole set of maxima 
forms an enumerable set. 

If x x be an improper maximum point, and / (aq) = A, then a neigh¬ 
bourhood (x x — e, x x -f~ rj) can be found which contains an infinite set of 
points G a such that/ (x) = A, for each point of the set. If z' be an isolated 
point of the set G A , then x' is clearly a proper maximum of the function; 
and if x" be a point of G A , which is a limiting point of the set, x" is an 
improper maximum. The points x x — €, x 1 f rj need not be maxima, even 
though they be limiting points of 0 A . The condition of continuity of the 
function ensures that the set G A is a closed one; for, at any limiting point 
of the set, the functional value is the limit of a sequence, each member 
of which is A, and this value is therefore itself A. 

Corresponding to a given A , there may be a finite, or an infinite, set 
of detached intervals such as (x 1 — c, x 1 -f rj), each one of which contains 
a closed set such that each isolated point of it is a proper maximum, and 
each limiting point (except an end-point) is an improper maximum. The 
set G a may contain perfect components, and thus the improper maxima 
at which A is the functional value may form a set of the cardinal number 
of the continuum. A similar result holds for minima. 

It can further be shewn that the values of a continuous function at all 
its maxima and minima form a set which is either finite or enumerably 
infinite. 

258 . If, in the interval (a, 6), the function have only a finite number 
of maxima and minima, counting any line of invariability which is a 
maximum or minimum as one maximum or minimum, the interval can 
be divided into a finite number of parts in each of which the function is 
monotone; the function is then said to be* in general monotone (abtheil- 
ungsweise monoton). 

If the function have an indefinitely great number of maxima and 
minima, which occur either at points or at lines of invariability, the 
function then makes an infinite number of oscillations; and these may 
occur in the neighbourhoods either of a finite number of points, or of an 
infinite number of points. 

It can be shewn that, in the case of a continuous function, although 
there may be an infinite number of oscillations of the function, there can 
be only a finite number of which the amplitude exceeds an arbitrarily 
small fixed number a. 

* This term is due to C. Neumann; see his work Ueber die nach Kreis- Kugel- und Cylinder - 
functionen forlschreilenden Reihen , Leipsic, 1881. 
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For it has been shewn in § 217 that a number e can be determined, 
such that, in any sub-interval of length e, the fluctuation of the function 
does not exceed <r; therefore in each of the sub-intervals 
(a, a + e), (a -f e, a f 2c), ... (a + ne, 6), 
the fluctuation of the function is not greater than a. It follows that no 
oscillation of the function which is greater than <j can be completed in 
one of these sub-intervals, and that such an oscillation must require* two 
at least of these sub-intervals for its completion; hence the number of 
such oscillations in (a, b) cannot exceed the finite number u. As the number 
cr is diminished indefinitely, it may happen that the number of oscillations 
of which the amplitude exceeds a is increased indefinitely. 

If the point a is an isolated point of discontinuity of a function / (r). 
and if f (a - f-0), f (a -| 0) be unequal, there is an infinite number of 
oscillations in the interval («, a f e). This may also be the case if f (a j 0) 
have a unique value. 


THE DERIVATIVES OF FI-NOTIONS 


259 . If a function / (j?) be defined for all points in the interval ( a , h ), 
then, for a point x x in this interval, we may regard flu* function 

fW-JM 

X - JT X 

as a function F ( x ), of x, which is defined for all values of x in (</, b), except 
for the point . This function F (.r), although undefined at the })oint ,r,, 
has finite or infinite functional limits at that point, in accordance with 
the definitions in § 220. 


If the limits F (x } -f 0), F (x 1 — 0) both exist, and have the same value, 
either finite, or -f- go, or — go, this value is called the differential coefficient 
at x x of the function f (x). At the point a , if F (a -f 0) exists, it is frequently 
said to be the differential coefficient of f (x) at a ; and at the point b. if 
F (b — 0) exists, it is said to be the differential coefficient of f (x) at b. 

The condition that / (x) may possess a finite differential coefficient at 
x x is that, corresponding to each arbitrarily chosen positive number e, a 
neighbourhood (x x — 8 , x x f S) can be found, sucli that 

_/ (H ~ 

£ - £' - ^ I 

for every pair of points £, £' which lie within this neighbourhood, or within 
such part of it as is interior to (a, b). 

In other words, the condition is that a neighbourhood of x x can be 


found such that the fluctuation of the function 


/ (*) - / (*i) 


X X-t 


within it, or 


within such part of it as lies in (a, 6), may be as small as we please. 
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jr>3 


If, corresponding to an arbitrarily large positive number A, a neigh¬ 


bourhood (tfj - 8, x x (•- 8) can be found, such that ^ ^ ^ > /I, for 

r — 

every point x (+ x x ) in that neighbourhood, / (r) has the differential coeffi¬ 
cient f 00 at the point x x . 

When a differential coefficient of / (x) exists at the point the 
function is said to be differentiable at j \; and the differential coefficient at 
that point may be denoted by/' (.rj. 

In geometrical language, when the function f (x) has a differential 
coefficient at Xj , the curve 1/ f (x) is said to have a tangent at the point 
(r,,/ (x x )). In case the differential coefficient is j oo, or 00, the tangent 
is parallel to the y- axis, and the point is a point of inflexion of the curve. 

That a function f (x) may have a finite differential coefficient at x ls it is 
necessary, but not sufficient , that x x should be a point of continuity of the 
function. 

At a point of discontinuity .r,, of f (x), there always exists a positive 
number o\ such that in any neighbourhood of x u however small, points £ 
exist such that |/(f) — f (x x ) | > a; hence if A be any arbitrarily great 
positive number, in the interval (r, 8, x 1 |-8), where 8 < ajA , there 

exist points £ such that 


/(f) 

t 


f Ui) 


A , 


fix) _ f (x ) 

and it is thus impossible that * should have a definite finite 

limit at x x . On the other hand, the condition for the existence* of a finite 

/ (*r) f (*r ) 

differential coefficient, viz. that J ' v * l - should have an arbitrarily 

r - x x 

small fluctuation within a sufficiently small neighbourhood of x x , is not 
necessarily satisfied when the (‘ondition of continuity, viz. that / (r) should 
have an arbitrarily small fluctuation within a sufficiently small neighbour¬ 
hood of x x , is sat isfied. 

If, at a point x , the function / (x) have a finite differential coefficient 
/' (x), or have an infinite differential coefficient of fixed sign, an interval 

(x — 8 t , x -f 8 e ) can be so determined* that the ratio ^ / (^i) 

x 2 — x x 

between f (x) } e and /' (x) — <r, w here x i is any point in the interval 
(x — 8 e , x ), and x 2 is any point in the interval (x, x -f 8 e ); or that the same 
ratio is > N, an assigned positive number, if /' (x) ----- b 00, or < — N, 
iff (x) = - co. 

A continuous function / (x), defined for the interval (a, b), which has 
a differential coefficient at every point of the interval, is said to be differ - 


See Bromwich’s Theory of Infinite Series , p. 490. 
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entiable in its domain. Continuous functions exist, which at no point in 
their domain possess a differential coefficient. The first example of such 
a function was given by Weierstrass; the construction of such functions 
will be considered in Vol. II. 

That a continuous function possesses a differential coefficient was 
formerly regarded as obvious from geometrical intuition, it being supposed 
that such functions were necessarily representable by curves possessing 
definite tangents at every point. The first attempt to prove the existence 
of a differential coefficient of a continuous function was that of Ampere*; 
this proof was, however, insufficient, even in the case of those continuous 
functions which make only a finite number of oscillations in the intervals 
for which they are defined. It is now fully recognized that the class of 
continuous functions is much wider than that of functions capable of an 
approximate graphical representation; and that the conditions for the 
existence of definite differential coefficients are of a much more stringent 
character than would be the case if they were included under the bare 
condition of continuity of the function. 

f lx\ f (x, j 

260 . It may happen that, at a point x ly the function J ^ x % 

possesses finite, or even indefinitely great, limits on the right and on the 
left, at Xj , which differ from one another; the function is then said to have 
derivatives, on the right, and on the left , at . These are frequently spoken 
of as the 'progressive and regressive differential coefficients, or derivatives, 
respectively. A function may possess a progressive derivative and no re¬ 
gressive derivative, or the reverse. 

When, at the point x lf the function / (x) has no differential coefficient, 
and neither progressive nor regressive derivatives, the function 

/(*) 

x — x x 

has, in accordance with § 220, four extreme limits, an upper, and a lower 
one, on the right, and on the left. Any one of these four may be either 
finite or infinite. 

There is in general an aggregate of functional limits, on the right, of 
and also an aggregate of functional limits on the left. The 

X — iTj 

upper and lower limits, on the right, which are the upper and lower 

boundaries of the aggregate of functional limits of — -- ■ 1 at x 1 , on 

the right, are defined to be the upper and lower extreme derivatives of f (x), 
at x lt on the right , or simply, the upper and lower derivatives on the right , 


* Journ. icol, jfolyt. vol. vi (1806), p. 148. 
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and these are in accordance with the notation of Scheeffer*, denoted by 
D+f D+f (x) respectively. Similarly, the upper and lower derivatives 
otf(x) at x 1} on the left, are the upper and lower boundaries of the aggre- 

gate of functional limits of - ^ on the left, and are denoted by 

D~f (x) t D..f(x) respectively. 

As x has the values of a sequence converging to x x on one side, the 
f (%\ _ f \ • 

values of -- ^ 1 , if convergent, may converge either to the upper or 

to the lower derivative of / (x) at x y , on that side, or they may converge to 
some number lying between these extreme derivatives. In the last case they 
are said to converge to a median derivative on that side. The upper and 
lower derivatives on one side, together with any median derivatives on 
that side which may exist, make up the aggregate of functional limits of 
■f (%\ _ f \ 

at x ls on the side under consideration. 

x x x 


In case the function / (x) is continuous in an open interval, with x x as 
end-point, on the right of x lt the median derivatives have every value 
between D+f (xj and D+f (aq) (see § 229 ), but when / (x) is not continuous 
in such an open interval on the right of x lf the aggregate of derivatives 
on the right may be a closed discontinuous set having Df ( x *) and D+f (x x ) 
for its upper and lower extremes. A similar remark applies to the case 
of the derivatives on the left. 


It is frequently convenient in this general case to speak of the de¬ 
rivatives of / (x) on the right, and on the left, as existent, but indefinite 
in value: and in this case D+f {xf), D+f (x x ) are regarded as the limits of 
indeterminancy of the derivative on the right, and D~f (x x ), D_f (xf) as 
those of the derivative on the left. 

The definitions which have been given for the case in which the domain 
of the function is continuous are applicable, without essential change, to 
the case in which the domain is any perfect set of points. At a point of 
the set which is a limiting point on both sides, there exist in general the 
four extreme derivatives D+f (x), D+f(x) y D~f (x) y D_f(x), two or more 
of which may have equal values; and at a point of the perfect set, which 
is a limiting point on one side only, there exist of course only the two 
derivatives on that side. If the domain be any closed set, the derivatives 
exist only at those points which are limiting points of the set. 

A function defined for a perfect set may, by the method of correspond¬ 
ence, be correlated with a function defined for a continuous interval, the 
relative order of two points in the continuous interval being the same as 


* Acta Mathematica, vol. v (1884), p. 52. The same limits were considered by Du Bois-Rey- 
mond, Programm , Freiburg, 1870, also Milnch. Abh. vol. xn (1875), p. 125, under the name 
Unbestimmtheitsgrenzen. 
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that of the corresponding points in the perfect set (see § 163 ); and thus all 
properties of derivatives of functions defined for a continuous interval 
have their analogues in the case in which the domain is any perfect set. 


EXAMPLES 

1. If f (x) —x sin ^, f (0) —0; we have ^ ^ ^ ^ ^ -sin ^ , and for arbitrarily small values 

of h, this osc illates between 1 and - 1. The function / (x), although continuous at x 0, 
possesses no differential coefficient at that point; in fact 

/>+/(())-=l, l>,/(0)= -b Jrf{ 0) = l, I )../(())= -l. 

2. If / (x) - x 2 sin \f (0) — 0, the differential coefficient /' {x) exists for every value of a\ 
and is finite. At the point x — 0, /' {x) is zero, but has a discontinuity of the second kind. 


3. Let* f {x) -\ x ^1 + x sin 

/ (()) =0. In this case /' (x) every where exists; its value at x ~0 is +«., and although it has 
a finite value at every point except at x it oscillates in the neighbourhood of that point 
between indefinitely great positive and negative values. 


^, for x >0; J‘{jr)— -V x ^1 -t x sin J, for ;r<(); and 


4.f The function defined by / (r) — x (1 + J sin (log r 2 )}, and / (0) -0, is everywhere con¬ 
tinuous, and is monotone, but has no differential coefficient at x — 0. 

- 1 , i 

54 Let j (x) - e x sin /(0) -0; this function has at every point a differential coeffi¬ 
cient, and this is continuous at r=0. The differential coefficient vanishes at x — 0, and at 
an infinite number of points in the neighbourhood of x =0. The function J' ( x ), like / (x) % 
has an infinite number of oscillations in a neighbourhood of ^=0. 


THE DIFFERENTIAL COEFFICIENTS OF CONTINUOUS FUNCTIONS 

261 . Let us suppose that a continuous function, defined for a con¬ 
tinuous closed interval, is such that, at every point interior to an interval 
(a, ft), there exists a differential coefficient; this differential coefficient may 
at any point have a finite value, which may be zero, or it may have an 
infinite value, of which, however, the sign is definite. It will be observed 
that / ( x) is assumed to be continuous at the points a, ft, but it is not 
assumed that definite derivatives exist at those points. It will be shewn 
that, unless the function be constant throughout (a, /l), there exists at least 
one point in the interior of ( a , ft) at which the differential coefficient has a 
definite value , different from zero. 

Suppose /(«), f (ft) to be unequal. If they be not unequal, and the 
function be not constant throughout ( a , ft), we can replace the interval 

* Dini-Liiroth, Grundlagen , p. 112. 

f Pringsheim, Encyclopadie der math. Wisnensch. 11 a. i, p. 22. 

X Dini-Liiroth, Grundlagen , p. 313. 
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(a, ft), by another one contained in it, for which the functional values at 
the ends are unequal. Let us consider the function 


F (*) /(*)-/(«) ~ ~^{f (P)~f (a)}. 

F (a) and F (ft) vanish, and F (x) is continuous in (a, ft), and has a differ ¬ 
ential coefficient in the ordinary sense at each point, with the possible 
exception of a and ft; therefore it follows, by the theorem of § 213, that 
there is at least one point x { , in the interior of (a, ft), at which F (.r) is a 
maximum or minimum: this is the case even if F (x) be everywhere zero 
in the interval. A number's can therefore be found such that 


F (x , t 3) - F (x,), F (x, - 8) - F (x x ) 
have the same sign, or else vanish, provided 8 < e; and consequently the 
derivatives at x l , on the right and left, must have opposite signs, unless 
both of them be zero; therefore the differential coefficient at x l9 which 

must exist, must be zero. It follows that f' (x r ) -^-^1 ^ = 0, and 

p — Cl 

thus the point x 1 is the point of which the existence was to be proved. 
From this theorem we deduce the following general theorem : 

If f (x) be continuous in ike closed interval (a, b), and be such that it has 
a differential coefficient at every point in the interior of the interval, and if 
there be in (a, b) no lines of invariability , then there exists in (a, b) an every¬ 
where dense set of points at which the differential coefficient has finite values 
differing from zero. 

This is proved at once by applying the foregoing theorem to any 
interval contained in (a, b). There may be, in (a, b ), infinite sets of points 
at which the differential coefficient is either zero or infinite. 


262. The following theorem, known as the mean value theorem of the 
Differential Calculus, has been established by the proof in § 261: 

If the function f (x) be continuous in the closed interval (a, 6), and if the 
differential coefficient f' (x) exist at every point interior to the interval (a, 6), 
being either finite , or infinite with fixed sign , then a point a + 6 (b — a) 
exists, where 6 is some number such that 0 < 6 < l, for which 

f (b) - / (a) f (b - a)f (a + 6b - a). 

If we change the notation, so that (x, x + h) is the given interval, the 
result may be written 

/ (x 4- h) - / (x) F hf (x 4- Oh). 

It will be observed that no assumption is made as to the existence of a 
derivative on the right at the point a, or x y or as to the existence of a 
derivative on the left at b, or x -f h. It has only been assumed that the 
function is continuous on the right, and on the left, at these points re¬ 
spectively. 
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In case however it is known that the limit of f (x), as x ~ a, has a 
definite value A, then f (x) has a definite derivative on the right at the 
point a, and its value is A. 

For, since ^ ^ = /' (a -f 8x — a), and the expression on the 

right-hand side converges to A, as x ~ a, it follows that 

X~a X (l 

A corollary from the mean value theorem is that, if / (x 4- h) — / ( x ), 
then/' (x) must be zero at one point at least in the interior of the interval 
(x, x 4 - h). 

The following theorem may be proved by means of the mean value 
theorem : 

If f (x) have a continuous differential coefficient /' (x), at every point of 
an open interval (a, b), then f (x + h) — / (x) = h {/' (x) 4- p {x, h )}, for 
every pair of points x, x + h of the open interval; where p (x, h) converges to 
zero , as h ~ 0, uniformly for all points x of any closed interval (a, ft) con¬ 
tained in (a, b). 

Since / (x -f h) — / (x) = hf (x 4- 6h), where 0 < 0 < 1, for any pair of 
points x, x 4- h, within (a, b ), and since f (x) is continuous in the closed 
interval (a — h lt + h x ), where (a, fi) is a closed interval contained in 
(a, b), and h x is so small that the closed interval (a — h x , p -f hf) is also in 
(a, b), we have, for every value of x in (a, ft), \f'(x + 6h) — /' (x) | < e, 
if | h | < h lt provided h x is taken sufficiently small. This follows from the 
uniform continuity of /' (x) in the closed interval (a — h lt ft 4- hf). We 
may therefore take /' (x -f Oh) = f' (a:) 4- p (x, h); where | p (x y h) | < 6, 
provided | li | < h ly and x is any point in (a, ft). Since e is arbitrary, 
p (x , h) converges uniformly to zero, as h ~ 0, for all points x in («, ft). 

263. An important extension of the mean value theorem is the 
following : 

If f (x) be continuous in the closed interval (x, x -f h), and have a differ¬ 
ential coefficient at every point of the interval , with the possible exception of 
the end-points; and if F (x) be another function which is also continuous in 
the same interval , and at every interior point has a differential coefficient , 
whilst at the end-points there may be no definite derivatives , or they may be 
zero, or infinite, then, provided /' (x) and F' (x) have no common zeros or 
common infinities in the open interval , 

f(x + h)-f(x) f(x + 0h) 

F (x + h) - F (x) F' (x + Oh) 

for some value of 0 such 'that 0 < 6 < 1 . It is assumed that F (x 4 - h) + F (x). 
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To prove the theorem, let 

* «> -no -/(*) - ') «> - * <*»; 

and let it be assumed that F (x + h) — F (x) is not zero. Since 


<f> (x) = <f> (x + h), 

and <f> (£) satisfies the conditions of the mean value theorem, </>' (f) must 
vanish for some value x + Oh, of £, interior to the interval (z, x + h); <f>' (£) 
exists, since /' (£), F' (£) cannot both be infinite. We have then 


/' (x + Oh) - 


f (x -{- h) -f(x) 
F (x f h) - F (x) 


F' (x -f Oh) -=- 0, 


from which the theorem follows, since F' (x f- Oh) and /' (x + Oh) cannot 
be both infinite, or both zero. In the case in which / (x H~ h) /(#), we have 


/' (x + Oh) - 0, 

for some suitable value of 0; and then, since F (x -h h) — F (x), F' (x + Oh) 
are not zero, the theorem still holds. 


264. The last theorem may be applied to obtain a proof of the legiti¬ 
macy, under certain conditions, of a well-known method of evaluating 

limits which appear in the so-called indeterminate forms j-j, 


Let the two functions f (x), F (x) he both continuous at all points interior 
to the interval (a, a -V fl), and let the limits f (a + 0), F (a t~ 0) be both unique, 
and have the value zero; if finite differential coefficients /' {x), F' ( x ) exist at 
every interior point of (a, a 4- jS), and if h (> 0, and ^ (3) can be so determined 
that, within (a, a -f h), f (x) and F' {x) are not both zero, and not both 
infinite, together at any point, then if the limit 


lim 

A-0 


/> + *) 

F f (a + h) 


exist as a definite number, or be infinite with a fixed sign, the limit 


lim 

A~0 


/ (a + h) 
F (a f h) 


also exists, and the two have the same value. 


Moreover, if the first limit has no unique value, every value of the second 
limit is in the interval bounded by the upper and lower values of the first limit, 
and is a value of the first limit. Thus 


fiS ? {a ± h) > lim f {a + h) s lim f ( “- + h) s lim L + h) 
H F' (« + h) “ H F (a + h) - S F (« + A) “ jg} F ' (« + *)' 


The two functional values / (a), F (a) may both be defined to be zero, 
and thus the functions / (x), F (x) are continuous in any interval (a, a -f h). 
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where h < We have then, from the extension of the mean value theorem 

(§ 263 ) f(a 4- h) f (a \ Oh ) 

F (a f ft) F' (a j Oh)’ 

wliere 0 is some number such that 0 < 0< 1, whose value depends in 
general on that of h. If there be assigned to h the values in a diminishing 
sequence which converges to zero, the corresponding sequence of values 
of Oh also converges to zero. 


In case lim ^has a unique value, either finite, or infinite with 
h ^F'(a\ft) 1 

fixed sign, it follows that lim -*~ a * ] ias f} ie same value. 

6 /i~o F (a h) 


In case lim has not a unique value, let the diminishing 

F'(a-\h) 1 

sequence {/q,}, converging to zero, be so chosen that has a single 

limit, as h goes through the sequence {h n }, then lim ^ has the 

(a I 0 n h r ,) 

same single limit, and this must be one of the values of lim ^ . 

6 F' (a H h) 

Therefore any limit of as ft 0, must also be a limit of 

J F(a\hy 

as h ~ 0, and must lie in the interval which contains all the 

F (a -r h) 

limits of f 

F' (a -f h) 

It may happen, in particular cases, that lim has a unique 

A-o r (a -| ti) 

/alue, whilst lim ^ is not unique. 

a~o F (a + ft) 1 

1 f lx) 

For example, let, a = 0 ,f(x) - x~ sin , F (.c) - e x — 1; then lim 

X x yx) 

has the unique value zero, whilst ^ , or ^ 


I 


f(x) 

F f (x) 

unique limit, but its limits are in the interval ( - 1, 1). 


2x sin — cos 
x 


« -) c-*, 
x) 


has no 


If f (x), F (x) have both differential coefficients f r (x), F' (x) at all points 
of the open interval (a, a -f j8), and if f [a -f- 0), F (a -I- 0), both exist, and 
are infinite with a fixed sign; then , provided h (> 0, < j8) am be so determined 
that f (x), F' (x) are not both zero*, or both infinity , at any point in the open 


* This generalization of the conditions to be satisfied by the two functions in these theorems 
is due to W. H. Young; see Proc. Lond. Math. Hoc. (2), vol. vui (1909), p. 51. For a history of 
these theorems, see Pringsheim, Encycl. d. math. Wissensch. ii a. i, p. 20. A detailed investi¬ 
gation of the theorems is given by Stolz, Grundzuge , vol. i, p. 77 el seq.; Stolz assumes that F (r) 
is monotone in the interval. 
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interval (a, a |~ h), if Jim ^ ^ exists , as a finite number , or is infinite 

with a fixed sign , a/so Jim ^ exists , and the two have the same value . 

/i~o r (« -f a) 

Mor eover , if the first limit Iras no unique value , every value of the second, 
limit is in the interval bounded by the upper and lower values of the first, limit, 
and is one of the values of the first limit. 

Consider the interval (a -b 8,, a f 8 2 ) interior to (a, a (- /f); we have then 
/(« i S 8 )- /(« + 8,) _/'(« h &,) 

^ (fit I &,) -- (« I- 8,) T (« {- 8;i) 1 

where 8 3 is a number whieh lies between 8, and 8 2 ; it being assumed that 
F (a + 8 2 ) 4 F (a + 8*) =(= 0. This equation may be written in the form 


/(« + $,) _ / (a + S 2 ) /J<r_+ f, (« + &>) 

^ (a f 8,) ^ (a + 8,) + FT« + 83) ( ^ (« + Si), 

leaking 8 2 as fixed, if «r be an arbitrarily small positive number, a 
positive number 8' (< 8 2 ) can be so determined that for every value of 
8 X (> 0) which is < 8', the inequalities 

I *> + 8.)! >1 |/(«4 S 2 ) |, | /’ (a + S x ) | > “ | f (a + S 2 ) | 

are both satisfied; this follows from the fact that F (a -f 0) is infinite with 
fixed sign. 

We have now 


/(« + 8 j) ... r/ + (1 _ q /' (« + 8 a) 


.F (a 8,) '' ‘ ^ w F' (a + 83) ’ 

where | rj j < e, and | £ | < for values of 8 X such that 0 < 8 X < 8'; 
where 8 2 is fixed, and 8' (< 8 2 ) can then be determined. 

If we assign to 8 2 the values in a sequence that converges to zero, 8' will 
have the values in a similar sequence. 

If U, L denote the upper and the lower limit of ■£ r /, a - ^ ^,1 > as A ~ 6, 

t (a -f h) 

we see that ^ lies between U + e and L — 6, provided that 8 2 is 

r (a f o 3 ) 

sufficiently small. 


Then 


/(« + »,) • 


is between e 1 (1 + e) (U -f e) and — e -f (l — c) (L — e) 


F(a | 8 x j 

and therefore, since e is arbitrary, any value of lim 4 ^ is in the 

h~o * (« + n) 

interval (L, U). 

If U = L, then lim has the unique value lim / ( a . h) . 

Again, let the values of 8 X be restricted to belong to a diminishing 
sequence {S/ 71 *} converging to zero, and so chosen that J ^ --44 converges 

I* (a -f Oj) 
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to the unique limit A, as 8 X goes through the values in the sequence { 8 i <n) }. 
Then, for all sufficiently large values of n, ^ ^ — A -f A, where 

-t yCC -j~ 0 1 ' n, ) 

I A I < c; and then has the values ^ ^ y ^, for all the values 

x (& 03) 4 

of 83 in a certain sequence. It follows that the upper and lower limits of 
for a certain sequence of values of 8 3 , are in an interval 

r (a 4 - o 3 ) 

(A - 2 c A H- 2c \ 

V 1 + € ’ 1-7]' 

Since c is arbitrary, it follows that converges to A. 

x (a -I o 3 ) 


265. Let/ (a:), F (#) be both zero at the point x = x x , and let us suppose 
that /' (a^), F' (a^) both exist, and are finite, the latter not being zero. 
It is unnecessary to assume the existence of/' (z), F' (z) for x^ x x . 


From the definition of /' (x x ), F' (a^), we have 

f(x x + h) - h{f (z x ) 4* p (h)}, 

F (z x + h) = h {F f (x x ) + p (h)}, 
where p (h) 3 p (h) converge to zero, as h ~ 0. Thence we have 

f (?L ±A) = f ( x i) + P W 

F (z x + h) F' (xA +p'ih)’ 


from which it follows that 


lim 


f(x) = f' (Xj ) 

F(z) F'(x]y 


This includes the case in which f (ar*) — 0 , F* (a^) =(= 0 , when the limit 
is zero. 


Next, let it be assumed that/(a:,), /' (a?j), .../ (w ~ 1> (aq) (see § 270) all 
exist, and have the value 0 ; and that F (xj), F' (x x ) 3 ... F (n ’~ l) (a^) all exist, 
and have the value 0 . Also let it be assumed that f in) ( x x ), F (n ) (x x ) exist, 
and are finite, neither of them being zero. It will also be assumed that, 
in some neighbourhood of the point x lf the first n — 1 differential coeffi¬ 
cients of / (x) } and the first n' — 1 differential coefficients of F (x), all exist. 


We have then, 

fin-1) fa. 1 h\ 

lim ----- = / (n) (a^); since/ (n_1) (x x ) = 0 . 

0 n 


It then follows, by the employment of the theorem of § 263, that 


Hm f{x \± h) = 

A" reA "- 1 


A) 


/" fa + A) 


A~0 


— lim ■ ,. , 0 

h ~0 w (w — 1) ^ w ~ 2 


_ fa + * ) _/ (B ’ fa) 

““ ‘ w!A «! • 
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Similarly, we have lim -= —-J—and thus we see that, 
J a~o h n n ! ’ 


smce 


/fo + h ) _ h n~n' 1 (*i + h y h ” 

F (x 1 + h) n F (x x + h)/h n " 


if n' 


lirri {Jy 1 — —)< =-• 0 , if n' < n 
F (x 1 4 h) 

= i oo, if n’ > n 

« f (n) (xi)/F {n) (x ,) 

The following theorem has been established: 

If f (x), F (x) are zero when x = x x , arid have, in a neighbourhood of that 
point, differential coefficients of the first n — 1 , and n' — 1 orders respectively, 
which are all zero at the point x — and i/, moreover, the differential 
coefficients off (x), F (x), of orders n, n', respectively , and are finite, at 

the point x — x 1 , both of them being different from zero, then lim ^ ~ fow a 
f n (x ) 

unique value 0 , i °°> according as n ' < n, n' — n, or n' > n; the 

sign of ± oo 6 emgr that of f n (x^/F"' (xj. 

If, for every value of x 1 in an open interval (a, j 9 ), f (x), F' ( x) are 
both continuous, and F* (x) =f 0 , we see, as in § 262 , that 
f(x + h) - f(x) = h{f (x) 4 - p (x, h)}, 

F(x + h) - F (x) = h {F f (x) 4 - p (x, h)}, 
where p (x, h), p (x, h) converge to 0 , as h ~ 0 , uniformly for all points x 
in a closed interval (a, 6 ) contained in (a, ft). We then see that 

f(x + h)-f(x) , f (x) 

F(x + h)-F(x) ^verges to ^ 

uniformly in the interval (a, b), provided | F' (x) | exceed some fixed 
positive number, for all points in the interval. 


266. If the function f (x) have a discontinuity of the second kind at the 
point a, at least on the side which is towards the interval (a, a 4- h), but the 
function have a finite differential coefficient at every point of the interval 
(a, a 4 h), except at the point a, then the differential coefficient has, in any 
arbitrarily small neighbourhood of a, every finite value. 

Let A be any fixed number, positive, negative, or zero. The function 
f (x) — Xx has a finite differential coefficient at all interior points of 
(a, a 4 h), and it has a discontinuity of the second kind at the point a. 
If 8 (< h) be arbitrarily small, in the interior of the neighbourhood 
(a, a 4 8), / (#) — Xx has maxima and minima, for it cannot be monotone 
in that interval; and at such points /' (a;) — A ~ 0. Since A is arbitrary, 
it thus appears that in the neighbourhood of a,f (x) has every finite value. 
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The mean value theorem / (a + h) — / (a) = hf (a + Oh), where 0 < 0 < 1 , 
affords information as to the existence and value of the derivative at a, 
on the right, provided / (x) satisfies, in a neighbourhood of a on the right, 
the conditions under which the theorem holds. By considering both sides 
of a, information may be obtained as to the existence of a differential 
coefficient at a. 

(1) . If the function/' (x) have a functional limit at a on the right, then 

f(a^h) -/(a) 
h 

has a definite limit for h ~ 0, either finite, or infinite with fixed sign, and 
this is equal to that of /' (x). It follows that, in this case, a derivative 
at a on the right exists, and is either finite, or infinite with fixed sign. 

(2) . If the function /' (x) have no limit at a on the right, it may still 
happen that /' (a -f Oh) has a definite limit at a on the right, because 
a -f- Oh is not necessarily capable of having all values within a neighbour¬ 
hood of a. In this case, either (i), the derivative at a on the right maybe 
definite, and lie between the upper and lower limits of /' (x) at a on the 
right, or it may be equal to one or other of those limits; or (ii), there may 
be no definite derivative at a on the right, but J)f (a), D.f (a) may have 
different values, and these are certainly both finite in case the upper and 
lower functional limits of /' (x) at a are both finite. 

( 3 ) . The derivative on the right at a can only exist, and be infinite, 
(i), if /' (x) have an infinite limit on the right at a, or (ii), if it have an 
infinite upper limit on the right at a. In either of the cases, (i) and (ii), 
/' (x) may be everywhere finite within a neighbourhood of a on the right, 
or it may be infinite at some points in such a neighbourhood. 

* ( 4 ). If the derivative at a on the right exist, and be finite, then either 
(i)./' (x) has a definite limit at a on the right, equal to the derivative at a , 
or (ii),/' (x) has no definite limit at a on the right , but a sequence of points 
can be determined, of which a is the limiting point, such that the values 
of /' (x) for points of that sequence converge to the value of the derivative 
at a. At points which do not belong to the sequence, the values of /' (x) 
may be either finite or infinite. 

267. If f (x) be continuous in a given closed interval , and have at every 
point, with the possible exception of an enumerable set G, a differential 
coefficient of value zero, the function is constant throughout the whole interval. 

At the points of G we may suppose it to be unknown whether a differ¬ 
ential coefficient exists, or, if one does exist, what values it has. 

A more general form of this theorem is obtained by considering not 
the differential coefficient, but any one of the four derivatives, thus: 
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If f (x) be continuous in a given closed interval , and one of the four 
derivatives D + f ( x ), / (x), D ~f (x), D_f (x), be such that it is zero at every 
point of the interval , with the exception of points belonging to an enumerable 
set G, at which nothing is known as to its value , then the f unction is constant 
throughout the interval. 

To prove the generalized theorem for the case of the function Df (x), 
suppose that, if possible, / (x) — f (a) has at some point x 2 a value different, 
from zero, say the positive value p; and let (x, k) denote 
f (x) - f (a) - k (x ~ a). 

Then <f (a, k) -- 0, <f> (x s , k) ^ p - k (x x a). Choose any fixed positive 
number q < p , then </> (x 2 , k) > q , provided A: < -- ^ , or say k < K. Since 

</> (x, A) is continuous in (a, f>), and </> (o, A’) is zero, whilst </> (Xj, A') „> q , 
there exists an upper limit of those values of x between 0 and x i , for which 
</> (x, /’) k. <2, and this upper limit is attained for some value £, of x, which 
is such that £ < x,, and (f> (£ y k)- ~ q. Since <f> (£ + h, k) > q, provided 

0 < /i s Xj - £, we see that, since ^ ^ ^ ' is positive, 

(£, A:) is positive, if it be not zero. Now if f were a point not belonging 
to G, the value of D b </> (£, h) would reduce to — k; and therefore £ must 
belong to G. 

The number q being fixed, £ depends only on k ; and, corresponding to 
a given value of £, there is only one value of k; for 

</> (f> k) — </> (€> k') (C - k) (£ - a), 

which cannot vanish unless k — k\ since (a, k) is zero, and therefore < q. 
For a given value of k, the corresponding number of values of £, all of 
which necessarily belong to G , must be either finite, or enumerably infinite, 
since every part of an enumerable aggregate is either finite or enumerable. 
Therefore, to each value of A:, in the continuous interval (a, K - / 3 ), there 
corresponds a finite, or enumerable, set of values of £, and it would hence 
follow that the continuum (a, K - j8) is itself enumerable, which we know, 
is not the case. It has thus been shewn that, for no point can / (x) — / (a) 
have a positive value; and similarly, by considering 
f(x) — / (a) + Jc(x - a), 

it can be shewn that/ (x) — / (a) can nowhere have a negative value; hence 
/ (x) = / (a) throughout the whole interval (a, b). The case in which one 
of the other three derivatives vanishes, except at points of G , can be 
treated in a similar manner, 

A continuous function can exist, which is constant in each interval 
contiguous to a non-dense closed set G , and is not everywhere constant, 
provided G is unenumerable; but if G be enumerable, the function must 
everywhere have the same constant value. 
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The following theorem* * * § which is of importance in the theory of Inte¬ 
gration will now be established: 


If two functions be each continuous in a given closed interval , and if , of 
one of the four derivatives it be known that , for the two f unctions, this deri¬ 
vative has equal finite values at each point of the interval , with the exception 
of an enumerable set of points at which nothing is known as regards the two 
derivatives , then the two functions differ from one another only by a constant , 
which must be the same for the whole interval. 


It must first be observed that the proof of the preceding theorem 
suffices to shew that, if D'f (x) s 0, at every point of (a, b) not belonging 
to the set G, then / (x) — f (a) s 0 , for every point x of the interval. 
Similarly, if D+f (x) ^ 0 , everywhere in the interval, except at the points 
of G, then f (x) — f (a) £ 0 , at every point of the interval. 


If now/j (x ),/ 2 ( x ) f> e two continuous functions such that 

^7i (*) = Wf* (*) 


at every point of (a, b) not belonging to G , let / (x) = / (xj — / (x 2 ). If e 
be an arbitrarily small positive number, then for any point x not belonging 
to G , the condition 


fi (x + h)-f t (x) 
h 


>D*f l (x)~ 


€ 


is satisfied for a set of positive values of h which are arbitrarily small. 
Also we have, for all sufficiently small values of h , 


<*> + •; 

hence, since D + R (x) = Z ) + / 2 (x), we see that ^ ^- > — 2c, for 


all values of h belonging to some set. It follows that D + f (x) > — 2 c, and 
thencef that D+f (x) £ 0 , since e is arbitrary. By interchanging / 2 (x) and 
/ 2 (x), we see that D+{— f (x)} ^ 0 . From these two results we deduce 
tha t f (x) — f (a) % 0 , and that f (a) — f (x) £ 0 , throughout the interval 
(a, b); therefore / (x) is everywhere equal to / (a), and thus the theorem 
is established. 


Examples have been given by HahnJ and by Ruziewicz§ of continuous 
functions which have everywhere in an interval the same differential 
coefficient (not everywhere finite) but the difference of which is not 
constant in the interval. 


* Scheeffer, Acta Math. vol. v (1884), p. 283. 

f It is erroneously stated by Dini, that D+j{x)=0. See Qrundlagen, p. 275. 

X Monatsheftef. Math. u. Physik , vol. xvx (1905), p. 16. 

§ Fundamenta Mat. vol. i (1920), p. 148. 
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268. At a point x, at which the continuous function / (x) is a maximum, 
since, for a sufficiently small neighbourhood of such a point x , the differ- 

ences /(* + *)-/ (*), / (X - h) - f (*) 

are both negative or zero, for all points x -± h in the neighbourhood, it is 
clear that each of the derivatives D + f (x) y D+f(x) is either negative or 
zero, and that each of the derivatives D~ f (x), D. f (x) is either positive 
or zero. In case the function possess definite derivatives on the right and 
on the left at the point x , the first of these is zero or negative, or possibly 
— oo, while the second is zero or positive, or possibly -}- oo. 

If, at the point x , a definite differential coefficient exist, it must conse¬ 
quently be zero. In the case of a minimum the corresponding statements 
hold, where the positive sign takes the place of the negative one, and the 
reverse. The following theorem has now been established: 

If a continuous function possess a differential coefficient at a point x at 
which the function is a maximum or minimum , then the differential coefficient 
at x must be zero. 

If a function / (x) have, at x 1 , a differential coefficient not equal to 
zero, there is a neighbourhood of x y in which / (x) is monotone. This may 
be stated in the form that, corresponding to a suitable value of h, a number 
k can be determined, so that, to each value of y in the interval 

{yi - k,Vi + lc), 

where y x — / (xj, there corresponds one, and only one, value of # in the 
interval (x x — h, x x H- h). 

FUNCTIONS WITH LINES OF INVARIABILITY 

269. A continuous function may be such that, in the interval (a, 6), 
there exists an everywhere dense set of non-overlapping intervals, each 
one of which is a line of invariability of the function. Within each interval 
of the set, the function has its differential coefficient equal to zero; it 
therefore follows from the theorem in § 267, that the closed set of points, 
of which the given set of intervals is the complementary set, cannot be 
an enumerable set, otherwise the function would be constant in the whole 
interval (a, b). It is further clear that no two of the intervals can abut on 
one another; for the condition of continuity of the function at their 
common end-point would ensure that the values of the function in the 
two intervals were the same, and thus the two intervals would really 
belong to the same line of invariability. It follows that the end-points 
and external points of an everywhere dense set of lines of invariability of 
a continuous function must form a perfect non-dense set of points. 

That a continuous function with an everywhere dense set of lines of 
invariability can actually exist can be easily shewn as follows:—Make 
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the points of a non-dense perfect set correspond in order to the points of 
a continuous interval (a, b), then, as has been shewn in § 163, the corre¬ 
spondence may be such that the whole of a complementary interval of 
the perfect set corresponds to one point of the continuous interval. If a 
continuous function be defined for the continuous interval, we may define 
a new function which has at each point of the perfect set the same value 
as the original function has at the corresponding point of the continuous 
interval; and since all the points of a complementary interval of the perfect 
set correspond to the same point of the continuous interval, the new func¬ 
tion is such that it has an everywhere dense set of lines of invariability. 


EXAMPLES 


1. Take* the non-dense perfect set defined in Ex. J, § 83, by 


X^. 3 -4 - 


where every c n is either 0 or 2. A complementary interval has as its end-points 


3 + 3 a+ ’" +3nl 


I 




3 M 3 3“ 3 n 1 3' 1 ’ 

which may be denoted by («„, b„). Let the function j (r) be defined as follows: 
x of the interval (0, 1) belonging to the perfect set, let 


-For a point 


n*)- 


1 ( C 1 r 2 c n \ 

2V3 + * + - + £ 1 -) S 


when x is in the interval fa,,, />,,), let f (x) - f (a,,) -f (b„). The function f (x) so defined is 
continuous, and varies from 0 to 1, and is constant in each of the intervals {a v > b v ) comple¬ 
mentary to the non-dense perfect set. 

2.f Let the numbers in the interval (0, I) be expressed in a scale «~2 m - I, of odd 
degree; thus x — + ~ + ..., where 0 \ a r < h, and the number of digits a r is finite or 
infinite. For any number x represented in this manner, for which all the «’s are even 
integers, let / ( x ) equal + —* + ...^ . In ease any of the as are odd, let a k be the first 

one which is odd, and let f (x) then equal * ( a - + U \ +... + + \ (-—f . This function 

2 \m m~ m k l J 2 \ ) 

f {x) is continuous, and varies from 0 to 1; for an infinite set of points it has no differential 
coefficient, and for all other values of /' (.r) — 0. 


THE SUCCESSIVE DIFFERENTIAL COEFFICIENTS OF A CONTINUOUS FUNCTION 

270. If a continuous function / (x), defined for the closed interval 
(a, b ), have at every point a differential coefficient /' ( x ), which is itself 
continuous throughout the interval, the function f (x) may itself have a 
differential coefficient /" (x), which is called the second differential coeffi¬ 
cient, or the second derivative, of/ (x). 

* Cantor, Acta Math. vol. iv (1884), p. 386. See also Scheeffer, Acta Math. vol. v (1884), 
p. 289. 

t Grar6, Comptes Rendus, vol. cxxvn (1898), p. 1005. 
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The second differential coefficient /" (xf), of / (x), at a point x l9 at 
which/' (x t ) exists as a definite number, may be defined as 

lim i x JL ±A Lz L (* 1 ) 
h 

when this limit exists, as a finite number, or is infinite with a fixed sign. 
It is not necessary for the existence of the second differential coefficient 
/" (a^), that /' (x 1 ~b h) should have a definite value, for all points x j -f h 
in a neighbourhood of x 1 . 

It may happen that the four ratios 

l {i> + f (*1 ( h ) - f (*])}> \ {V+f (*l + h) - f (-Cl)}, 

\ {D-f (*i + h) - / (a:,)}, A {£>_ / ( Zl + h) - f (xj}, 

all have one and the same limit, finite, or infinite, when h ~ 0; and that 
this should be the case is sufficient for the existence of /" (a^), as here 
defined. 

The second differential coefficient /" (Xj) is thus defined as the repeated 
hmit lim lim -A + h + ~ / (*1 + A) - / ( x i + *). + / (xj) 

A~0 0 hk 

when this repeated limit exists; it being assumed that 

lim / fa + k ) ~f( x 1) 

has a definite value. 

A more restricted definition off " (a: 2 ) is frequently employed, in which 
it is assumed that /' (x) exists as a definite number, not only at x l9 but 
in a neighbourhood of x x . In accordance with this more restricted defini¬ 
tion, it is not sufficient for the existence of /" (a^) that the repeated limit 

lim lim^ (x i.+A+ A) ~ f fa + h )-f(x , + k) + / (x x ) 

A~« k ~0 hk 

should have a definite, finite value, or be infinite with fixed sign, even if 
it be assumed that /' (#]) exists. 

For the sake of generality the less restricted definition of /" (a^), given 
above, will be here employed. 

It is easily seen how the definition may be extended to the case of a 
differential coefficient/ (n) (x) of any order n. 

271. In case the function / (x) possesses a finite differential coefficient, 
of order n, at the point x — x x , and in which there is a neighbourhood of 
that point in which/' (x),f" (x), ... ,/ (w ~ 1) (x) all exist, let 

7,2 ln -1 

/(*! + *) -/fa1) - ¥' fa) - (*1) - ... - ( ^riy l f tn - v fa.) 
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be denoted by <f> (h). Since <f> (h ), </>' (h), ^ n - x) (h) all exist, and are 

zero, when h = 0, it follows from the theorem of § 263 that 

lirn ) </> M (0) — (arj); 

,^0 h n- nr K 

h n 

and then that <j> (h) — ^ f {/ (T,) (#i) f p (^)}> where p (/i) converges to 0, 
as h ~ 0. We have thus the following theorem: 

If f (x) possesses an n th differential coefficient at x = x x , and if the first 
n — 1 differential coefficients exist everywhere in some neighbourhood of the 
point x — Xy , then 

f (A -f J>) ~f(r i) + M' (A) + ^ \f" (A) (-••■ + jj i/ 1 ""” (A) 

+ K{/ ,b, (a)+/>(A)}, 

where lim p (h) — 0. 
o 

We find, by employing this theorem, that when f" (aq) exists, and 
/' (#) exists in a neighbourhood of x^ , 

/(A_+ * + “./(A + *) ~/<A+_*>.± /(A) _. r ( . ri) + CT t)> 

where the double limit lim a (A, k) exists and has the value zero. 

o 

A'~0 

It follows that the double limit of the expression 

/(A ± \ *) -/(A + A) — /(a f k ) + /(a) 

hk 

exists and is equal to/" (Xj). The converse does not in general hold. 

In particular the limits 

l im /(A ± 2h ) ~ 2 /(A + A) -I / (A) li m /(A + A) +/(a ~ A) - 2/(x,) 

a~o A 2 ’ A 2 

exist and have the value/" (aq). 

The converse of this does not hold; for either of these limits may exist, 
and yet/" (xf) need not exist, nor even /' ( x 1 ). An illustration of this is the 

case of the function defined by / (0) -= 0, / (x) — x sin 2 -, for x 2 > 0; at the 

point x = 0, /' (0) does not exist, and yet lim*^ — — — 0. 

zi~o n 

272. The following theorem, due to Schwarz*, is of fundamental im¬ 
portance in the theory of Fourier’s series. 

If, in an interval (a, fi), in which f (x) is continuous, 
f(x 4- h) - 2f(x) -\-f(x - h) 

p 


Oesamm. Abhandlungen, vol. n (1890), p. 341. 
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converge, for each value of x in (a, p), to the limit zero , for h ~ 0, then the 
function f (#) is a linear function in the whole interval , and consequently 
/' (x), f ,f (x) everywhere exist , and the latter is everywhere zero. 

Let us consider the function 

</> (X) = ± ]/(*) --/(«) - pZ “[/O 3 ) -/(«)]} + * 2 (* - “) (* - P), 

where A; is a constant. The function </> (a;), whichever sign be taken, is 
continuous in (a, p), and vanishes at a and p. We find at once 

lim ^ ~ ^ _ 2^2. 

/< 0 A 2 

and therefore, for each value of £ in (a, p), a positive number e can be 
found, such that </> (a; + A) — 2</> (a:) -f <f> (x — h) is positive and greater 
than zero, for all values of h which are numerically less than e. 

If <f> (x) can be anywhere positive in (a, P), there must be a point x 1 
at which it has the greatest positive value, and this point is neither a 
nor p, since (a), </> (ft) both vanish. If 77 be sufficiently small, 

<t> (*i + v) - <f> (*i) £ 0, <f> (X X - q) - <f> (x x ) -S 0, 

hence </> (x 1 -f 77) — 2 cf> (a^) -f <f> (x t — 77) 

is, for all sufficiently small values of 77, either negative or zero, which is 
contrary to what was shewn above. It follows that <j> (x) is everywhere 
negative in (a, ft), and cannot be zero except at a and ft. 

This holds whichever sign be taken in defining ( x ). Now 

k 2 (x — a) (x — P) 

is always negative, except at a and p , and may be taken to have its 
numerically greatest value as small as we please, since k is at our choice. 
It follows that __ 

f(x)-f(a)-~ z ^[f(P)~f(a)} 

can nowhere in the interval be different from zero; for, if at any point it 

had a value p, by choosing Jc such that Jc 2 (x — a) (x - P) is numerically 

everywhere < | p |, the function <f> ( x) could be made positive at the point, 
by proper choice of the ambiguous sign. It has thus been shewn that 
f (x) is linear in (a, p). 

273 . Schwarz’s theorem can be extended to the case in which there is 
an enumerable set of points in the interval (a, p), at which it is not known 
that the limit in question exists, or is zero, provided a certain condition 
be satisfied at each point of the enumerable set. The following theorem 
will be established: 

If, in an interval (a, p), in which f (x) is continuous , the expression 
f(x + h) - 2 f(x) +f(x - h) 
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converge , for each value of x in ( a, p), to the limit zero, for h ~ 0, except that, 
/or on enumerable set of points G, this is not known to be the case, then, 
provided that at each point x, of G, the expression 

f(x + A) -2/(*)+/(* -h) 
h 

converge to the limit zero, for h ~ 0, the function f (x) is a linear function in 
the whole interval (a, p). 

It should be observed that the condition 

lim f (x + h ) ~ 2 f( x ) +f( x ~ h ) = 0 

A-o h 

is certainly satisfied at any point x at which the differential coefficient 
/' (x) exists, and is finite. Moreover, whenever the condition is satisfied, 
f (x) has its derivatives symmetrical as regards the right and left of the 
point x. 

To prove the theorem, let it be assumed that 

/(*)-/(«)-*-“{/ 03 )-/(«)} 
has a positive value p, at some point x 1 interior to (a, j8); and let 
4> (*, *)-/(*)-/ («) - j^ a {/ 08) -/ («)} +k(x- o) ! , 

where A: is a positive number. We have 

cf> (a, k) = 0, <f> (p, k) = k (p — a) 2 , and cf> (x lf k) = p + k (x x — a) 2 ; 

and hence, provided k < = -K", 

the number (Xj, A:) is greater than (/S, A:), and than (a, k). We shall 
suppose k to be so chosen that this condition is satisfied; it then follows 
that <f> (x, k) has a maximum between a and p. The absolute maximum 
value of <f> ( x , k) may be attained once, or a finite number of times, or an 
infinite number of times, in the interval {a, P). 

The points x at which this maximum is attained have an upper extreme 
x (< p), which must itself be a point at which the maximum of <f> (x, k) 
is attained, as is seen, in the case in which x is an upper limit, from the 
condition of continuity of the function. We have therefore 

<f> (x + h, k) — <f> (x, k) & 0 , and <f> (x — h, k) — <f> ( x , &) £ 0 , 
if h be sufficiently small; from which we conclude that, in case 

hm + h ’ - h S exist, 

A-0 h 2 

its value is £ 0. It follows that x must belong to G; because the value of 
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this limit is 2 k, and therefore > 0, for any point which does not belong to G. 
Since x is a point of G, we have 

li m ( x + h ,k) -<f> (r, k ) + < j> (jn - h, k) - < f> ( x , k) } = Q 
h ~o I h h J 

and since the two fractions have the same sign, it follows that 

lim + <? + h ’ k l -£&’ k ) . o, and lim K k \ ~ * = 0. 

h h~ 0 h 

From this result we deduce that 


h ~o h h ~o ~ n p — a 


2 k (x — a). 


To each value of k in the interval (0, K), there corresponds one value 
of x , and it is impossible that the same value of x can correspond to two 
different values k x> k 2 of k. For if this were the case, we should have 
k x (x — a) ■= k 2 (x - a), 

and therefore kj = k 2 , since > a. Now it is impossible that the set of 
points k , interior to the interval (0, A"), can be such that to each such 
point there corresponds a distinct point x belonging to the enumerable 
set G. We conclude that it is impossible that 


/(*)-/(«)-*_“{/(£)-/(a)} 

can have a positive value p at any point x 1 of the interval (a,j8); and it 
can be shewn in a similar manner that there can be no negative value of 
the same function in the interval. It follows that the function must every¬ 
where be zero, and therefore that / (x) is linear in the interval (a, ft). 


274 . Let us suppose that the continuous function / (x) is linear in each 
interval contiguous to an enumerable closed set G, contained in (a, ft), 
and that it possesses everywhere in the interval ( a , ft) a finite differential 
coefficient. 

In this case both the conditions of the theorem of § 273 are satisfied, 
and therefore the function is linear in the whole interval (a, ft ); we have 
then the theorem: 

If f (x) be a continuous function possessing everywhere in the interval 
(a, ft) a finite differential coefficient , and the function be linear in each one of 
an everywhere dense set of intervals complementary to an enumerable closed 
set of points G, then f (x) is a linear function in (a, ft). 

If the closed set of points G were unenumerable, the preceding reasoning 
would no longer be applicable, except that, at an isolated point of G , it 
would establish that the linear functions in the two intervals which abut on 
one another at the isolated point must be identical. Confining therefore 
our attention to the case in which G is a perfect set, we see that a 
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continuous function possessing everywhere a finite differential coefficient 
may exist, which is linear in each sub interval complementary to a non- 
dense perfect set of points contained in the interval for which the function 
is defined, and yet the function need not be linear in the whole interval. 

The existence of such functions will be effectively established in 
Chapter vn, where it will be shewn that they may be obtained by the 
integration of continuous functions which have an everywhere dense set 
of lines of invariability. 

OSCILLATING CONTINUOUS FUNCTIONS 

275 . Let us suppose that the continuous function / (x) has no lines of 
invariability in the interval (a, /?), and that everywhere in this interval it 
has a finite differential coefficient. If, within (a, /?), there be a maximum 
or minimum of / (x), then at such a point/' (x ) 9 which exists and is finite, 
must be zero. If the maxima and minima in (a, j 3 ) be everywhere dense, 
then /' (x) vanishes at every point of the everywhere dense set; and if 
f' (x) were continuous throughout («, /?) it would follow that it was every¬ 
where zero, which would be contrary to the hypothesis that («, £) is not 
a line of invariability. 

It follows from this that, if in an interval (a, /?), which contains no lines 
of invariability of the continuous function f ( x ), the differential coefficient 
/' (x) everywhere exists, and is continuous , there must be in the interval an 
everywhere dense set of sub-intervals in each of which the function is monotone. 

We have further the following theorem : 

Iff (x) be continuous in (a, j 3 ), and have no lines of invariability, but have 
an everywhere dense set of maxim,a and minima , there must be in the interval 
an everywhere dense set of points at each of ivhich /' (x) either does not exist , 
or does exist and is discontinuous. 

A continuous function / (x) which, in a given interval (a, £), has no 
lines of invariability, but has an everywhere dense set of maxima and 
minima, is said to be a continuous function which is everywhere oscillating 
in the interval (a, £). Such a function cannot have a differential coefficient 
which is continuous throughout the interval. 

The continuous functions which are everywhere oscillating in an interval 
may be divided into two classes. 

(1). The function may be such that, if the constants Z, m be properly 
chosen, the function / (x) -f lx ~f m is monotone in the interval. In this 
case / {x) is expressible as the difference of two monotone functions, and 
thus belongs to the class of functions with bounded variation. These 
functions may be said to be of the first species , or to be functions with 
removable oscillations. 
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(2). Such functions as do not belong to (1) may be said to be of the 
second species , or to be functions with irremovable oscillations. 

In order to bring to light the essential distinction between the two 
classes of functions, as exhibited by the properties of their derivatives, 
we first of all remark that, if D+f(x) have a positive lower boundary 
for all points x in the interval (a, fi), then at each point / {x + h) — / (a:) 
is essentially positive for all positive values of h which are less than some 
number 8, dependent on x \ hence the function is monotone in the interval. 
The function would also be monotone in case the specified condition were 
that D + f (x) has a negative upper boundary for all values of x in ( a , /?). 

Now suppose that D+f (x) has a definite negative lower boundary in 
(a, /?); let this be — c, and consider the function cf> {x) = / (x) + lx + m, 
where Z > c; we have then D+<f> (x) ^ l -f D+f (x) l ~ c ; hence the func¬ 
tion (f) (x) is monotone in («, / 3 ). Thus / (x) is expressible as the difference 
of the two monotone functions </> (x) and lx + m. Similarly, if we had 
taken the condition that D l f (x) has a definite positive upper limit c, the 
function / (x) + lx -f m , where Z < — c, could be shewn to be monotone. 

It is clear that, instead of the linear function lx -f- m, we might have 
used any continuous differentiable function whose differential coefficient 
was > c, or < — c, throughout the interval, in the two cases. 

The argument would have been unaltered if it had been assumed that 
there were a finite or infinite set of lines of invariability in (a, fi). 

It has thus been shewn that: 

If the continuous function f (x) be such that either D+f (x) has a negative 
lower boundary for all values of x in (a> ft), or that D + / (x) has a positive upper 
boundary , then all maxima and minima of the oscillating function f (x) are 
removed by adding to f (a:) a properly chosen linear function , and thus the 
function is of the first species , and is of bounded variation . 

In particular, the conditions of the theorem are satisfied if the de¬ 
rivative, on one side, without necessarily having a definite value at any 
point, be such that for the whole interval it is numerically less than some 
fixed positive number. 

A function, such that for a given interval, 

|i> + /(*)|. \D + f(x)\, | D~f (x) |, | D_f(x) | 

are all less than some fixed number, is said to be a function with bounded 
derivatives. Such a function has bounded variation in the interval, and 
if it be everywhere oscillating, it is of the first species. 

If all the four derivatives are, at every point, numerically less than the 
positive number M , in which case the function is necessarily continuous, a 
neighbourhood of any point x may be so determined that, if £ be any point 
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in that neighbourhood, \ f (£) — f (x)\ ^ M \ £ — x\. Hence the fluctuation 
in the neighbourhood 8 is £ 2 MS. 

By the Heine-Borel theorem all the points of (a, b) are interior to a 
finite number of the intervals S. It follows that the interval (a, b) may 
be divided into a finite number of parts, each of length less than a pre¬ 
scribed number d, such that the sum of the fluctuations in those parts is 
£ 2 M (b — a). Since d is arbitrarily small, it follows that the total fluctua¬ 
tion of f (x) in (a, b) cannot exceed 2 M (b — a). 

It is not sufficient, in order that a function may be of limited total 
fluctuation, that, at each point, all four derivatives are finite; but it has 
been shewn to be sufficient that their values in the whole interval should 
be bounded. 

276 . Let us suppose that, for a set of points G , everywhere dense in 
(a, b), progressive and regressive derivatives at a point of G exist, and are 
infinite, but of opposite signs. At any point x 0 , of G, a neighbourhood can 
be found, containing x 0 , such that, for any point x in it, / (x) — /( x 0 ) is 
of fixed sign for the whole neighbourhood, and is never zero, except when 
x = x 0 ; it follows that x 0 is a proper maximum, or minimum, of the function. 

It will be shewn that, in any interval (a, / 3 ) contained in (a, 6), there 
is an infinite number of points at which the function has the same value. 
Let f be a maximum point of f (x), within (a, ft), and let (f — rj, f -f e) 
be the greatest interval enclosing £, within which / (x) — f (£) is negative; 
suppose that the absolute minimum of the function for this interval is in 
(€ — £); taking a maximum point in the interval (£, £ -f e), then in 

(£ — V, £) there is a point at which / (£/) — / (g x ), since/ (g x ) lies between 
the greatest and least values of the continuous function in (£ — rj, £). 

Now there is a maximum interval ^ -- rj l3 -f- ej for the point g l9 
and this lies within (g , £ + e); and in this interval we may as before find 
a maximum point f 2 , such that a point £ 2 ' also exists within the interval, 
for which / (f 2 ) =/(£ 2 '). There is also a point f 2 " in (£ — 77, f), such that 

/ (fn =/«.')=/(&>• 

We may proceed in this manner, until we find n points 

t An 1) 

Sn 1 t S n-1 » 1 5 

such that / (£,.,) = / -...*/ (C-V*)- 

Now let be a limiting point of £ l9 £ 2 , £ 3 , ... £ n , ...; and let be a 
limiting point of f/, f 2 ', , and £/' be a limiting point of f 2 ", £ 3 ", ...; then 

Thus the points £ w , £/, £/',... form an infinite set, in (a, jS), at which 
the functional values are the same. 



275-277] Oscillating Continuous Functions 377 

The points £/, £/', ... have a limiting point £ 0 , at which the func¬ 
tional value is the same as for the set itself; therefore 

/(&)-/(£.) = /(f.. 

hence, at f 0 , either the derivative is determinate, and equal to zero, or 
else it is indeterminate, with zero lying between its upper and lower limits. 
Thus it has been shewn that* : 

If a continuous function have an everywhere dense set of points at which 
there are progressive and regressive derivatives that are infinite and of opposite 
signs , there is an everywhere dense set of points at each of which th& derivative 
is either indeterminate or else zero. Thus a continuous function cannot at all 
points have infinite progressive and regressive derivatives of opposite signs. 

If we apply the above theorem to the function / (x) — cx, where c is 
a prescribed constant, then, since / (x) — cx has an infinite derivative at 
the same points as those for which / (x) has an infinite derivative, we obtain 
the following theorem: 

If the continuous function, f (x), have at an everywhere dense set of points 
infinite progressive and regressive derivatives of opposite signs, there is an 
everywhere dense set of points at each of which the derivative either has the 
prescribed value c, or is indeterminate, and such that c lies between its upper 
and lower limits. 

In geometrical language, a point x, at which there are infinite pro¬ 
gressive and regressive derivatives, of opposite signs, is a point at which 
the curve y = / (x) has a cusp, with its tangent perpendicular to the z-axis. 
At a maximum the cusp points in the positive direction of the y-axis, the 
progressive derivative being — oo, and the regressive derivative + oo. At 
a minimum the cusp points in the negative direction of the y-axis, the 
progressive derivative being -f oo, and the regressive derivative being — oo. 

PROPERTIES OF INCREMENTARY RATIOS 

277. If x x be a point of the interval (a, b), in which / (x) is defined, 

-f _ f \ 

the function*' v ^ , for points x such that x x < x s b, may be called 

the incrementary ratio at x 1 on the right; it may be denoted by I (x, x^; 
and in case / (#) be a continuous function, this incrementary ratio is also 
continuous at every point of its domain. This incrementary function has 
an upper and a lower boundary for its whole domain (a^ < x & b ); these 
upper and lower boundaries may be denoted by U (x^, L (Zj), and either 
of them may be finite or infinite; however U (a^) can only be infinite with 

* Konig, Monatshefte f. Math. u. PhysiJc, vol. I (1890), p. 7. The above proof is that given 
by Schoenflies, Bericht, vol. I, p. 160. 
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the positive sign, and L fo) only with the negative sign. U ( x 2 ), L (a^) 
being regarded as functions of x x , defined for every point of (a, b), except 
the point b, the function U (x x ) has a finite or infinite upper boundary for 
its whole domain, which we may denote by U ; and the function L (x x ) 
has a finite or infinite lower boundary for its whole domain, which we 
may denote by L. There exist therefore two numbers U, L, which may 
have the improper values 4- oo, — oo respectively, such that 

x 2 ~ X x 9 

for every pair of values of x lf x 2 , where x 2 > x x , always lies between them, 
or is equal to one of them. 

The incrementary ratio on the left of a point can be defined in a similar 
manner; and we thus define two functions U' (x x ), L' (x x ), at x l9 as the 
upper and lower boundaries of these incrementary ratios. 

It is easily seen that U', the upper boundary of U' (#) in the interval 
(a, b), is identical with U, and that L', the lower boundary of U ( x ), is 
identical with L . Thus U 9 L are the upper and lower boundaries of 
I (x l9 x 2 ) for every possible pair of points {x x , x 2 ) in the interval (a, b). 

278 . When the function f (x) is continuous {bounded or unbounded) in 
(a, b), U (x) is a lower semi-continuous function, and- L (a:) is an upper 
semi-continuous function. Accordingly, U (x) and L (x) are point-wise dis¬ 
continuous, when they are not continuous. 

If U (x x ) is finite, x 2 can be so determined that 
U fa) - I fa, X 2 ) < | e ; 

and also a neighbourhood fa — r), x l + rj), of .r,, not containing x 2 , can 
be so determined that, if x t ' be any point in this neighbourhood, 

I I fa'. * 2 ) - 1 fa, x-i) | < 

for / (x, x 2 ) is continuous with respect to x, at a:,. We have, in this neigh¬ 
bourhood, I fa', x 2 ) > I fa, X 2 ) - \<l > U fa) - e. 

It follows that U {x x ') > U (x x ) — e; or U (x) is lower semi-continuous 
at x 1 . If U (ajj) % f oo, x 2 can be so determined that / (x l9 x 2 ) > N + 
where N is an arbitrarily chosen positive number. If / (x x ) is finite, the 
neighbourhood (x x — r\, x x -f rj), of x l9 may be determined as before; and 
then I (xf, x 2 ) > N, and thus U (xf) > N, which is the condition that 
U {x) should be lower semi-continuous at x x . li f (x x ) = -f oo, the neigh¬ 
bourhood can be so determined that I (xf, x 2 ) > N, and thus TJ {xf) > N ; 
hence TJ (x) is lower semi-continuous at x x . That L (x) is upper semi- 
continuous can be proved in a similar manner. 

The incrementary ratio I (x, x'), where f (a;) is continuous { bounded or un¬ 
bounded) in the closed interval (a, 6), assumes every value between its upper 
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and lower boundaries U and L , for a pair of points x, x' interior to the 
interval (a, b). 

Let k be a number between V and L , thus U > k > L. The function 
U (x) being a lower semi-continuous function, the set of points at which 
V (x) ^ k is a closed set G l (see § 230); and similarly, since L (x) is an upper 
semi-continuous function, the set at which L (x) - k is a closed set G 2 . 

If, at any point a, U (a) =■ L (a) — k, the function has the value 


f (a) +■ k(x - a), 

for all points x in the interval (a, b). Assume first that no such points a 
exist; then since L (x) < U ( x ), the two sets G x , G 2 can have no points in 
common. Since the continuum (a, b) cannot be the sum of the two closed 
sets G 1 , G 2 , it follows that there exists at least one point which belongs 
neither to G x nor to G 2 , and thus at which U (a) > k > L (a). Tf a be a 
point at which this condition is satisfied, in a certain neighbourhood of a 
the conditions v {x) > v {a) _ e> L {x) < L {a) + f 


will be satisfied; and, if e be sufficiently small, U (x) > k > L (x) f at all 
points in this neighbourhood. Thus each point a , at which 

U (a) > k > L (a), 


is an interior point of a set of points at all of which U (x) > k > L (x). 
In case there is a point a such that U (a) - L (a) — k, and if this does 
not hold for any smaller value of x than a, it is clear that, in the interval 
(a, a), the upper and lower boundaries of I (#) are U and L. The above 
proof applied to (a, a) shews that in this interval there exists a set of 
points at which U (x) > k > L (x). 

At any such point x interior to (a, 6), we can choose positive numbers 


Aj, h 2 such that 


/ (x, x + hf) > k > I (x, x + h 2 ) y 


where x + hj <6, x + h 2 < b. Since / (x, x + h) is continuous in the 
interval of h } of which h x , h 2 are the end-points, a value of h between h x 
and h 2 exists such that / (x, x + h) — k, which establishes the theorem. 

If, at every point in {a, b), f (x) have a differential coefficient (finite, or 
infinite with fixed sign), then /' (x) has every value between its upper and 
lower boundaries in the interval (a, b). 


For, if k be any number between U and L , since, in accordance with 
the above theorem, a pair of interior points a, exists, such that 

1 (a, 0) - k, 

it follows from the theorem of § 261 that there exists a point x, in (a, P), 
at which f (x) = k. Since k is any arbitrarily chosen number between 
U and L,f (x) takes all values between U and L. It is clear that U and L 
must be the upper and lower boundaries of f (x), for if /' (x) > U, x ~f h 
could be so determined that I (x, x + h) > U, which is impossible. 
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PROPERTIES OF THE DERIVATIVES OF CONTINUOUS FUNCTIONS 

279. Let / (x) be continuous in the interval (a, 6), and let U and L 
be the upper and lower boundaries of the inerementary ratios above 
defined. Take (a, p), any interval in (a, 6), and consider the function 

4> (x) =/(*) -/(«)- X pE~lf OS) -/(«)]. 

Since <f> (a) — 0, <j> (p) = 0, unless </> (x) be constant through (a, /?), there 
must be within (a, £) a maximum or minimum of <f> (x ); and thus at least 
one point x 1 exists within (a, ft), such that 

<f>(x 1 ±h) - <f> (xj) £ 0, 
for all sufficiently small values of h , or else 

<f> {x1 ± h) — <f> (X 1 ) 3 0, 

for all sufficiently small values of h. At such a point 

I (x x + h , x x ) & I (a, P), 
and I (x x — h , %) s I («, j3), 

or else / (a^ + h, x x ) ^ / (a, ft), 

and / (Xj — h , Xj) s / (a, £). 

If </► (x) have an infinite number of maxima and minima in (a, ft), there is, 
in (a, p) } an infinite number of points at which the first of the conditions 
for <f> (x) holds, and also an infinite number at which the second holds. 
If there be only a finite number of maxima and minima of cf> (x) in (a, p), 
then this interval can be divided into a number of portions in each of which 
the function <f> (x) is monotone; and in any one of these portions either 

I (x ± h, x) £ I (a, P), 

at all points within the sub-interval, or else 

I (x ± h, x) £ I (a, P), 

for every x within the portion, and for sufficiently small values of h. We 
see then that I ( a , p) and I (x ± h, x) lie between U and L. 

Thus, in every interval (a, P) contained in (a, 6), in which c/> (x) has an 
infinite number of maxima and minima, there are (1), an infinity of points 
x at which I {x + h, x), for all sufficiently small values of h 9 lies between 
L and I (a, P)\ and (2), an infinity of points at which the same is true of 
I (x — h 9 x); (3), an infinity of points at which I (x + h f x) 9 for all sufficiently 
small values of h , lies between U and / (a, P) ; and (4), an infinity of points 
at which the same is true of / (x — h , x). 

_ Ct 

In case f (x) - f (a) — p~z~ a C/ (P) ~ f ^ ave on ty a finite number 

of maxima and minima in (a, P) } there are in (a, P) finite intervals such 
that all the points in one of them belong to both the sets (1) and (2), and 
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also finite intervals in which all the points belong to both the sets (3) 
and (4); each of these sets of intervals is finite, and an interval of one set 
is followed by one of the other set. 

The number L being the lower limit of the function L ( x) in the interval 
(a, 6 ), there exists a point x Y such that L is the lower limit of the values 
of L (x) in any arbitrarily small neighbourhood of x x \ and it follows that 
in such neighbourhood of x x there are points £ such that I + h, £), for 
an infinity of values of h , differs from L by less than a prescribed positive 
number e. Therefore there is, in (a, b), an infinity of pairs of points (a, p) 
one of which is arbitrarily near x ± , such that 1 (a, p) differs from L by 
less than e. 

Similarly, it may be shewn that, in (a, b), there is an infinity of pairs 
of points (a, /?), such that / («, ft) differs from U by less than the prescribed 
number e. 

If U or L be infinite, there exists an infinity of pairs of points such that 
1 ( a , p) is arithmetically greater than a prescribed number c, and has the 
same sign as the infinite U or L. 

We can consequently choose the interval (a, P) so that 

I (a, P) = L + 77 , or else so that I (a, p) = U — 77 , 
where 77 < e, provided U and L are finite. If one, or both, of U, L be 
infinite, (a, P) can be so chosen that I (a, P) has the same sign as U , or 
as L, and is arithmetically greater than a prescribed positive number c. 

We have now obtained the following results : 

If f (x) be a continuous function , and (a, b) be the whole , or a part , of its 
domain , to which U and L correspond , then (1), if L be finite , there exists in 
(a, b) an infinity of points for which D + (x), D + (x) both lie between L and 
L + e, where e is an arbitrarily prescribed positive number; and at these 
points £H (x), 2> + (x) are either equal , in which case a derivative on the right 
exists , or else they differ from one another by less than e: (2), if U be finite , 
there exists in (a, b) an infinity of points for which D + (#), D + (x) both lie * 
between U and U — e; and at these points there exist derivatives on the right , 
or else D + (x), D + (x) differ from one another by less than e; (3), if U or L 
be infinite , there exists an infinity of points at which D + ( x ), D + (x) are both 
numerically greater than an arbitrarily great number c, and have the same 
sign as the U or L which is infinite. A similar statement holds as regards the 
derivatives on the left. 

The above is true irrespectively of the number of the maxima and 
minima of / {x ); but if / (x) have in (a, b) only a finite number of maxima 
and minima, and if the same be true of all the functions f (x) — lx m, 
obtained by the addition of a linear function, then there exist in (a, b) finite 
sub-intervals such that at all points in one of them the above statements 
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hold both as regards the derivatives on the right and as regards those 
on the left. The numbers U and L correspond in each case to the par¬ 
ticular sub-interval. 

It will be observed that the theorem does not assert the necessity of 
the existence of points at which a determinate derivative on the right or 
on the left exists, but it states that there are in every sub-interval points 
at which the difference between the upper and lower derivatives on one 
side is less than a prescribed arbitrarily small number, or else at which 
both such derivatives are arithmetically greater than an arbitrarily fixed 
large number. There are therefore certainly points in every sub-interval 
at which there is, so to speak, an arbitrarily near degree of approximation 
to the existence of a finite or infinite derivative on the right, and also 
points at which the same is true as regards derivatives on the left. 

280. It will now be shewn* that, for a continuous function , of which 
(a, b) is the whole , or a 'part , of its domain , the upper boundary of each of the 
four derivatives D+f (x), D + f (x) 9 D~f (x), D_f(x) for all values of x in 
(a, b) is U, the upper boundary of the incrementary function in (a, b), and 
that the lower boundary of each of the four functions is L. If U and L be 
both finite the function belongs to the class of functions with bounded derivatives. 

A function with bounded derivatives accordingly satisfies the condi¬ 
tion, that for every pair of points x lt x 2 , \f (x ± ) — f (x 2 ) | < k | x x — x 2 |, 
where k is a fixed positive number. It has been pointed out, in § 275, that 
such a function belongs to the class of functions of bounded variation. 

It is clear that the upper boundary of each of the functions D f / (x), 
D + f (x) } D~f (x), D_f(x) is a number which cannot be greater than U. 
Now since it has been shewn that points exist in (a, b) such that, if e be 
an arbitrarily prescribed number, both D+f (x), D + f (x) differ from U by 
less than c, when U is finite, and are arbitrarily great if U is -f oo, it follows 
that U is in either case the upper boundary of D+f (x ), D + f(x). In a 
similar manner it can be shewn that U is the upper boundary of both 
D f (; x ), D_f (#). The proof that L is the common lower boundary of the 
four functions is exactly similar. 

Each of the four expressions D+f (x), D + f(x), D~f (x), D_f (x) may 
be regarded as a function defined for the whole domain of / (x), except at 
one of the end-points; the ordinary definition of a function being extended 
so far as to admit infinite functional values with a fixed sign. 

//, at any point x 0 , interior to (a, b), one of the above functions , say 
D+f (x ), be continuous , either in the ordinary , or in the extended , sense of the 
term (§219), then , at that pointy the other three functions are also continuous , 

* Du Bois-Reymond, Math. Ann. vol. xvi (1880), p. 119; also Scheeffer, Acta Mathematica, 
vol. v (1884), p. 190. 
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and are equal in value to I) f (x 0 ), and thus there exists , at x 0 , a differential 
coefficient . 

To prove this, take any interval (x 0 — e, x 0 -f e); then all four functions 
have in this interval the same upper boundaries, and also the same lower 
boundaries. If D+f (x {) ) be finite, the upper and lower boundaries of 
D+f (x) in (x 0 — e, x 0 -f e) each differ from 1 ) j f (:r 0 ) by less than a number 77 
depending on e, in such a way that, as e is indefinitely diminished to the 
limit zero, 77 also diminishes to the limit zero. Since all four functions have 
the same upper boundary and the same lower boundary in (x 0 — e, x 0 -f e), 
the upper and lower boundaries of each differ from D+f (x 0 ) by less than 
77, and 77 can be made as small as we please by taking e small enough. 
It follows that all four functions are continuous at x 0 , and that all four, 
at x 0 , are equal to D+f(x 0 ); and thus there exists a differential coefficient 
at x 0 . 

In case D + f (x 0 ) is 4- 00, e can be so chosen that, in (a; 0 — e, x 0 -f e), 
D+f (x) is everywhere greater than an arbitrarily large chosen number c, 
and the upper and lower boundaries of each of the four functions are then 
greater than c; by taking a succession of values of c which increase inde¬ 
finitely, and considering the corresponding sequence of values of e which 
converge to zero, we see that each of the functions D + f (x), D~f (x), D^f (.r) 
is infinite at x Q , and is continuous, in the extended sense of the term, at 
that point; there is then a differential coefficient at x 0 which is infinite, 
and of definite sign. 

It follows that, if it be known that any one of the four derivatives is every¬ 
where continuous in an interval , there exists everywhere in the interval a 
differential coefficient in the ordinary sense of the term. 

281 . The upper and lower boundaries of any one of the four derivatives 
of a continuous function are the same in the open interval (a, b) as in the 
closed interval*. 

It is clear that the upper boundary of D+f (x) in the open interval 
cannot exceed that in the closed interval. If possible let it be less by 
some positive number c. This means that D+f (a) exceeds the upper 
boundary in the open interval by c. There exists then no point interior 
to (a, b) for which D'f {x) > D+f (a) — c. Now a point x x can be so deter¬ 
mined that 1 (x ly a) differs from D+f (a) by less than |f, and a point x 2 
can be so determined that 1 (x l9 x 2 ) differs from I (x Y , a) by less than 
and therefore from D+f (a) by less than €. It follows that, in any interval 
which includes x x and x 2 , the upper boundary of D+f (x) is greater than 
D+f (a) — e; and if e < c this contradicts what has been shewn above. 
Hence the upper boundary of D+f (#), in a s x < 6, is the same as that in 
a < x < b. 

* See W. H. and G. C. Young, Quarterly Journal of Math. vol. XL (1909), p. 12. 
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It follows as a corollary that D 4 / (a) cannot exceed the upper limit, on 
the right, of D+f (x), at x = a. 

282. / (x) being continuous in (a, b ), the continuous function 

4 > (*) =/(*)-/(«)-* ~ a a {/ (6) - / (a)}, 

which vanishes at a and b, must have a maximum, or a minimum, at an 
interior point o£ (a, b). At a maximum a, D+(j> (a), D + </> (a) are both 5 0, 
and D~<j> (a), D_<f> (a) are both £ 0, the inequalities being both reversed 
in case a is a minimum. If it be known that, at every interior point of 
(a, b), there is no distinction of right and left as regards derivatives, so 
that D+<f> (x) = D~(f> (x) and D+(f> (x) = D_<f> (x), we see that all four de¬ 
rivatives must vanish at the point a, and thus there is a differential coeffi¬ 
cient at a , which has the value 0. We have thus obtained the following 
extension* of the theorem of the mean: 

If there is no distinction of right and left with regard to the derivatives of 
the function f (x) (bounded or unbounded) continuous in the interval (a, b), 
there exists a point , interior to the interval , at which a differential coefficient 

exists equal to . 

b — a 

Thus f ^ ^ ^ = /' (a + Ob — a), where 6 is such that 0 < 6 < 1 . 

As this may be applied to any interval (a, /?) contaippd in (a, b), we 
obtain the following theorem : 

If there is no distinction , as regards right and left , between the derivatives 
of the continuous function f (x), defined in the interval (a, 6), there exists an 
everywhere dense set of points in (a, 6), of the cardinal number of the con¬ 
tinuum , at which f (x) exists; and f (x) has every value between its upper and 
lower boundaries. 

283. The derivatives D+f (x), D+f (x) of a continuous function are, at 
any point # 0 , such that a ^ x 0 < b, either both continuous on the right , or both 
of them have a discontinuity of the second kind , on the right; but they cannot 
have ordinary discontinuities on the right. 

A similar statement holds as regards the continuity or discontinuity of 
Z>“/ ( x), D_f (; x ), on the left. 

Suppose that D+f (x) has, at the point x 0 , a limit A, at x 0 , on the right; 
then, if 8 be a prescribed positive number, an interval ( x 0 , x 0 + e) can be 
found, such that D+f (x), for every point of this interval, except x 0 , lies 
between A -f 8 and A — 8. The upper and lower boundaries of each of the 
four derivatives D+f (x), D+f (x), D~f (x), D_f(x), for any interval 

♦ See W. H. and G. 0. Young, Quarterly Journal of Math. vol. XL (1909), p. 10. 
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( x o + j x o + €), where < e, must all lie between the values A -f 8, A -- 8; 
hence the upper and lower boundaries of J) f ( x ), for the interval (x 0 , # 0 + e), 
lie between these same values, the function Df (x) being regarded as un¬ 
defined at the point x 0 ; and these upper and lower boundaries of I) j (x) 
are the same as those of Df(x), D+f (x), for (x 0 ,x 0 -f e), the point x 0 
being included. It follows that D + f (a; 0 ), D + f (x 0 ) both lie between A 4 8 
and A — 8; and as this holds for every value of 8, we must have 
D+fM-I> + f M = A-A'; 

where A' denotes the limit of 7 > + / (x), at x 0 , on the right; and thus D j (x ) 7 
D+j ( x ) are both continuous at x w , on the right. If A f oo, then in the 
interval (^ 0 , # 0 -f- e), at every point except x 0 , Df (x) > c, where c is an 
arbitrarily chosen number on which t depends; the argument then pro¬ 
ceeds as before. 

284 . A continuous function f (x) cannot have, at every point of a whole 
interval , a single-valued derivative on the right , which is everywhere infinite 
and of the same sign. 

For if / (x) had this property in an interval (a, ft, so also would 

/(*)-/(«) p-11 17 09 ) -/(«)!> 

and this function necessarily has a maximum or minimum within (a, ft, 
which is contrary to the condition that it has a derivative on the right 
which is always of the same sign; for this involves the condition that the 
function must constantly increase as x increases from a to ft 

Let us now suppose that the continuous function / (x) has, at all points 
of (a, 6), single-valued derivatives on the right (finite or infinite), such that, 
in a part («, ft, of (a , b), this derivative is continuous at least on one side; 
the function f (x) is then such that , at an infinite number of points, it possesses 
an ordinary differential coefficient . 

The derivative Df (ft, on the right, cannot at all points of (a, ft) be 
infinite. For if we take a point x Q such that it is continuous on one side, 
in the extended sense of the term explained in § 219 , then if it were every¬ 
where infinite, its sign at all points in an interval on the one side of 
would be the same; but it has been shewn to be impossible that, everv- 
where in any interval, Df (x) should be infinite and of constant sign. It 
follows that there are points in the neighbourhood of x 0 at which Df (ft 
is finite. If x x be such a point in (a, ft, then, since Df (x) is continuous on 
one side at x 1 , an interval can be found at all points of which it is finite, 
and also continuous on one side. If (aj, ft) be such an interval in (a, ft), 
then since Df {x) is everywhere finite in it, and continuous on one side at 
least, it is a point-wise discontinuous function, if it be not continuous in 
(«i, ft); and there must therefore be an infinity of points in (a lt ft) at 
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which l)f ( x) is continuous. At such points, in accordance with § 280 , 
/ (x) has a differential coefficient. 

285. As regards functions which have an infinite number of oscillations in 
the neighbourhood of a particular point, the folio wing remarks may be made. 

If a continuous function have, in every neighbourhood on the right of 
a point x 0 , an infinite number of maxima and minima, there is, in such 
neighbourhoods, an infinity of points at which the derivatives on the right 
are negative or zero, and an infinity of points at which these derivatives 
are positive or zero. It follows from this, that none of the derivatives at 
a point x, on the right of x 0 , can have a definite limit, as x approaches 
the limit x 0 , unless such limit be zero. 

In particular, if at all such points x , definite derivatives on the right 
and on the left exist, these derivatives cannot be continuous at x 0 , unless 
the derivatives at x 0 are both zero. 

If, at the point x 0 , and at every point in the neighbourhood of x 0 which 
contains an infinite number of maxima and minima, a differential coeffi¬ 
cient exist, which is continuous at x 0 , this differential coefficient must be 
zero at x 0 and at an infinity of points in the neighbourhood of .r 0 , and 
must therefore itself have an infinite number of maxima and minima in 
the neighbourhood of :r 0 . 

If a function / ( x ), which has an infinite number of maxima and minima 
in the neighbourhood of x 0 , have at x lu and in its neighbourhood, differ¬ 
ential coefficients of any number of orders, then they are all functions 
with an infinite number of maxima and minima in the neighbourhood of 
x 0 , and all of them vanish at x 0t except that the one of highest order may 
be discontinuous at x 0 , not then necessarily vanishing at that point. If 
differential coefficients of all orders exist, they must all vanish at x 0 ; and 
such a function is incapable of expansion in powers of x x 0 in the neigh¬ 
bourhood of # 0 . An example* of a function of this kind is 

1 1 

x 2 4- e ^ sin 

* “ *o 

FUNCTIONS WITH ONE DERIVATIVE ASSIGNED 

286. If two functions , defined for a given interval , have each bounded 
derivatives , and if the two functions have each a particular one of their four 
derivatives , say the upper derivative on the right , equal to one another at every 
point which does not belong to a set of points E, of measure zero , then the two 
functions differ from one another by a constant , the same for the whole interval , 

This theorem! differs from that of § 267 , in that the functions are 
restricted to be such continuous functions as have bounded derivatives; 
it is however more general, in that E is not restricted to be enumerable. 

* Dini-Liiroth, Grundlagen , p. 314. f Lcbesgue’s Lemons sur Vintegration (1904), p. 79. 
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Let the points of E be enclosed in the interiors of intervals of a set, of 
which the total length has the arbitrarily small value €. To each point P, 
of E, there corresponds an interval PP', where PP' is that part of the 
interval of the set that encloses P which is on the right of P ; these intervals 
PP' may be denoted by 8'. If the two functions f x (x), f 2 (x) are such that, 
at a point Xj, D i f 1 (rj = Z>+/ 2 (xj, it has been seen in § 267 that 
D + f (*i) 0, D+f(x x ) £ 0, 

where / (x) denotes j\ (x) — f 2 ( x ). Since / (a:) is continuous at x x , it follows 
that there is a set of points x l -f- h, on the right of x x , such that 

\f(*i + -f(* i) | - 

if we suppose h to have the greatest value for which this holds, the interval 
(#!, x x -f h) is an interval on the right of x } , and such intervals may be 
denoted by 8. 

Let f be any point such that a < f 6, and consider the interval (a, f). 
From the point a lay off an interval 8, or 8', according as a is not, or is, 
a point of E\ from the end of this interval lay off another interval 8, or S', 
as the case may be. Proceeding in this manner, we may construct a 
Lebesgue chain reaching from a to f (see § 78 ). The set of points not 
interior to the intervals of the chain is a closed enumerable set. We can 
now find / (f) — / (a) as the sum, or limiting sum, of the differences of the 
functional values at the end-points of the intervals of the chain, and each 
of which is either a 8, or a 8'. It is clear that 

|/(f) -/(a) | * €E8 + AXV < e (f - a + A), 
where the summations refer to those of the intervals 8,8' which have been 
employed in the construction; and A denotes the finite upper boundary 
of | I (z lf x 2 ) |, for every pair of points, x l9 x 2 in the interval (a, f); and 
this is identical with the upper boundary of the absolute value of the 
derivatives of / (x) in the interval. Since e is arbitrarily small, it follows 
that /(f) =/(a), and therefore f x (f) - f 2 (f) =/ x (a) - f 2 (a); thus the 
theorem has been established. 

THE CONSTRUCTION OF CONTINUOUS FUNCTIONS 

287 . One of the most fruitful methods of obtaining continuous func¬ 
tions which exhibit various peculiarities as regards the existence or non¬ 
existence of differential coefficients at all the points, or at sets of points, 
of their domain, consists in defining the functions by means of series 
specially constructed with a view to the purpose in hand; this method 
will be explained and illustrated in Vol. II. Broden, Kopcke, and others, 
have however given direct constructions for continuous functions, which 
illustrate various possibilities in relation to the existence and properties 
of derivatives. 
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The method employed* by Broden is that of defining a continuous 
function, in the domain ( a , b)> as the function obtained by extension (see 
§ 241 ) of a function defined for an enumerable everywhere dense set of 
points in (a, 6), the primary points. A continuous function is entirely 
determinate when the functional values at such a primary set of points 
have been assigned. The necessary and sufficient condition that a function 
defined for the primary set should, by extension to the domain (a, b), give 
a function which is continuous in that domain, is that the primary function 
should be uniformly continuous with respect to the unclosed primary 
domain. To prove this, let {£} denote the set of primary points, and {x} 
the set of secondary points; then the condition that the function / (f) 
may be uniformly continuous with respect to the domain {£}, is that, if 
be any point of {£}, and if 77 be a prescribed arbitrarily small number, 
the condition | /(£)—/ (fi) | < 17 be satisfied at all points £ which are such 
that | f | < c, where e is a number dependent on 77 , the same for all 
points fj, of {£}. Now, assuming that this condition is satisfied, let x 1 be a 
secondary point, and let f 2 , ... £ n ••• > £1 , , ••• in ••• be any two se¬ 

quences of primary points, each of which has x x as its limit; we have to 
shew that each of the sequences 

f til), ({»)-, 

/(&'),/(&'), 

converges to the same number, which will then be the single functional 
value f(x x ). Enclose x 1 in the interval (x x — |c, x 1 -f Je); then, from and 
after some particular value of n, all the points of both sequences of values 
of f lie within this neighbourhood. Let this value of n be m, then 

\f(U ~f(tm + r) | < V> 

for all positive integral values of r; hence the first sequence of functional 
values is convergent, since 17 is arbitrary; and similarly the second is also 
convergent. Also for every 77 there is a definite m such that 

hence the two convergent sequences have the same limit, and this limit 
defines f (x^. We have now to shew that the single-valued function so 
defined is continuous. We have 

|/(*i) -/(&) I < v, provided | x l - & | < 

and |/ (x t ) -/(£>) | < v> provided | x 2 - & | < Je; 

also |/(&)-/(&) I < V, provided | & - £, | < e. 

Hence it follows that | / (x 2 ) — f (x 2 ) | < 377, 
and this holds provided | x 2 — x 1 | < 2 c, 

* Crelle's Journal , vol. cxvm (1897), p. 1. See also Acta Univ. Lund. vol. xxxm (1897): 
Functiontheoretische Bemerkungen und Sdtze. 
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for £i > & can be taken to be between x ± and x 2 ; and therefore / (x) is con¬ 
tinuous at x 1 , since 3 rj is at our choice. The extended / (#) is also easily 
seen to be continuous at any primary point. It has now been proved that 
the condition of uniform continuity is sufficient; that it is necessary follows 
from the theorem of § 217 . 


The derivatives at any point depend only on the functional values at 
the primary points in the neighbourhood of the point. For let x 1 be any 

f (x) ~ /* (x 1 

point, and consider the limit of J K ^ ^ , when x has any sequence of 

values which converge to x x . A set of primary values of x can always be 
found, such that the ratio converges to the same limit, when x has the 
values of this sequence of primary points, as for the prescribed sequence 
consisting of secondary points, or of both primary and secondary points. 
For a primary point £ can be found, corresponding to x , such that 

|/(*)-/(*i) _/(£) -_H x i) 

| x-x 1 f - X 1 

where 8 is an arbitrarily small number. This follows from the fact that 

f (x ) - f(x t ) 
x — x x 

is a continuous function of x at every point except x 2 . 


288 . In order to construct monotone continuous functions, the values 
of the function are first assigned at the end-points a, b of the interval, 
then at two points x 0 , x 1 , where x 0 < x x ; then at four points x w , x 01 , x l0 , x u , 

Wf,ere *oo < , *o < *oi < *io < *1. and X 1 <X n \ 

afterwards at eight points 

^000 j X 001> ^OlO? ^011 > ^lOO » ^101 j ^110 > ^111 > & c - 

lying in the successive intervals measured from left to right, into which 
(a, b) was divided by the four points; and so on. The function may then 
be regarded as the limit of a sequence of continuous functions, each of 
which is representable as a polygon obtained by joining the end-points of 
ordinates which represent the functional values that have been assigned 
at any stage of the process. 

In this manner Broden has constructed a monotone continuous func¬ 
tion / (x), which is such that it has derivatives on the right, and on the 
left, which are everywhere definite, finite, and different from zero; and 
such that a definite differential coefficient everywhere exists, except at the 
everywhere dense enumerable set of primary points. 

He has also constructed a monotone function / (x) which is such that, 
at the everywhere dense enumerable set of primary points, the derivative 
on the left exists, and is zero, and the derivative on the right exists, and 
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is positive; for an unenumerable everywhere dense set of points there is 
a differential coefficient, everywhere zero; and for another such set of 
points, there is no definite derivative on the left, but there is a positive 
one on the right. 

A third case is the following: 

f (x) is continuous, monotone, and increasing; at an everywhere dense 
enumerable set of points the derivative on the left is zero, and that on 
the right is -f oo; for an everywhere dense unenumerable set, both deri¬ 
vatives exist, and are positive; for another such set both exist, and are 
zero; for a third such set, both derivatives exist, and are + oo; for a 
fourth such set, neither derivative exists; for a fifth such set, the derivative 
on the left is zero, and that on the right is indefinite, but has zero for its 
lower limit; for a sixth such set, the derivative on the right is f oo, but 
that on the left is indefinite, with -f oo for its upper limit. 

289. For the construction of everywhere oscillating continuous func¬ 
tions it is more convenient to assign successively the functional values at 
sets of points proceeding by powers of 3 , instead of 2 , as in the case of 
monotone functions. In this manner Brod&i has constructed such a 
function/ (z), which has the following properties: 

At an everywhere dense enumerable set of points, the derivative on 
the left exists, and is positive; that on the right exists, and is negative 
(or the reverse), this set corresponding to maxima and minima of the 
function; for a certain unenumerable everywhere dense set, there is a 
differential coefficient everywhere of the same sign; and for another such 
set, there is a differential coefficient which is zero; for a third such set, 
one or both of the derivatives are indefinite. 

Kopcke* has given the first example of a function which is everywhere 
oscillating, and yet has at every point a definite differential coefficient, 
thus confirming the conjecture of Dini that such functions can exist; and 
Broden| has also constructed such a function. A general theory of such 
functions has been givenJ by Schoenflies. The method adopted by Kopcke 
is to construct the function as the limit of a succession of polygons of 
which the sides are circular arcs. Everywhere oscillating functions have 
also been studied by Steinitz§. A detailed account of all the special cases 
treated of by these writers would require much space; reference can 
therefore only be made to the original memoirs. A simplification of 
Kopcke’s construction, due to Pereno, will be given in Vol. II. 

* Math. Ann. vol. xxix (1887), p. 123; vol. xxxiv (1889), p. 161; vol. xxxv (1890), p. 104. 
See also Pereno, (horn, di Mat. vol. xxxv (1897), p. 132. 

f Stockholm Vet. Ak. Ofv. (1900), pp. 423 and 743. 

X Math. Ann. vol. liv (1901), p. 563; also his Bericht , vol. I, p. 164. 

§ Math. Annalen f vol. lii (1899), p. 58. 
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290. A function / (x) which is of such a character that it can be repre¬ 
sented approximately by a graph that exhibits all the peculiarities of the 
function, so that y - f (x) is the equation of a “curve,” in the ordinary 
sense of the term, must satisfy the following three conditions: 

(1) . The function must be continuous everywhere, with the possible 
exception of a finite number of points, at which it may have ordinary 
discontinuities. 

(2) . Tt must be differentiable, except that there may be a finite number 
of points at which no differential coefficients exist, but at which definite 
derivatives on the right and on the left exist. 

(3) . It can have only a finite number of maxima and minima; and the 
same must hold of every function obtainable by the addition of a linear 
function to the one in question. This condition may be expressed in the 
form, that the function must be in general monotone with reference to 
every possible axis which may be employed for the measurement of 
abscissae. 

A function which satisfies these conditions may be characterized* as 
an ordinary function. As has been already indicated, there exist functions 
which satisfy the conditions (1) and (3), but do not satisfy the condition (2). 
Again, there exist functions which satisfy the conditions (1) and (2), but 
not the condition (3). 

GENERAL PROPERTIES OE DERIVATIVES 

291. A number of important properties of the derivatives of a con¬ 
tinuous function have been investigated in §§ 279-286. Some of the most 
interesting properties of the derivatives of a function hold for functions 
that are not restricted to be continuous, and are either quite general, or 
are only restricted to belong to the wide class of measurable functions. 
It will in general be assumed that a function / (u;), defined for any interval 
of x , either finite or indefinitely great, has a definite finite value, for each 
value of x, whether the function be bounded or not. 

If f (x) be defined in a finite , or an unbounded , interval, and k be any 
fixed number, the set of points x, at each of which the conditions D v f (x) ^ k, 
D~f (#) < k are satisfied, is enumerable |. 

Let O be the set of points at which l) + f (x) £ k ; and let {c n }, {rj n } be 
two monotone sequences of positive numbers that converge to zero. If f 
be a point of G, and h be a sufficiently small positive number, 

1 (f, £ + h) a * — r] n . 

* Du Bois-Reymond, CreUes Journal , vol. lxxix (1875), p. 32. 

t See G. C. Young, Acta Math. vol. xxxvii (1914), p. 143, and Quarterly Journal of Math. 
vol. xl vu (1916), p. 127. 




All the values of h for which this condition is satisfied have an upper 
boundary h, at which the condition may, or may not, be satisfied. Let 
f = £ -f \h, then we have, for such a point f, of G , a definite interval (£, f), 
at every point of which the condition 1 (£, x) ^ k — rj n is satisfied. We now 
take, for the point f, the interval 8g on its right, of length equal to the 
smaller of the two numbers e n , f — so that, in this interval, both the 


conditions 


/ ((, x) a k - rj n , I f - X I s e„ 


are satisfied. Let A n be the set of all such intervals 8g, when every point 
in G, is considered; and let A be the set {A„} of all such intervals, when all 
values of n are taken into account. There exists at most an enumerable 


set H , of the points of G, that are not interior points of any interval of A 
(see § 72). If £' be a point of G — H , it is interior to an interval (£, 77 ), 
of A„, whatever value of n is taken. It follows that I (£, f') k — rj n . As 
n is indefinitely increased, £ will converge to and r) n converges to 0 ; 
it is thus seen that D~f (£') > k. This result holds at every point £', of G y 
that does not belong to the enumerable set H, and thus the theorem has 
been established. 


It is clear that the corresponding result holds, that the set of points 
at which Z)_/ (#) £ k, D*f (x) < k is enumerable. 

If / (x) be changed into — / (x), and k into — k , we see that the set of 
points at which D + f (x) 2; k, D_f (x) > k is enumerable, and that the set 
of points at which D f (x) < k, D+f (x) > k is also enumerable. 

It is clear that the set of points, at each of which D+f (x) =4-00, 
< k , being a part of the enumerable set at which D+f (x) >, k , 
D f (x) < k, is enumerable. If we give to k successively the values in a 
divergent sequence {k n } of positive numbers, the set at which 
D+f ( x ) = + 00, D-f (x) < go, 

is such that each point belongs to one of the enumerable sets for which 
n,f{x) = -foo, D~f (x) < k n ; it has thus been shewn that: 

The set of points at which D+ f (x) — + 00, and D~f (x) is finite, or — 00, 
is enumerable. 


Similarly the three sets at which D__f(x) is -f 00, and D+f {x) is finite, 
or — 00 ; at which D A f (x) is -- 00, and D_f (x) is finite or 4- 00; and at 
which D~f (x) is — 00, and D 4 / (x) is finite or 4 00, are all enumerable. 


292. Except at points of an enumerable set the upper derivative on one 
side is greater than , or equal to, the lower derivative on the other side. 

Let us consider a point x, at which D^f(x) = a , D_f (x) = b, where 
a and b are numbers such that a <b \ the number b may be 4 00 . Let k 
be a rational number interior to the interval (a, 6 ), then x belongs to the 
set of points at which D+f (x) < k, D_f(x) > k, and this set E k , being a 
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part of the enumerable set at which D+f (x) < k , 2>„ / (x) £ k , is enumerable. 
Any point x, at which Z) + / (a:) < />_/(#), belongs to E k for all properly 
chosen values of k. Any point at which D+f (x) is finite, or — oo, and also 
D_f (x) = + oo, belongs to the enumerable set E^. All the sets E k for all 
rational values of k, and also for k = -f oo, form an enumerable set. There¬ 
fore all the points at which D'f(x) < D_f (x) form an enumerable set. 
Similarly, it is seen that the set of points at which I) f (x) < D + f(x) is 
enumerable, and thus the theorem is established. 

In the case of a continuous function / ( x), there is a derivative on one 
side which has any assigned value not greater than the upper derivative, 
and not less than the lower derivative on that side (see § 260). At every 
point x that does not belong to a certain enumerable set, 

!>+/ (a:) -c D-f {x), and D f (x) £ 1) t f ( x ); 
thus at least one point of the closed interval (/)_/ (.r), D~f (x)) lies in the 
closed interval (D+f(x), D*f (x)). We have then the following theorem: 

A continuous function has at least one symmetrical median, or extreme, 
derivative at every 'point that does not belong to a certain set of measure 
zero. 

The following theorem follows at once from the above general theorem: 

Except at points belonging to an enumerable set, a function cannot have 
unique progressive and regressive derivatives which are unequal, whether finite, 
or infinite with a determinate sign. 

293. The following theorem*, which will be of use in the further dis¬ 
cussion, will now be established: 

If f (x) is continuous with respect to a perfect set G, contained in an 
interval (a, b), in which f (:r) is defined, and if, at each point x, of G , there is 
a closed interval (x, x f h x ) on its right, such that, if x' be any point in it, 
other than x itself, f (x) ^ f (x), then an interval (a, fi) exists, in (a, b), con¬ 
taining a part of G, such that, for every pair of points x, x , in (a, ft), which 
are such that x belongs to G, and x' £ x, the condition f (x') & f (x) is satisfied. 

Let a system of nets with closed meshes be fitted on to the interval 
(a, b). If, in any net D n , there is a mesh d n which contains, in its interior, 
points of G such that, for each such point x, the condition / (x r ) s / (x) is 
satisfied for all points x', in the mesh d n , which are on the right of x, the 
mesh* d n , or else that part of it obtained by cutting off a portion of its 
left end, will be an interval such as is required. Let it be assumed that, 
if possible, for each value of n, and for every mesh d n , which contains 

* See G. C. Young, Proc . Land. Math. Soc. (2), vol. xv (1916), p. 367; the proof there given 
differs from that in the text, in that it seems to require an infinite number of undetermined acts 
of choice, as applied to successive pairs of points. For corrections, see Proc. Lond. Math. Soc. 
(2), vol. xix (1921), p. 152. 
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points of G in its interior, there is at least one pair of points x, x f (> x), 
where x is a point of G y and such that / (x r ) > f (x ); and where x is interior 
to d n . 

In D n , let H n be the set of those points x , of G, each of which is interior 
to a mesh of D n , and such that there is a point x' (> x) in the same mesh, 
for which f (x f ) > f (x). Let K n be the set of all other points of G \ thus 
G = H n + K n . 

It will be shewn that, on the assumption that the theorem does not 
hold, the set K n is non-dense in G . Let A be any interval that contains 
points of G in its interior. For a sufficiently large value of n, A will contain, 
in its interior, a mesh of D n that has, within it, points of G; one of such 
points ./* is such that / ( x f ) > / (x), for some point x' (> x) contained in the 
mesh. A neighbourhood (x - e, x + e') of x, contained in the mesh, can 
be so determined that, for all points x , of G , in that interval, / ( x ') > / (x ); 
this follows from the continuity of / (.r) in G. Therefore no point f, of G , 
in (x — e, x + e') is a point of K n . Since, interior to an arbitrary interval A 
which contains in its interior points of G , another interval (x — c, x e') 
can be so determined as to contain points of G , none of which belong to 
K n , it follows that K n is non-dense in G. 

We see that II n contains II n hl ; and the sets K n being all non-dense in 6r, 
the set M (K x , K 2 , ... K n , ...) is of the first category relatively to G. It 
follows that there exists a set II w , a residual with respect to G, that 
consists of points each of w hich belongs to H n , for every value of n. If x 
be a point of i/ w , there exists an interval x + h x , such that, for every 
point x f in it, / (x r ) ± / (x). 

If n be sufficiently large, the part of that mesh of D n containing x , that 
is on the right of x, will be contained in (x y x -f h x ), and consequently, in 
that mesh there is no point x ' (> x) such that f (x f ) > f (x); and this is 
contrary to the hypothesis. There is consequently a contradiction in the 
assumption made. Hence the theorem has been established. 

294. The following theorem can be deduced from that of § 293: 

//, at each point of the perfect set G, with respect to which f (x) is con¬ 
tinuous, D+f (x) < 0, an interval (a, jS) can be so determined that , for any 
interval contained in it , with a point of G as left-hand end-point , the incre¬ 
mentary ratio is ^ 0. 

For if, at a point x, of G, we have D+f (#) < 0, a set of intervals 
(x f x + h) exists, such that / (£) ^ / (#), for all points £ (4= x) of ( x, x -f h). 
There will be an upper boundary h, of h , and at the point x + h, we may 
have / (x -f h) > f (x). The neighbourhood (x y x -f \h) may be assigned 
to x as the interval ( x , x + h x ) in the theorem proved above. In the interval 
(a, j8), we have / ( x } x f ) ^ 0, at every point x, of G. 
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If we replace f (x), in the above theorem, by / (x) — kx, we have the 
following theorem: 

If, at each point of the perfect set G, with respect to which f (#) is con¬ 
tinuous, D + f (x) < k, an interval (a, ft) can be so determined that, for any 
interval contained in it, with a point of G as left-hand end-point, the incre¬ 
mentary ratio is 5 . k. 

If we change / (x) into — / (x), and k into — k ; since 
£ + {-/(*)}= -D+f(x) 

we have the corresponding theorem, in which D + f (x) > k takes the place 
of the condition 1 ) f / (a:) < k, and the incrernentary ratio in (a, ft) is 5 k. 

It is clear that corresponding theorems hold for I)~ f (x), D_f (x). 

295. In order not to interrupt the continuity of the account of the 
main properties of the derivatives of functions, and to secure the greatest 
attainable degree of generality in the results, the conception of a measur¬ 
able function will be here introduced. A function f (x), defined in any 
interval (a, b), is said to be measurable , provided that, for very value of A, 
the set of points x, of (a, 6), at which f (x) > A, is a measurable set of 
points. The properties of measurable functions will be considered in detail 
in the account of Lebesgue integrals which will be given in Chapter vn. 
An important property of a measurable function / (x) will here be assumed, 
viz. that a set E, of points of ( a , b), of measure arbitrarily near to b — a, 
the measure of the interval, exists such that / (x) is continuous with 
respect to the set E. This property of measurable functions will be estab¬ 
lished in Vol. II. If this property of a measurable function be not assumed, 
the properties of derivatives established below must be taken only to 
apply to continuous functions. It has been shewn in § 128, that any 
measurable set E contains a perfect set G, such that m (E) — m (G) is less 
than an arbitrarily chosen positive number, it being assumed that 
m (E) > 0 . It has further been shewn, in § 138, that a perfect set G , of 
measure greater than zero, contains a perfect component G, of measure 
equal to that of G, such that G is metrically dense in itself. It follows 
that a measurable function / (x), defined in the interval (a, b), is continuous 
with respect to a perfect set G, metrically dense in itself, and such that 
m (G) is less than b — a, by less than an arbitrarily chosen positive number. 

If H be a measurable set of points contained in (a, b ), and of measure 
m ( H ) > 0, the set H must contain a perfect component, metrically dense 
in itself, and such that the measurable function / (x) is continuous with 
respect to that perfect component. For G may be so defined that 

b — a — m (G) < m ( H ), 

and thus H and G have a part K in common, such that m (K) > 0. The 
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set K contains a perfect component, metrically dense in itself, and this is 
the required component of H , with respect to which / (x) is continuous. 

296. The set of points at which , for a finite measurable function f (x)> the 
conditions D + f (x) = 4- oo, D_f (x) > 0 hold , forms a set of measure zero. 

A function is said to be finite, whether it be bounded or not, when its 
value at no point of its domain is infinite. 

Suppose, if possible, that the set had a measure > 0. A part 0 of this 
set can be taken, which is perfect, and with respect to which / (x) is con¬ 
tinuous, and such that G is metrically dense in itself. An interval (a, b) 
can be so determined (see § 294) that, if f be any point of it belonging to G , 

^ ^ ------ ^0, if a ^ x < £. Let E be the part of G in (a, b)\ then 

m (E) > 0. 

To each point of (u, b) an interval, with the point as left-hand end¬ 
point, can be assigned as follows: 

To each point of E , we assign an interval such that the incrementary 
ratio in it is ^ A, a fixed number. To each point in an interval contiguous 
to E , we assign that part of the contiguous interval which is on its right. 
To 6, an arbitrary interval can be assigned. One Lebesgue chain can be 
defined, constituted of intervals of the set, which reaches from a to b ; we 
can then omit the interval corresponding to b . 

The sum of all those intervals of the chain of which the left-hand end¬ 
point belongs to E has measure > m (E ); for these intervals contain all 
the points of E , and the other intervals contain no points of E in their 
interiors. 

A finite set A, of intervals of the chain, can be so chosen that the re¬ 
mainder has measure < \m (E). 

The finite set contains therefore intervals A', of which the left-hand 
end-points belong to E, and the sum of such intervals is > \m (E). In 
the intervals of (a, b) complementary to A', the incrementary ratio is £ 0. 
In A' and its complement the increment of / (x) from left to right is £ 0; 
and the total of all such increments is / (b) — / (a). But these intervals 
have been shewn to contain a set in which the sum of the increments is 
> \Am (E), which exceeds / (6) - / (a), provided A is chosen sufficiently 
large; and this is impossible. Therefore the measure of the set is 0. 

If we substitute / (x) -f kx for / (x), we see that the set of points at 
which D+f (#) = -foo, D_f (x) > — k, is zero. 

Giving to k successively, the values in a set k lt k 2t ... which increase 
indefinitely, such that k n <k n+x , each of the sets corresponding to k x , k 2 ,..., 
has measure zero, hence the set at each point of which D + f (x) = 4-00 
!>-/(*) > — k nf for some value of n , has measure zero. 
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Therefore, at every point at which D+f (x) = -f oo, with the exception 
of a set of measure zero, we must have D_ f (x) = — oo. 

Hence we have the following theorems: 

(1) . If f (x) be a finite measurable function, and if, at the points of a set , 
D+f (x) ~ 4- oo, then D_f (x) — — oo a/, all the points of that set, with the 
exception of a part, of which the measure is zero . In the set in which 
D+f (x) = — oo, with a similar exception, D f (x ) —' -f oo. 

(2) . If f (x) be a finite measurable function, the set of points at which it 
has an infinite unique derivative, on one side , has the measure zero. 

Let Df(x) = D + f (x) — -f oo, at all points of a set G. Since 
D f (x) a D J(z), 

except at points of a set of measure zero, we must have D f (x) — r co, 
if the exceptional set be left out of account. By the last theorem, when 
D~f(x) * + oo, we must have D f f (x) — co, except at a set of measure 

zero. It follows that the set G has measure zero. 

297. (3). If f (x) is a measurable and finite function, the points at which 
D+f (x), D+f (x) are finite, and, different from one another, form a set of 
measure zero. 

If S is the set of such points, and we remove from S some set of measure 
zero, the derivatives D~f (x), D_f (x) must also be finite. This follows 
from theorem ( 1 ). We may assume then that S is a set in which all four 
derivatives are finite at each point, and D+f (rr) + D+ / (#). Let us suppose 
that, if possible m (8) > 0 . If N be a positive number, let Sx be that part 
of S in which all the four derivatives are numerically < N. If N have 
the values in an increasing sequence N 1} N 2 , ..., which diverges, we see 
that Sn contains S^ r ; the set S is the outer limiting set of the sequence 
{S Nr }, and since m ( S) > 0 , we must have m (S r ) > 0 , from and after some 
value of r. We may then take N so large that m (S\) > 0 . 

A part 2, of S\, can be determined so as to be perfect, and such that 
the measure of that part of it that lies in any interval which contains, in. 
its interior, points of 2 , has measure > 0 ; and is also such that / (#) is 
continuous relatively to 2 (see § 295). 

Let g (x) = f (x)-j-Nx, then all the derivatives D+g (x), D+g (x) f 
D~g (x), D_ g (x) are, in 2, < 2 N, and > 0 . 

An interval (a, b) can be chosen such that it contains a part G , of 2, 
such that the incrementary ratio of g (x), for any pair of points contained 
in the interval, is £ 2 N, if the right-hand point belongs to G ; or is 2 0 , 
if the left-hand point belongs to G. The interval can be so chosen that 
a and b are points of G. 

A part G k , of G , can be so chosen that D+g (x) — D+g (#) > k , and k 
can be so chosen that m (G k ) > 0 . 
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For, if k have the values of a decreasing sequence of numbers that 
converges to 0, O kr is contained in G kr ^\ thus the measure of G kr , from 
and after some value of r, must be > 0. The set G k is the sum of a number 
of sets G ki , where i is an integer, such that, in G ki , 

\ (i — 1) k $ D+g (x) < Uk, 

where t = 1, 2, 3, ... q; q being the greatest integer for which 

i (t - l)k< 2N, 

One at least of these sets G ki must have measure > 0; let i be the least 
integer for which this is the case. There exists a part E , of G k ^ that is 
perfect and has measure >0. We may, by alteration of a and b, if 
necessary, ensure that a and b belong to E. In the set E , 

| (i — 1) k £ D + g (#) < hk, and D+g (.r) > \ (i f 1) k. 

Choose a finite set of the intervals that are contiguous to E , such that 
the measure of the sum of the remaining intervals is < e, where 

0 < € < m (E). 

Let this finite set be denoted by A', and let A be the finite set comple¬ 
mentary to A', in (a, b). We have then 

m (A) > m ( E ), and rn (A) < m (E) f- e. 

Let us consider one of the intervals (a, p)> of A; a, ft are points of E . 
To each point x , of the part of E interior to (a, fi), and to the point a, 
we assign an interval (x, x -f It ), on its right, such that 
g (x -f h) - g (x) < \ikh. 

To each point in (a, /?), not a point of E , we assign that part of the interval 
contiguous to E in which it lies, that is on its right. There is a unique 
chain composed of intervals so defined, that stretches from a to Remove 
from this chain a set of intervals of total measure < e/w, such that the 
remaining intervals A" a form a finite set. The sum of the finite set of 
intervals complementary to A" a ,^ is a finite set A'" a ^, of total measure 
< e/n, where n is the number of intervals (a, fi), of A. We proceed in this 
way with each interval (a, /?), of A. 

The interval (a, b) is now divided into a finite number of parts con¬ 
sisting of (1), the set A" = DA" a> e, (2), the set A'" == £A'" a>/3 , and (3), the 
set A'. 

We have m (A'") < €. 

The intervals of A" consist partly of intervals ( x , x -f h), where x is a 
point of E, and partly of intervals not containing in their interiors points 
of E. The measure of the latter set is < c, and that of the former is 
> m (E) — e, and < m (E) + e. 

Since, for one of these intervals (x, x 4- h), we have 
g (x + h) - g (x) < \ikh t 
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and in any interval with the left-hand end-point a point of E , the incre¬ 
mentary ratio is 2 N \ we have 

f/ (b) - g (a) < lit \m (E) 4- c} 4- 2Ne 4- p, .(1) 


where p denotes the sum of the increments of g (#) in the intervals A'. 

Again, if we assigned to each point x, of E , an interval (x , x -f h') 
such that g (x 4 - h') — g (x) > h (i 4- 1) k , and proceeded as before, then, 
remembering that, in any interval in which the left-hand end-point belongs 
to G , the incrementary ratio is > 0 , we should find that 

(J(b)- 0 (a) > \(H 1) k {m (E) - e} i p .(2) 

The inequalities ( 1 ), ( 2 ) are in contradiction if 
lkm (E) > e (ck 4- l f 2A), 

which will hold provided e have been chosen sufficiently small. It follows 
that the set S cannot have measure > 0 . The method of the above proof 
can be applied to shew that a similar result holds for D~f ( x ), D + f (x). 
It thus appears that, in the theorem (3), any pair of derivatives may be 
taken. 

298. It has now been shewn that, in the set S 1 , of points for which all 
four derivatives are finite, the upper and lower derivatives on the right 
are equal, and the upper and lower derivatives on the left are equal, 
except in a part of S lt of measure zero; and thus, in a set of measure 
m ($]), there are unique derivatives both on the right and on the left. 
The points of S t at which these are unequal, form a set of measure zero, 
as is seen from the theorem that the points at which the upper derivative 
on one side is less than the lower derivative on the otlmr side form a set 
of measure zero. We have accordingly the following theorem: 

The set of points at which the four derivatives are all finite , and not all 
equal , has measure zero. Thus , in the set of points at which all the derivatives 
are finite , there is a finite differential coefficient everywhere , except at points 
of a set of measure zero. 

Next, let us consider the set S 2 , in which one derivative is infinite, 
4 - oo, or — oo, and the others are all finite. By theorem ( 1 ), S 2 has 
measure zero. 

If, at points of a set S 3 , two of the derivatives are infinite and the 
other two finite, then, except in the points of a part of the set, of which 
the measure is zero, these derivatives must be the upper derivative on 
one side and the lower derivative on the other side, and the two finite 
derivatives are equal. 

If three of the derivatives are infinite at points of a set $ 4 , then, by 
the theorems ( 1 ) and ( 2 ), all four derivatives are infinite at all points of $ 4 , 
except in a part of the set, of which the measure is zero. 
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If all four derivatives are infinite at the points of a set S A , then, except 
in a part of $ 4 , of which the measure is zero, 

D + f (x) = D~f (x) = + oo, and D+f (x) — I) f (x) — — oo. 

The general result may be stated as follows : 

Iff (x) be a measurable function, finite at each 'point, and if a set of points 
of measure zero be left out of account, then at every point x, either 

(1) , there is a finite differential coefficient, or 

(2) , the two upper derivatives are -f oo, and the two lower derivatives are 
— oo, or 

(3) , the upper derivative on one side is -f oo, the lower derivative on the 
other side is — oo, and the other two derivatives are finite, and equal. 

This general theorem was established by Penjoy* for the case of a 
continuous function, and by G. C. Youngf for the case of any measurable 
function. The investigation here given is based upon the work of the 
latter writer. 

The following is a consequence of the general theorem: 

There is no geometrical distinction of right and left with regard to the four 
extreme derivatives of a measurable finite function, except possibly at points 
of a set of measure zero . 

A number of special theorems are particular cases of the general 
theorem. Thus, we have the following: 

If a measurable function have finite derivatives except at points of a set of 
measure zero, it has a finite differential coefficient except at points of a set 
of measure zero. 

This theorem was first given by MontelJ, for the case of a continuous 
function. 

A monotone function f (x) has a finite differential coefficient at every point 
which does not belong to a set of measure zero. 

For, in the case of a monotone function, all four derivatives are ^ 0 , 
or all four are ^ 0 , and thus the cases (2) and (3) of the general theorem 
cannot arise. 

This theorem was first established§, in connection with the theory of 
Integration, by Lebesgue, for the case of a continuous function. It was 
extended by W. H. Young to the case of a monotone function which is 
not necessarily continuous!|. A proof has also been givenby Rajchman 
and Saks. 

♦ Journal de Math. (7), vol. i (1915), p. 105. 

f Proc, Lond. Math, Soc . (2), vol. xv (1916), p. 360. See also Banach, Complex Rendus, vol. 
CLxxm (1921), p. 457, and Saks, Fundamenta Mat. vol. v (1924), p. 98. 

X Complex Rendus , vol. clv (1912), p. 1478. § Lemons sur Vintegration (1904), p. 128. 

|| Quarterly Journal, vol. XLn (1911), p. 79. Fundamenta Mat. vol. iv (1923), p. 204. 
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Since every function of bounded variation is the difference of two 
monotone functions, we have the theorem that: 

A function of bounded variation has a finite differential coefficient at every 
point which does not belong to a set of measure zero. 

The following special case of the general theorem was first given by 
Lusin : 

The points at which a continuous function has an infinite differential 
coefficient (with fixed sign) form a set of measure zero. 

It has been shewn* by A. N. Singh that, if a function is finite and con¬ 
tinuous (or if its points of discontinuity form, a nan-dense set) there exists an 
everywhere dense set of points at which the function has a progressive differential 
coefficient, finite, or infinite. 

299. We have hitherto assumed that / (#), whether bounded or not, 
is finite, for each value of x. But if it be assumed that there is an excep¬ 
tional set of points, of measure zero, at which / (x) is infinite, the general 
theorem will still hold good, as is seen by examining the proofs of the 
various theorems. It is assumed that, in the definition of a derivative 
Df(x), the incrementary ratio, for a pair of points at which the function 
is infinite, with the same sign, is ignored. At a point where f (x) — + oo, 
we have 

D'f (a) = D+f (x) - - oo, and D~f (x) - XL/ (z) = + oo; 

and at a point where / (x) — — oo, we have 

( x ) — D+f {x) — + oo, and D f (x) =* /)_/ (x) = — oo. 

It thus appears that the set of points at which the upper derivative on 

one side is less than the lower derivative on the other side contains the 
set of points at which the function is infinite. 

An extension has been made by G. 0. Youngf to the case in which 
the function is infinite at points of a set of which the measure is > 0. 

It has been shewn by Denjoy that a continuous function can be con¬ 
structed, in which all three cases of the general theorem present them¬ 
selves for one function, or in which one or more of the cases may be absent. 

An investigation has been givenJ by Burkill of the properties of the 
derivatives of functions of intervals. 

EXAMPLES 

The following examples have been given by Denjoy: 

1. Let 0 be a perfect non-dense set of measure > 0; let f{x) -0, at all points of G; and 
in each complementary interval (a n , b n ) of G, let f (x) = * {(x - a n ) (b n -x)}*. At each 

1,0111 ° ’ V+f(x)=+ oo, D + f(x)= 0, Lr~ f (x) =0, Z>_/ {x) = -oo. 

* Bulletin Calcutta Math. Soc. vol. xvi (1925-26), p. 101. 

X Proc. Lond. Math. Soc. (2), vol. xxn (1923), p. 293. 


t Loc. cit. pp. 378-384. 
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In each complementary interval / (x) has the maximum value and thus the function 
is not continuous at the points of G. 

2. Let G be a perfect non-dense set, of measure zero, in the interval (ct, b). Let u n be 
the length of the contiguous interval («.„,/)„); and suppose that u x , ?/ 2 , ... v n , ... are in 
descending order of magnitude. Let h y , h 2 , ... h nt ... be positive numbers such that 2h n u n 

is convergent, and let y n ( x) — ? h n u n sin -1 (- for a n < x £ b n . The function 

n \ u n / 

/ {x) {a 2x) h n u n + ioij m (x), 

where co is zero if x is a point of G , and is 1, if a* is interior to a contiguous interval; and 
(a2x) h n u n denotes the sum of those parts of 2h n u n that are in the interval (a, x). The 
function / (a;) is continuous; it has a finite differential coefficient at every point that does 
not belong to G , and it has an infinite differential coefficient at all points of G. 

3. With the same notation as in Ex. 2, lot / (x) =0, in all points of G, and let 

in the interval (a n ,b n ); where y{x)~(l ~x) 2 x* sin 2 \ The function f (x) is continuous; 

it has a finite and continuous differential coefficient at all points not belonging to G. In 
all end-points of contiguous intervals liA f (x) -- +-<*>; and the other three derivatives are all 
zero at all points of G. 

4. Let E be a perfect set, metrically dense in itself; and let 

y( a: )=4{x(l-x)}»8i^ 8x(1 ,r _ x) . 

Let / (x) =0, at all points of E; in u n let / (x) =f>,^ y (* ^ °"j, where p„ is chosen as follows: 

From the interval (a, b) remove the intervals u x , u. z , ... u n l ; then u n lies in one of the 
remaining segments of (a, b); let p n be the length of this segment. In all points of E , 

D+f {x)~ + °o ; J)_f(x)--a o; I) + f (x) = D~ / {x) =0. 

5. Defining E as in Ex. 4, let f (x) -0, in all points of E , and let 

Vn (X) -{Pn*( 1 - *)}* sin _ x) . 

In all points of E, 

D*-f(x)=D-f{z)= + *, D + f(x) = D_f(x) = ~co. 

300. The following general theorem is due* to W. H. Young. 

The points at which one at least of the four derivatives of any given 
function is infinite form an ordinary inner limiting set , if it exists. The set 
of such points is accordingly of power c, when it contains a component dense 
in itself; and otherwise it is enumerable , or finite. 

It follows that the set of such points is a set of the second category, 
in case it be everywhere dense. 

The special case of the theorem which arises when the function is con¬ 
tinuous, and the set of points is everywhere dense, was given by Brod6nf. 

* Arkiv for Maiematik , Astronomi och Fyaik, vol. I (1903). 
f Acta Univ. Lund. vol. xxxm (1897), p. 31. 
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Let x 0 be a point at which one of the four derivatives is infinite, it 
being immaterial whether the other derivatives are infinite or finite. 
A sequence x l9 x 2 , ... x nf ... converging to x 0 , and on one side of it, can 
be found, which has the property that, corresponding to an arbitrarily 
large positive number a, an integer can be found, such that 

f ^ > a for n> m 1 ; 
x n — x 0 

further, m can be chosen so great that 

I x n ~~ x o | < - , for n > m! > m 1 . 

Let the intervals ( x 0 , x n ) be prolonged on the side beyond x 0 , each being 
increased by -—— of its length; and the whole set of intervals so con¬ 
structed for every point x of the set at which a derivative is infinite may 
be called I a . 

Let <7 1? cr 2 , cr 3 , ... be a set of values of a which increase without limit; 
then the corresponding sets of intervals I 9x ,7^, ... define an inner limiting 
set of points, to which all the points x of the given set belong; and it will 
be shewn that no other points belong to this inner limiting set. If possible 
let f be a point of the inner limiting set which does not belong to the 
given set of points at which a derivative is infinite. There is at least one 
interval of each of the sets 7^, I , ..., such that £ is an interior point 
of it; let such intervals be 8^ 8 2 , ..., and let ^, £ 2 , f 3 , ... be points of the 
given set interior to these intervals. Let fi, ? 2 > ?s» ••• the end-points 
of the intervals on the sides of those intervals which were not lengthened. 
We have / i \ i 

8 ‘ = 0 + a, - l) ~ I < a ; - 1 ’ 

thus the points •••> and also the points f x f 2 » •••> form a se¬ 

quence of which £ is the limit. 

Since ^- > a t , therefore |/(£) — /(?.) | > (cr t - l)8 t . 

f« ~ f* 

Also a positive number A can be determined, such that for all values of i, 

/(f)-/(f.) 

s-i ’ 

for otherwise f would be a point with an infinite derivative; and from 
thia we see that | (f) | < AS,. 

For a sufficiently great value of i, 


cr t — 1 > A ; 
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hence, for such a value of i, 

|/tf) -/(?.) I > W,-A — 1)8. 

and thus f_(€) > CTi _ A - 1. 

f-f. 

Now a t — A — 1 is arbitrarily large for a sufficiently great i; hence, since 
$ is the limiting point of the sequence {f t }, there is an infinite derivative 
at £. This is contrary to the hypothesis made; therefore the points of the 
given set constitute the inner limiting set which has been defined. 

FUNCTIONS OF TWO VARIABLES 

301. It has already been shewn that many of the fundamental concep¬ 
tions and theorems relating to the continuity, uniform continuity, &c., of 
functions of one variable hold good, without essential change, for functions 
of two or more variables. 

Most of the points in which the theory of functions of a number of 
variables involves considerations which are not an immediate generaliza¬ 
tion of those which occur in the case of functions of a single variable are 
sufficiently illustrated by the case of functions of two variables. Accord¬ 
ingly the properties of functions of two variables will be considered in 
some detail. 

That a function / (x, y) should be continuous at a point (a, b), which 
is a limiting point of its domain, it is necessary, but not sufficient, that 
the function f (x,b), of x , should be continuous at the point x — a, and 
that the function / (a, y), of y, should be continuous at the point y = 6. 
Thus a function may be continuous at a point with respect to x, and also 
with respect to y, whilst it is discontinuous with respect to the two- 
dimensional domain (x, y). 

It is not even sufficient to ensure the continuity of / (x, y) at a point, 
that it be continuous in every direction from the point. Thus 
f (a + r cos 0, b + r sin 0) 

may be a continuous function of r, at r = 0, for each value of 0 in the 
interval (0, 2n), and yet* the function may be discontinuous at (a, b). 

The necessary and sufficient condition that / (x, y) may be continuous 
at (a, b) may be expressed in the form that / (x, y) must be continuous in 
every direction at the point, and uniformly so for all directions. 

Thus, if / (a + r cos 6, b + r sin $) be continuous at r = 0, for each 
value of 6, and uniformly so for all values of 6, then, if e be a prescribed 
positive number, a number p can be determined, independent of d, such that 
| / (a -f r cos 6, b *f r sin 6) — f (a, b) | < e, 


See Thom&e, Abriss einer Theorte der komplexen Functionen, 2nd ed. p. 15. 
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provided r < p. From this condition it follows that | / (rr, y) —f (a, b) | < e, 
provided | x — a |, | y — b | are each < p/\/2, and thus the condition of 
continuity of the function is satisfied. 

The remarks which have been made as regards the continuity of a 
function at a point are applicable without essential change if those func¬ 
tional values in the neighbourhood of the point are alone taken into 
account, which are in one of the four quadrants, the values at points on 
the axes bounding the quadrant being either included or excluded from 
consideration, as may be agreed upon. Thus the condition of continuity 
at a point may be satisfied for one such quadrant, and not for another one. 

As regards the points of discontinuity of a function of two or more 
variables reference may be made to a memoir* by W. H. Young. 

EXAMPLES 

L Let f(x y y)~ and / (0, 0)-~0. This function is discontinuous at the point 

x + y 

(0, 0), although it is continuous at that point with respect to x y and also with respect to y t 
since / (x, 0) =0, / ( y , 0) =0. In all other directions the function is discontinuous; for writing 
x — r cos 0, y—r sin 6, the function is £ sin 20 and therefore has a constant value different 
from zero on a straight lino for which 0 is constant, unless 0 has one of the values 0, \ir y n , 
or $n. 

xxi^ 

2. Letf f (x, y) = - - ■ y ~. y / (0, 0)=0. This function is discontinuous at the point 

X 4 If 

(0, 0), although it is continuous in each particular direction, at that point. We find that 

t sin 2 0 1 /— ... ... 

— -z—r—r?; < e, if r < — cosec 2 0 (I - v 1 - 4e 2 cos 2 0}; and in order that this condition may 
cos 2 0 + r l sin 4 0 2e 1 } J 

be satisfied, the greatest value of r diminishes indefinitely as 0 approaches the value Jtt; 

whereas when 0 — J7r, the function is, for every value of r, equal to /(0, 0). It is thus seen 

that the convergence in different directions is non-uniform. 

DOUBLE AND REPEATED LIMITS 

302. Let (a, b) be a limiting point of the domain for which a function 
/ ( x , y) is defined, and let a neighbourhood, of which the comers are the 
four points (a ± c, b ± e'), be taken. Let U , L be the upper and lower 
boundaries of the function for all points of the domain in this neighbour¬ 
hood, the functional value at (a, b) being however disregarded, in case 
(a, b) belongs to the domain. If e, e' be diminished, the number U cannot 
increase; and when values of e, e belonging to sequences € x> e 2 , ... € n , ..., 
and e/, e 2 ', ... c n ', ..., each of which converges to the limit zero, are taken, 
and U n be the value of U corresponding to the values e n , e n ', of e, e 7 , the 
numbers U lf U 2 , ... U ni ... form a sequence of numbers which do not 
increase. This sequence has then a limit U } which may however have the 

* W. H. Young, Proc. Lond. Math . Soc . (2), vol. vm (1909), p. 117. 
f Genocchi-Peano, Calc. Diff. § 123. 
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improper value oo, in case all the numbers U n have this improper value. 
It is easily seen that U is independent of the particular sequences chosen 
for c, e'. This number U is said to be the upper limit of the function at 
(a, b), and may be denoted by 

fim f(x,y)- 

x~a, y~~b 

The lower limit lim / ( x , y) may be defined in a similar manner, as 

y~~b 

the limit of a sequence of values of L ; and it may have the improper 
value — oo. 

At a point of continuity of the function, the condition 
lim / (*, y)= lim / (x, y) 

x~a, y~-b x~~u t y~-b 

is satisfied; and further, each of these limits is equal to / (a, 6), in case 
(a, b ) belongs to the domain of the function. 

Corresponding pairs of limits may also be defined for the case in which 
the functional values in one quadrant only are taken into account, the 
functional values on the axes being either included or excluded, in case 
they exist, as may be agreed upon. 

The saltus, or measure of discontinuity , at the point (a, b) is measured 
by the excess of the greatest over the least of the three numbers 

/(a, 6), lim f(x,y), lim f(x,y). 

x~a t y~b x~~a,y~~b 

The saltus at a point of discontinuity may have a finite value, or it may 
be indefinitely great. 

In case lim / ( x , y) = lim / (x, y), their common value may be de¬ 
ader, y~~b x-^a, y~b 

noted by lim / ( x , y), and the function is then said to have a definite 

x~a, i i~b 

double limit at the point (a, 6); this double limit lim f(x,y) may be 

x~a, y~~b 

finite, or infinite with a definite sign. 

When the upper and lower limits have different values, lim f (x, y) is 
frequently regarded as existent but indeterminate, the upper and lower 
limits being regarded as its limits of indeterminancy. 

303. In considering the functional values in the neighbourhood of a 
point, and the functional limits at the point, it is frequently convenient 
to consider one quadrant only; this we may take, without loss of generality, 
to be the quadrant in which x — a>0, y — b > 0. The results which will 
be established are essentially applicable to any one of the four quadrants, 
and can be immediately extended to the case in which account is taken 
of the whole neighbourhood of (a, 6), by taking the totality of the results 
for the four separate quadrants, and for the lines x = a, y = b. 



407 


302 , 303 ] Double and Repeated Limits 

Assuming that x — a>0 y y — b>0 y the function / ( x , y) considered as 
a function of y only, with x constant, has two functional limits/( x , b -f 0), 
f (x, b f 0), at the point (x, b)\ these may be denoted by lim/(a;, y), 

lim / ( x , y) respectively. In case these two limits are identical, their 

yZb 

common value may be denoted by lim / ( x , y) y the functional limit 

V~b 

/(*>*> H~ 0) having in that case a definite value. 

If either of the limits lim / (x, y ), lim / (x, y) is to be taken indifferently, 

y~-b y~b 

we may denote them by lim / (. x , y). This may be regarded as a function 

y~~b 

of x, such that its value at the point (x, b) is multiple-valued, and has 
lim / (x y y) y lim / (x, y) for its limits of indeterminancy. 

y~b y~b 

It may happen that lim / (x y y) y considered as a function of x , has a 

y~b 

definite functional limit at the point x — a ; this limit may be either finite, 
or infinite with fixed sign. In case such a limit exists, it is denoted by 
lim lim / ( x , y), and is said to be the repeated limit of / (x y y) at the point 

x~a y~b 

(a, b), the order of the limits being, that the limit for y ~ b is taken first, 
and then afterwards the limit for x ~ a. 

In case this repeated limit does not exist, either as a definite number, 
or as infinite with fixed sign, we may regard lim lim / (x, y) as indeter- 

x^a y~b 

ruinate, its limits of indeterminancy being 

lim lim / ( x , y) y and lim lim / ( x , y). 

x~a y~b x ~o y-b 

The repeated limit lim lim / (x y y) y in which the limit with respect 

x~a 

to # is first taken, and afterwards that with respect to y y may be defined 
in a precisely similar manner. 

It is clear that the functional values on the straight lines x — a, y ~ b 
are irrelevant as regards the existence, or the values, of the repeated limits. 

In case the double limit lim / (x y y) y for x > a, y > b y exists at the 

point (a, 6), having either a finite value, or being infinite with fixed sign, 
the existence of the two repeated limits 

lim lim / (x y y) y lim lim / (x y y) 

x~« y~b y~b x~a 

follows as a consequence, their common value being lim / (x y y). In 

x~a , y~~b 

this case lim f {x y </> (a;)} also exists, and is equal to the double limit, where 
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</> (x) is any function of x, which is > b, and is such that lim <f> ( x) = b. 

x~a 

Also lim f {<f> (t), (t)} exists, and is equal to the double limit; where 

t~T 

<f> ( t ), ifj ( t ) are functions of a variable t , such that <f> (£) > a, ip (t) > b, and 
that lim <f> (t) = a , lim i p (t) b. 

The converse of these statements does not hold good. In particular, 
the existence of lim / ( x , y) is not necessary either for the existence, 

with definite values, or for the equality, of the two repeated limits 
lira lim / (x, y) ; lim lim / (*, y). 

V~b x~n 


EXAMPLES 

1. Let / (x, y) be defined for the positive quadrant by / ( x , y) = X — -. We find 

x + y 

lim lim / (x, y) —1, lim lim / (x, y) — - 1; 
x~- 0 y ~0 0 

thus lim / (x, y) cannot exist. 

2. Let f(x,y)-- 9 * ; In this case lim lim /(x, y) and lim lim /(x, y) are 

x 2 y-+(x~y)- j-vO y-o ‘ rO j-o 

both zero, and yet lim / (x, y) does not exist; for if y =x, f (x, y) -1; and therefore 
x~0, 2 /—0 

lim /(x, x) =1. 

3. Let* / (x, y) be defined for x > 0, y > 0, by the expression (x + y) sin sin . In this 

x y 

case lim / (x, y) —x sin -, lim / (x, y) - -x sin - , and lim / (x, y) - lim / (x, y) has for x =0 
v ~0 x x 1^0 

the limit zero. We have then lim lim / (x, y) =0, since x sin - , - x sin - have each the limit 
x~o 2 /—0 x x 

zero for x -0. It is clear that lim lim / (x, y) is also zero. If 0 <x < £e, and 0 <y < £c, we 
2/—0 x~0 

see that | / (x, y) | < e, and therefore lim / (x, y) exists, and is equal to zero. 

x~0, y~-0 


304 . An important matter for investigation is the determination of 
the necessary and sufficient conditions for the existence and equality of 
the two repeated limits at a point. A knowledge of such conditions, as 
also of sufficient conditions, is required in various fundamental theorems 
of analysis which turn upon the legitimacy of inverting the order of a 
repeated limiting process. 

It will be observed that the existence of lim lim / ( x , y) does not 

x~a y~b 

necessarily involve the existence of lim / ( x , y) as a definite number, since 

_____ * V~b 

lim lim/(a;, y), lim lim f (x, y) may both exist and have the same value, 

x~a V~b x~-a y~b , 

without it being necessarily the case that lim/(x, y), lim f(x,y) are 

y~b y^b 

* Pringsheim, Encyclopddie der math. Wissenach. ha. 1, p. 51. 
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identical. It is however necessary that lim f(x,y) — lim / (z, y) should 

y~b y~b 

converge to the limit zero, as x converges to the value a. 

The necessary and sufficient conditions required are contained in the 
following general theorem*: 

In order that the repeated limits lim lim f (x, y), limlimf(x,y) may 

x~-a y~b V~~b x~~a 

both exist, and have the same finite value, it is necessary and sufficient , 

(1) , that limf(x, y) — lim f ( x , y) should have the limit zero, for x ~ a, and 

V~b y~b 

that lim f (x,y) — lim f (x, y) shauld have the limit zero, for y~b; and 

x~a, x~a 

(2) , that, corresponding to a,ny fixed positive number e arbitrarily chosen, a 
positive number ft can be determined, satisfying the condition that, for each 
value of y interior to the interval (b, b 4 fi), a positive number a v , in general 
dependent on y, exists, such that, for this value of y, f (x, y) lies between 

limf(x, y) 4 e and lim f (x , y) — e, for all values of x interior to the interval 
V~~b y~b 

(a, a 4 a y ). 

Let us first assume that the conditions stated in the theorem are 
satisfied. A value of y may, in virtue of (1), be so chosen that the difference 
of the two limits lim / (x, y), lim f (x, y) is less than an arbitrarily chosen 

x~~a x~a 

number rj ; and this value of y may also be so chosen that it is interior to 
(b, b + p). For this fixed value of y, an interval (a, a 4- a v '), for x, may be 
so chosen that / (x, y) lies between lim / (x, y) 4- e and lim / (x, y) — e, 

x~a x~a 

provided y has the fixed value, and a < x < a 4 a y ': this follows from the 
definition of the upper and lower limits. Again, from the condition (1), 
a number a " can be determined, such that, if a; be interior to the interval 
(a, a 4 a"), the difference between the two limits lim / (x, y), lim / (x, y) 

y~h y~b 

is less than rj. Now let be the smallest of the three numbers a y , af, a 
then, if x x , x 2 be any two values of x within the interval (a, a 4 a y ), and 
y have the fixed value; by applying the conditions of the theorem, we see 
that the conditions 

l/(*i., y) -/(*!. y) I < v + 2e, 

|/(* 1 > y) - lim/to, y) | < v + €, 

I v~~b I 

/(*s. y) - y) < ij + e, 

y~~b 

* This theorem was given by Hobson, Proc. Lond. Math. Soc. (2), vol. v (1907), p. 226. It 
was transformed into an equivalent statement by Martinotti, Rendiconti del circ. math, di Palermo t 
vol. xxxvil (1914), p. 19. On the whole subject see also the Dissertation by J. Ridder, Enkele 
algemeene limietsteUingen met toepassingen op reele functiea , Utrecht (1921). 
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are all satisfied. It follows that 

lim/ (x t , y) - lim f(x t , y) | < + 4e 

V^b y-b I 

for every pair of points x l9 rr 2 within the interval (a, a -f a y ). Hence, since 
e, t) are both arbitrarily small, lim f (x, y) converges, for a; ^ a, to a definite 

v~b 

value which is the limit both of lim/ ( x , y) and of lim/ ( x , y ), when x ~ a; 

V~b y~b 

and thus lim lim / (x, y) exists. 

x-a y~-b 

Again, since lim lim / ( x , y) has a definite value, an interval (a, a 4 - 8) 

x~a y~b 

can be determined, such that, for any point x interior to it, 
lim lim/ (x, y ) - Hm/ (x, y) I < e. 

x~a y~b V~~b I 

Now lim lim / (x, y) — lim / ( x , y) is the sum of the three differences 

x~a y^b r~a 

lim lim / (r, y) - lim / (x, y ), 

x —a y~~b y~b 

Em/(*, ?/) -/ (•*> y)> /(*, y) - Em/(a:, y), 

and for a fixed ?/, chosen as before, x may be chosen so that it not only 
lies within the interval (a, a + 8), but is also such that 

\f (x,y) - lim f(x,y) I, I f{x,y) - lim f (x, y) I 

I y~-b I I x~a I 

are each less than rj + 2c. It follows that 

I lim lim / (x, y) — lim f (x, y) I < 5c -f 
I r~a y^b I 

and thus that lim / ( x , y) converges, as y converges to 6, to the limit 

x^a 

lim lim / (x, y). It has thus been shewn that the two repeated limits both 

x~-a y~b 

exist, and have the same value. 

Conversely, let us assume that the repeated limits both exist, and are 
finite and equal. We have then lim lim / (x, y) — lim / (x, y) I < £, pro- 

x~a y~b | 

vided y lies between b and b -f ft, where ft is some fixed number, £ being 
an arbitrarily chosen positive number; from this it follows that 

lim/ (x, y) - lim f (x, y) 

x~a o-~a 

is < 2£, for b < y < b + ft. Also 

I Urn / (x, y) - lim lim / ( x, y) < £, 

I y~~b x ~ a 

provided x lies within some fixed interval (&, a -f $'); and from this it 
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follows that lim/(#, y) — lim/(#, y) is < 2£, for a < x < a + h'. Since 

V-b ^Zb 

£ is arbitrarily small, we now see that the condition (1) of the theorem is 
satisfied. Further we see that 

f (x, y) - lim /(x, y) I < 2£ + £', 

x~~a I 

where £' is any arbitrarily chosen positive number, provided x lies within 
some interval (a, a -f a/), where a v ' depends upon y , and may diminish 
indefinitely as y approaches the value b. It follows from the three 
inequalities, that 

\f(x,y)-Umf(x,y) < 4£ + £', 

1 y-^b 

provided b < y < b -f and provided also x lies within some interval 
(a, a 4- a v ), where a y depends in general upon y. Since £ and £' are both 
arbitrarily small, it follows that the condition (2) of the theorem is satisfied. 

If the condition (2), in the above general theorem, be replaced by the 
more stringent condition that, corresponding to any fixed positive number 
e, arbitrarily chosen, a positive number ft can be determined, which is 
such that, for each value of y interior to the interval (b, b + /?), a positive 
number a v , dependent on y, exists, such that, for this value of y, and for 
all smaller values , / ( x , y) lies between lim / ( x , y) -j- e and lim / ( x , y) — e, 

v-b ' -yZj, 

then this condition and the condition (1) are the necessary and sufficient 
conditions that not only lim lim / ( x , y), lim lim / ( x , y) exist and are 

x^a y~b v~-b r~a 

equal, but also that the double limit lim / (x, y) exists, having a de- 

x~~a, y~b 

finite value, the same as the repeated limits. In case the function be 
defined for values of x , y on the lines x — a, y = b, the additional condi¬ 
tions must be added that the functional values on these lines also converge 
to the same limit lim / (x, y). 

x~~a, y~-b 

For, under the conditions stated, we have, provided y lies within the 
interval (6, b -f fi-f), where < ft, 

- /(a, y) - lim f(x, y) < e + ij, 

1 / — & 

where x has any value in the interval (a, a 4- £), £ being the lesser of the 
two numbers and 8'; the number 8' being so chosen that 

I lim / (x, y) — lim / ( x } y) < rj, for a < x < a + 8'. 

Also I lim / (x, y) - lim lim f(x,y) < e 3 provided x lies within an 

I x ~ a 

interval chosen sufficiently small. Hence the condition 
I fix, y) - lim lim / (x, y) I < 2e + ij 

I x~-a j/—6 I 
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is satisfied, provided 6 < y < b -f &, and provided x lies within an interval 
of which the length may depend upon e and rj. It follows, since c, rj are 
arbitrarily small, that/ ( x , y) has a definite double limit at the point (a, 6). 
That the conditions stated are necessary, follows at once from the defini¬ 
tion of lim / (x, y). 

x~a, v~b 

305. The theorem obtained in § 304 may be simplified in the case in 
which lim f(x y y), lim f(x,y) both have definite values at all points on 

y~b x ~o 

the straight lines x ■■= a, y = b which are in sufficiently small neighbour¬ 
hoods of the point (a, b). We may then state the theorem as follows: 

If limf (. x , y), limf(x, y) have definite finite values in the neighbourhood 

V~-b x~-a 

of the point (a, b), then the necessary and sufficient condition that the two 
repeated limits lim limf (x, y), lim lim f ( x , y) may both exist , and have the 

x~-a y~b y*~b x~a 

same finite value , is that , corresponding to any fixed positive number e, 
arbitrarily chosen , a positive number ft can be determined , which is such that , 
for each value, of y interior to the interval (b,b -f £), a positive number a Vf 
in general dependent on y, exists , such that , for this value of y, 

\f(x, y) - lim f (x, y) I < e, 

I V-b I 

for all values of x within the interval (a, a -f a y ). 

In case the condition | / ( x , y) — lim / (x, y) | < e, for all values of x 

within (a, a + a„), be satisfied, not only for the particular value of y , but 
for all smaller values, and this hold for every e, then the double limit 
lim / ( x , y) exists, and is equal to each of the repeated limits. In this 

x~a t y~b 

case the point (a, 6) is said to be a point of uniform convergence of the 
function / (x, y) to the limit limf (x, y), with respect to the parameter x\ 

V-b 

and thus, for such a point, there exists, for each value of e, an interval 
(a, a + a), where a depends in general upon e, such that, for each value 
of x within this interval, the condition | / (x, y) — lim / (x, y) | < c is 

satisfied, provided y be less than some fixed value which is the same for 
the whole ar-interval (a, a + a). 

It may happen that, as c is indefinitely diminished, a has a positive 
minimum a. In that case the fixed interval (a, a 4- a) is such that, for 

each €, the condition 1/ (x, y) — lim / (x, y) I < e is satisfied for all values 
I 1/—6 I 

of x within the fixed interval (a, a -f a), provided y is less than some fixed 
value dependent on e, the same for the whole ^-interval. In this case 
/ (x, y) is said to converge to limf (x } y) uniformly within the interval 

V~b 

(a, a -h a), with respect to the parameter x. Not only the point (a, b), 
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but also each interior point of the interval (a, a + d), is then a point of 
uniform convergence of / (x, y) to lim/ (x, y), with respect to the para¬ 
meter x. v ~ b 


306 . The necessary and sufficient conditions for the existence and 
equality of the two repeated limits of / (x, y ), at (a, 6), may be put into 
the following form, different from that of the theorem of § 304: 


The necessary and sufficient conditions that 

lim lim f ( x, y) — lim lim f ( x, y), 

x~a y~b y~b x~a 

their value being finite, are (1), that lim f (x, y) converge to a definite value 

a 

lim lim f ( x, y), when y converges to b, and that lim f ( x, y) — lim f (x, y) 

V~b J-a y~b —1 1 

converge to zero, far x~a; and (2), that, corresponding to any arbitrarily 
chosen positive number e, and to an arbitrarily chosen value 6 + ft, of y, 
a value ^ < 6 -f- ft, of y, can be found, and also a positive number a, such 
that the condition that f (x, yf) lies between 

limf (x, y) + c, and lim f (x, y) — e 

V~b 

is satisfied for every value of x within the interval (a, a -f a). 


In case lim f (x, y) everywhere exists in the neighbourhood of x = a, the 

V~b * 

condition (2) is that J / (x, y x ) — lim f (x, y) | < €, for every value of x within 


the interval (a, a -f a). 


That the conditions contained in the theorem are necessary, is seen 
from the theorem of § 304; it will be shewn that they are sufficient. Let 
us assume that the conditions are satisfied. We have 


lim/ (x, y) - lim lim/ (*, y) = ("lim/ (x, y) - f (x, y)“] 
y~b V~>> x ~ a U~i> J 

+ [■/(*, y ) - Em/ (x, y) 1 + flim/ (x, y) - lim lim/ (x, j/)l. 

L J v ~ b r ~ a J 

A positive number fa can now be chosen, such that, iib<y<b + fa, 
the condition I lim / (x, y) — lim lim f (x,y) I < c is satisfied; moreover we 

I ~~ y~b x~a | 

may choose so that it is < /? 0 . 


Next, a value y x , of y, exists, such that / (x , y x ) lies between 
lim/ (a?, y) + c, and Hm f (x, y) - e, 

V~b y~-b 

provided x be within the inteirval (a, a + a): the value of fa may be chosen 
so small that Em/(z, y) — lim/(x, y) < (, for every value of y which is 

x ~ a x~a 
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< b -f /? 0 , and therefore for the value y l9 of y. Again, an interval for x , 
possibly less than (a, a 4- a), can be so chosen that 

y) - lim/ (x, y) < e, 

V~l> 

provided x lies within the interval. It follows that an interval (a, a 4- a), 

for x, can be found, such that lim / (x, y) — f (x, y x ) < 3e. Further, the 

v~b 

interval within which x lies may, if necessary, be so restricted that 
\f{x, Vi) - iini/ (x, «/,)!< 2f. 

I x~a ! 

Hence, provided x lies within a definite interval, we see that 

I lim f (x, y) - lim lim f (a:, y) < 6e; 

I ]FT>‘ v~br~a' 

and since this condition holds for an arbitrary e, it follows that lim / (x , y) 

V~b 

converges, for x~a, to lim Urn f (x, y); and thus the sufficiency of the 

V~-b x~a 

conditions is established. 

THE LIMITS OF MONOTONE AND QUASI-MONOTONE FUNCTIONS 
OF TWO VARIABLES 

307. A monotone function / (x, y) y of two variables x and y, was 
defined in § 253 as a function such that / (pc', y') j f (x, y), or such that 
f (x', y') &f(x, y), for every pair of points (x, y), (x\ ?/), such that x' ^ x y 
y' ^ y. The following theorem, as regards the limits, at a point, of such a 
function, will be established. 

If a function f ( x , y) is bounded and monotone in some neighbourhood of 
the point (a, 6), then, in each of the two open quadrants x > a, y > b; x < a, 
y < b, the double limit of f (x, y), at the point (a, b), has a definite value. 

It will be observed that these limits are independent of the values of 
/ (x, y) on the straight lines through (a, b) parallel to the axes. 

It follows from the theorem that each of the repeated limits 
lim lim / (x, y), lim lim / (x, y) 

x-a y~~b y~b x~a 

in one of the two open quadrants exists, and that they both have the 
value of the corresponding double limit. 

We consider the values of the function in a cell A, of which (a, b) is 
an interior point, and such that / ( x , y) is monotone in the cell, and we 
take that part of the cell which is in the first quadrant x > a, y > b. Let 
<£i (x, y) denote lim/ (x, y + k), k> 0, and let <j> 2 (x , y) denote lim/ (x, y — k); 

k ~0 

the functions <f> t (x , 6), <f> 2 (x, b) are monotone functions of x. 
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We may suppose that / (x' } y') > f (x, y), when x ' 5 x, y f ^ y \ the other 
case may be treated in the same manner. We have then 

lim lim / (a -|- h , b 4 &) = lim </>! (a 4- b) — <f> x (a 4 0, 6). 

h ~~0 A;~0 

A value /q, of can be so determined that f> x (a 4 h r , b) -- (a 4 0 , b) 
is < \e, where e is an arbitrarily chosen positive number; a value k x , of k, 
can then be so determined that / (a 4 h x , b 4 k x ) — <f> 1 (fl 4 h \, b) < 

We have then 0 £ f (a 1- h , 6 4 k) — </> x (a 4 0, b) < e, for all values of h 
and k such that 0 < h < h ly 0 < k < k x . It follows, since e is arbitrarily 
small, that the double limit of / (a 4 h, b 4 k), as h ~ 0, k ~ 0 exists, and 
has the value (a 4 0 , b). A similar proof can be given for the case of 
the other qudrant x < a, y < b. 

The double limits in the other two quadrants x > a, y <b \x < a, y > b 
do not necessarily exist. A theorem has however been given* by R. C. 
Young, from which, in the case of quasi-monotone functions, it follows 
that 1 

If a function f (x, y) is bounded and quasi-monotone in some neighbour¬ 
hood of the point (a, b), then, in each of the four open quadrants , x > a, 
y > b; x < a, y < b; x > a, y < b ; x < a , y > b, the double limit of f (x, y) 
at the point (a, b) has a definite value. 

It is sufficient to consider the case of the quadrant x > a, y > b, when 
A (r.’vV/ ( x > y) d, for any two points (x, y), (x', y) in the cell 

(a, b;a 4 h, b f k), 

such that x' > x, y' > y, and the function / (x, y) is of any of the four 
types specified in § 255. The other cases given there can all be treated 
in a precisely similar manner. 

If P, Q denote the points (x, y), ( x ', y) respectively, we can denote 
A(r,’J ) ) f (#, y) by Ap/; the points (a, b), (a 4 h, b 4 k) may be denoted 
by A, B. 

For any cell (P, Q) contained in (A, B), we have A %f ^ A^/; for the 
cell (P, Q) may be one of a number of cells into which (A, B) is divided, 

and A [if is equal to the sum, for all these cells (a, /3), of A f/; and A f / is 
by hypothesis s 0. 

If (A, Q x ), (A, Q 2 ), ... (A, Q n ), ... be a sequence of cells, such that Q n 

is in the cell (A, Q n _ x ), for every value of n, the sequence {A^ n /} is mono¬ 
tone non-increasing, and therefore converges to a definite limit, as n ~ 00 . 
Let {P n }, {Q n } be any two sequences of points, each converging to A ; a 
sequence P Wl , Q mi , P„ 2 , Q m2> ••• T ni , Q mi , ... can be so determined that 
Q Wi is in the cell (A, P n ), and P n< is in the cell (A, Q nlt _ i ), for all values 


Venaeignement math. 1924-5, p. 79. 
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of i; then the sequence A*/ 1 /, A^"* 1 /, A^" 8 /, A... has a unique limit. 
II {A^"/}, {A^ m /} both have definite limits, it follows that these limits 
have the same value. Hence it is seen that A * A f has a definite limit as P 
converges to A through the points of any sequence of which A is the 
unique limiting point, and that the value of the limit is the same for all 
such sequences. That A A f has a definite limit, in the sense that there 
exists a neighbourhood of A such that in that neighbourhood A^ / differs 
from its limit by less than an arbitrarily prescribed positive number e, 
is proved as in § 211, in the discussion of the equivalence of the definitions 
of continuity given by Heine and bv Cauchy, by a method which requires 
the use of the multiplicative axiom. Since 

/ (x, y)+f (a, b) - f ( X, b) - / (a, y) 

converges to a definite limit as x ~ a, y ~ b\ where (.r, y) is in the open 
positive quadrant, and since f (x, 6), f (a, y) are both monotone functions, 
either of which may be non-increasing, or non-diminishing, according to 
the particular type of the quasi-monotone function, it follows, since 
/ ( x , b) f f ( a , y), have in any case, definite limits as x ~ a, y ~ 5, that 
/ ( x , y) must have a definite double limit. Exactly similar proofs apply 
to each of the four quadrants, whatever be the type of the quasi-monotone 
function. 

308. We now consider further the functions which have been defined 
in § 255, and named quasi-monotone functions. If / (x, y) be such a 
function, the function F ( x , y) defined by 

F (x, y) =J (x, y) - / (x,b) - f (a, y) + f (a, b) 
is a monotone function. 

For a quasi-monotone function, the following theorem* relating to the 
points of discontinuity of the function, will be established: 

If f ( x , y) be a quasi-monotone function , there exist two enumerable sets 
of straight lines parallel to the x-axis and the y-axis , respectively , such that 
every point of discontinuity of the function , with respect to (x, y), lies on a 
straight line belonging to one or other of the enumerable sets. 

Let (x , y), <f> 2 (x, y) be defined as in § 307; then, if / (x, y) be monotone 
non-diminishing with respect to x, for each constant value of y, and also 
monotone non-diminishing with respect to y, for each constant value of x , 
we have 

<t>i ( x + 0, y) zf(x + 0, y) s <j> 2 (x + 0, y) zf(x, y - 0) £ <f> 2 (x - 0, y), 

<t>i (x + 0, y) zf(z, y + 0)z<f> 1 (x-0, y) */(«? - 0, y) £ <f> 2 (x - 0, y). 

For a constant value a, of x , the discontinuities of / (x, y) with respect 

* See W. H. Young, Proc. Royal Soc. vol. xcm (1917), p. 31, where these functions are 
termed **monotone functions.” 
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to y are at points of an enumerable set on the straight line x — a. Let a 
be the value of x on the right-hand boundary of the cell in which the 
function is defined. If x — a, y = p be a point at which / (a, y) is con¬ 
tinuous with respect to y , then / (x, y) is continuous with respect to y , 
at y ---- f$, for each value of a; in the cell. For, let (/3 — Jc, ft -f A;) be a neigh¬ 
bourhood of ft, such that 0 / (a, p + k) — / (a, p — k) < e, then, from the 

definition of / ( x , y), we see that 0 -- / (x t p -f h) — f (x, p — k) < e. It then 
easily follows that / ( x , y) is continuous with respect to y , for y — j8, for 
any fixed value of x. For such a value of y, as P, we have 

<t >i (*> y) = & (*, */). 

for every value of a:. This equality therefore holds for all values of y which 
do not belong to a certain enumerable set. Again / (x, y) is a monotone 
non-diminishing function of x, for each value of y; and, as before, we see 
that it is a continuous function of x 9 for all values of x not belonging to a 
certain enumerable set, for each value of y. 

If ( x , y) is any point not on either set of straight lines parallel to the 
axis, defined by the exceptional values of x and of y, indicated above, we 
have </>! (x + 0, y) — <f> 2 (x — 0, y); and then, employing the inequalities 

given above, we see that the four limits, at ( x , y), of the function, in the 

open quadrants, and the four limits at ( x , y) dependent on the values of 
the function on the parallels to the axes, through the point, are all equal; 
and therefore ( x , y) is a point of continuity of the function. 

In the case in which / ( x , y) is a non-diminishing function of y, for each 
constant value of x, and in which / (x, y) is a non-increasing function of x, 
for each value of y; the inequalities employed above can be replaced by 

<f>t ( x + o. y )(*> y - °) - <l>i ( x - y) » /(* - o, y) s, ^ (x - o, y), 

<t>i ( x + y) ^f( x + 0, y) ^ <f> 1 (* + 0, y) nf(x, y + 0) a ( x - 0, y), 

and a similar argument to that above will shew that 

<f >2 ( x + o> y) = <f >i {x - o, y) 

at every point (x, y) not on either of two enumerable sets of lines parallel 
to the axes. The cases of functions of the other types can be treated in a 
similar manner. 


PARTIAL DIFFERENTIAL COEFFICIENTS 

309 . If, at a point (x 0 , y 0 ), in the domain for which the function / (x, y) 

is defined, the limit ^ ^ exists, having either a 

definite finite value, or being indefinitely great, but of fixed sign, this 
limit is said to be the partial differential coefficient of f (x y y), at ( x 0 , y 0 ), 

with respect to a;; it is usually denoted by J 
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When the limit \inJ {x °’ y " + k ) ^ X °' !h) exists, it is said to be the 
partial differential coefficient of / ( x , y), at (x 0 , y 0 ), with respect to y\ it is 
denoted by 

' Sy u 

In general, h, k , in these definitions, are regarded as having either sign. 
It is possible that either of the above limits may not exist, but that there 
may be two definite limits, one for positive values of the increment h or k , 
and the other for negative values. In that case the two limits are said to 
be the progressive and regressive partial differential coefficients with 
respect to the particular variable, ft is of course possible that, at a 
particular point, one of these may exist, and not the other. 

()f c)f 

That the two partial differential coefficients ^ , ' may exist, it is 

dx () dy 0 

necessary, but not sufficient, that f (x, y) should, at the point (x 0 , y 0 ), be 
continuous with respect to x, and also with respect to y. 

To express the increment / ( x 0 -J- h, y 0 -f k) — f (x () , y 0 ) s of the function 
/ ( x , y), when the two numbers x 0 , y Q receive increments h , k respectively, 
we have 


f(x o -I- h, y„ f k) -f(x 0 ,y 0 ) [f(x„ + h, y 0 f k) -f(x 0 ,y 0 + /•)] 

+ I f(x 0 , y n + k) -f(x„, y 0 )\. 

If we now assume that ^ exists at the point (x 0 , y 0 ), and has a finite 
value, we have 

f{x 0 ,y u i k)-f (*o. y<> ) d f ( x « . y«) ,„ (1A 

k dy 0 

where a (k) converges to the limit zero, when k is indefinitely diminished. 

3 / 

dx 0 ’ 


Again, 


/ (x 0 4- h, y 0 + k) -f (x«, y 0 + k) 
h 


converges to the limit 


when k is first diminished to the limit zero, and afterwards h converges to 

zero, it being assumed that ^ has a definite value, and also that/ (x> y) is 

dx 0 

continuous with respect to y, for the value y ~ ?/ 0 , where x has any value 
in a neighbourhood of x 0 . In order, however, that the double limit 
f(x Q -f h, y 0 + k) - f(x 0> y 0 4 k) 


lim 

*~o, fc~0 


h 

y 


may exist, in which case its value is , being independent of the mode 

ox 0 

in which h, k approach their limits, it is necessary and sufficient that 

/(*b + K yo + k) -/(So>3fo + *) 
h 


should be a continuous function of ( h , k) at the point h = 0, k = 0. 
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If this condition be satisfied, positive numbers h lf k x can be determined, 
such that 

f(x 0 + h , y 0 4 k) - f(x 0 ,y 0 -f k) _ df 
h dx 0 

where 77 is a prescribed positive number, and 0 < | h | ^ h x , 0 ^ | k | £ . 

We have now 


< 7 ], 


f(x 0 + h, y 0 4- k) — f (x 0 , y 0 4 k) df , , 
h dx 0 + p(fl ’ fC) ’ 

where p ( h, k) converges to zero, independently of the mode in which h, k 
converge to zero. 


Under the conditions stated, we have 

f(x 0 + h, y a + k) —f(x„, y 0 ) ^hfi^+k & + hp + ka, 

where p, a converge to zero, when h and k are indefinitely diminished, 
independently of the mode in which they approach their limits. This is 
equivalent to the statement that, corresponding to an arbitrarily assigned 
positive number 77, positive numbers h l9 k x can be determined so that | p | 
and | cr | are each < 77, for all values of h and k such that | h \ < h x , | k | < k x . 


In the notation of differentials, denoting / ( x 0 , y 0 ) by z 0 , we have 

dz 


dzu = ^ Z dx 
ox n 




dy; 


the expression on the right-hand side being termed the total differential 

dz dz 

of z at the point ( x, y), and ~ • dx, ^ dy the partial differentials. In 

ox Q oy 0 

accordance with the arithmetical theory, this equation can only be re¬ 
garded as a conveniently abridged form of the result obtained in the 
present discussion. 

r\f 

The expression h ^ 4 - k ^ may be spoken of as the first differential 
of / (x, y) at ( x Q , y Q ), and may be denoted by 8 (1 > f. 


The theorem obtained may be stated as follows: 


That the increment of a function f (x 0 , y 0 ) when x 0 , y 0 are changed into 

x 0 4- h, y 0 -f k may be h ^ ^ 4- k 4- hp 4- ka, where p, a con- 

dx Q oy 0 

verge to zero, when h, k are indefinitely diminished , independently of the mode 

in which they are diminished, it is sufficient (1), that ^ ^fooj J/ o) 

ox 0 oy Q 

have definite finite values, and (2), that 
a continuous function of (h, k) at the point h = 0, k = 0. 
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3 / 9 / 


It will be observed that no assumption has been made that ^^ have 
definite values except at the point (x 0 , y 0 ) itself. 

It follows from the definition that it is necessary for the existence of 
a total differential at (x 0 , y 0 ) that the function be continuous with respect 

to (*, V ), »t (z 0 , y B ). 

If it be assumed that has a definite value at <*„, y), for all values 

of y in some neighbourhood of y 0 , the condition (2) may be expressed in 

the form that (a), for x — x 0 , must be a continuous function of y , at 

y = y oi and that (6), the point h = 0, k = 0 must be a point of uniform 

convergence of the function - ±—! y °--±M considered 

as a function of h, to its limit for h ~ 0, with k as a parameter, in accord¬ 
ance with the definition of such a point of uniform convergence given in 
§ 305. 

If it be assumed that exists, not merely at the point (x 0 , y 0 ), but 

at all points in a sufficiently small two-dimensional neighbourhood of the 
point, the conditions contained in the theorem may be simplified. For we 
have, in that case, 

f(x 0 + h, y 0 + *) -fl&.jh +:.*) = y {Xo + eh, y 0 + k), 

(x V ) 

where 6 is such that 0 < 6 < 1; and this expression converges to - * yo/ 

dx 0 

(x / u) 

provided J be continuous with respect to ( x , y) at the point ( x 0 , y 0 ). 

It has thus been proved that*, in order that the increment of the function 

may be of the form given in the theorem above , it is sufficient (1), that ^ 

have definite values at the point (x 0 , y Q ), and (2), that one at least of'these 
partial differential coefficients have definite values everywhere in a two- 
dimensional neighbourhood of (x 0 , y 0 ), and be continuous at (x 0 , y 0 ) } with 
respect to the domain (x, y). 

310 . Let it now be assumed that J- exist, and are continuous 

ox dy 

functions, with respect to (z, y), at all points of an open domain D . Let 
D x be any perfect domain interior to D ; at any point of D 1 we have 

0 

f(x + h, y + k) -f(x, y + k) = h^f(x + 6h,y+ k), 


Thomae, Einleitung in die Theorie der bestimmten Integrate, p. 37. 
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where 0 < 6 < 1 , and h, k are such that | h | < £, | k | < £; the number £ 
being so chosen that £ a/ 2 is less than the distance between the boundaries 
of D x and D , so that the cell (x — h, y — k; x + h, y + k) is certainly 
interior to D. 


We have also 

f(x,y + k)-f (x, y) = k (x, y + O'k), 

where 0 < 6' < 1. 

Since ~ are continuous in the perfect domain D Xl they are uni¬ 
formly continuous in that domain. Accordingly, a positive number e (< £) 
can be so determined that, if r\ be a prescribed positive number, 


dx 


f{x + Oh , y 4 k) -g-/(x, y) 


<V> 


and 


dy 


f(x,y + d'k)-2-f(x,y)\<r) 1 

for all points (x, y), of D lf provided | h |, | k | are both less than e. 

The numbers rj , e converge to zero together. We have now the following 
theorem: 


If the function f (x, y) have partial differential coefficients that are con¬ 
tinuous , with respect to the plane domain , at all points of an open domain D , 
then 

f (x -f- h f y 4- k) — f ( x , y) 4 - h -f- k ^ - 4 - hR 4- kR', 

where (x, y), (x 4- h, y 4- k) are any points of D ; and R , R' tend to the limit 
zero , uniformly for all points (x, y) of any perfect domain contained in D, 
as h } k converge , in any manner , to zero. 


EXAMPLES 

1. Let* f (x,y)=V | xy |, where the positive value of the square root is to be taken. 

-Si* 3/ 

In this case both exist at the point (0, 0), and are both =0. We have 

f(h,k)-f(0,k)_ /\k] 

h *~vUr 

and this has different constant values for different constant values of k/h, and is therefore 
discontinuous at the point ^=0, 1=0. In this case the equation f (h, k) -hp + ko, when 
p and a converge to zero with h and k , cannot hold. 

2. Lott f ( x * V) ~ x sin (4 tan~ l y/x), for x>0; and / (0, y)~ 0, for all values of y. We 

find ^ =0, ^ -0, and thus ^ ^ ^ is continuous with respect to y, at (0, 0). 

Also, we find ^*--=4, ^~0, and therefore is discontinuous with regard 

dy dy dy * 

* Stolz, Grundzilge , vol. i, p. 133. 

| Hamack’s Introduction to the Differential and Integral Calculus, Cathcart’s Transi tion, p. 93 






422 


Functions of a Real Variable 


[oh. v 


to x , at (0, 0). The value of 


f(h , k) -/( 0 ,*) . 


sin ^4 tan - " 1 ^, and this is discontinuous 


at. 


rif 

h =0, k -0; hence the relation (if =^ dx + ^ dy does not hold at the point (0, 0). 


HIGHER PARTIAL DIFFERENTIAL COEFFICIENTS 

311. If the function / (x, y) have the partial differential coefficient 

n/> or 

it may happen that, at the point (# 0 , y 0 ), the function ^ possesses a 

partial differential coefficient with respect to x. This is denoted by 
0 2 / (*o> Vo) 


0*o 2 


, and is spoken of as the second partial differential coefficient of 


/ (x, y) with respect to x , at the point x 0 . The second partial differential 
02 r i X y) 

coefficient J , with respect to y , is defined in a similar manner. 


d 2 f ( x o> Vq) • 


dx () 2 


is defined as the limit of 


1 \df (x 0 + h, y 0 ) _ d f(x 0 , y 0 )\ 


being assumed that 




h ( 




0a: o 


; it 


has a definite value. It is not necessary for 


the existence of this limit, as a definite number, that ( x o+Vo), f or 

ox n 


h =|= 0 , should have a definite value. When ( x o Vo) j jas ]in) its of 

0* o 

indeterminancy 


0/ + h < Vo) ?/ (*o + A, ;V«) 


0*0 


0*n 


it may happen that lim \ + h ’ ^ - d f^ + h ’M\ _ 0j and that 

h~ O'* 


dx, 


dx„ 


iim 1 (x ° + h ’ - d f y») 

H * 1 0*0 0*0 


has a definite value. 

Thus, in accordance with the definition of this second partial 
differential coefficient may exist, as a definite number, at the point (a? 0 , y 0 ), 


ay 


d i 

dx 


but not at points in the neighbourhood; it is however assumed that 
exists at (x 0 , y 0 ). 

It may happen that, at the point (# 0 , y 0 ), the function ~ has a differ¬ 
ential coefficierft with respect to y : this may be denoted by -- or 

oy 0 \vx 0 J 

d 2 f (x y ) df 

-y-p . Similarly, when has, at the point (x 0 , y 0 ), a partial differ¬ 
ed cx 0 oy 
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ential coefficient with respect to x , this is denoted by ^ Idy')’ ° r 

\ £’ yo '. These partial differential coefficients are said to be the mixed 
Modya 

partial differential coefficients of the second order at (x 0 , y d ), of / (x, y), with 
respect to x and y, the order of differentiation being different in the two. 

Under certain conditions which will here be investigated, the two 
mixed partial differential coefficients of the second order, with respect 
to x and y , satisfy the relation 

3 2 / (*, y) _ 3 2 / (x, y) 

dxdy dydx ’ 

which may be regarded as the fundamental theorem for partial differential 
coefficients of the second order. 


The differential coefficient ^ V or _ ^ i , is defined as the partial 

foo \oy 0 / foody o 

differential coefficient at (x 0 , y 0 ), with respect to x, of t it being assumed 


0 /* ( OTj 'll ) 

that ----- - exists, as a definite number. 


dy’ 


Thus --1-- - lirn ! 
dx 0 dy„ h 


lim f 1 A’ y ° + k ) ' f (x ° 1 h ’ Vo) 

k~0 k 


I ?/o I k) -/(*„, J/„)' 


ay 


- lim J 

a-~o k 


For the existence of . i as a definite number, it being assumed that 
ox 0 cy 0 


5/ (■*-<> > ?/o) j iag a <Jt-fi nHe* value, it is not necessary that vo; should 


K y») 


t>Vo 


ay 


<>y u 


have a definite value for h + 0. - i. may exist as 

foo dy 0 


lim 1 ■P/i* 0 + A ’ -%) _ d lA x V 


***** 2 ) 




oy 0 


ay 


When the differential coefficient ^ exists, it is equal to the repeated 

dx 0 dy 0 

limit 

lim lim fJQffJhJl* iJO ~/( g o + f/o) ~ / (- r o> ffo + jQ 4 - f (x 0 , y 0 ) 

A~0 fc~0 

Conversely, if this repeated limit has a definite value, and if also 
lim 

k~ o & 

0 2 f 

exists, then a exists, and is equal to the repeated limit. 
oXq oy 0 
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It is however possible that the repeated limit may exist, and yet that 


a 2 / 


rif 

- exists be not satisfied. 


-a ~ may not exist, if the condition that 
cxotyo oy 0 

For example* let / (x , y) = Xl (x) 4- X 2 (,V); where Xl (z), X2 (y) are non- 
differentiable functions of x and y respectively. In this case 

0/ (xp + h, y 0 ) df (x 0 , y 0 ) 

tyo 




a 2 / 


do not exist, and thus ^ i does not exist, but 
dx 0 dy 0 

lim lim ^ 1 h ’ Vo -t " l) ~ f »>. 4- &) +f(Xo>y 0 ) 

h~0 k ~0 ^ 

exists, and is equal to zero. 

0 2f ( X y \ 

It would be possible to define ®’ ^ 0/ as the value of this last 

dx 0 oy 0 

0 /* (jC XJ ) 

repeated limit, when it exists, in which case J would not necessarily 

exist. It is however more convenient to restrict the definition, as has 
been done above, to apply only to the case in which J exists, as 

a definite number; accordingly this definition will be adopted. 

It will also be assumed that ^ ffao^yo) ex j g £ Sj on jy w hen ^Z(^o>J/o) 

OXq OXq 


d 2 f 


exists. A similarly restricted definition of , t will be employed. 

oy 0 OXq 

312. Denoting 

/ (x 0 f h, y 0 + k) - f (x 0 + h , y Q ) -f(x 0i y 0 + k) + / (x 0 , y 0 ) 

dH d 2 f 

by F ( h , k ), the condition that v i holds is identical with the 

OXq Oy Q 0y Q OXq 

condition that the two repeated limits of —~jr ^, for h ~ 0, k ~ 0, should 

both exist, and have the same finite, or infinite, value; it being assumed 

that ^^ both exist at the point (z 0 , y 0 ). The necessary and sufficient 

conditions for the equality of the two partial differential coefficients may 
be obtained by applying the conditions given in either of the theorems in 
§ 304, and § 306, to the function F ( h , k)/hk. It is however convenient, for 
application in particular cases, to possess sufficient conditions, of a simple 
character, relating to the partial differential coefficients in the neighbour¬ 
hood of the point (x 0i y 0 ). 

* See Hobson, Proc. Lond. Math. Soc. (2), vol. v (1907), p. 233. 
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The following theorem will be established: 
d 2 f (x y) 

If (1), exists, ami is finite, at all points in a plane neighbour¬ 

hood (e) of the point (x 0 , y 0 ), except that its existence at the point (x 0f y 0 ) itself 

02 f t x y\ 

is not assumed , and if (2), ^° s a un ^I ue double limit A , at the 

point (x 0 , y 0 ), which is either finite , or infinite with a fixed sign, and if 

o ), df <?’ %>, d f (?.’ y») both existt and arefinite> lUn m^io) m*o,yo) 

dx 0 dy Q J dx 0 dy 0 dy 0 dx 0 

both exist, and have the same value A. 

d 2 f 

In particular, the conditions of the theorem are satisfied if exists 
everywhere, without exception, in a neighbourhood (e) of (x 0 , y 0 ), and is con- 
tinuous at (x 0 , y 0 ), and if ^ exists at (x 0 , y 0 ). 

df (x y) 

In the first place, we observe that, as a consequence of (1), • y 

exists everywhere in the neighbourhood (e) of (# 0 , y 0 ); its existence at the 

0/* (x y ) 

point (x 0 ,y 0 ) being assumed, in accordance with (3). Moreover J - 

must be continuous with respect to y, except possibly at the point (x 0 , y 0 ) 
itself. It also follows that / (x, y) is continuous with respect to x, at every 
point of the neighbourhood of ( x 0 , y 0 ), including the point ( x 0 , y 0 ) itself, 
on account of (3). 

Let us suppose that h, k are both positive; the other three quadrants 
may then be considered separately, in the same manner. 

Since ^ where 0 < h < e, is continuous with respect to y, 

in the closed interval y Q £ y £ y 0 + k, where k < e, and since it possesses 
a differential coefficient with respect to y, at every interior point of that 
interval of y, we have, by the mean value theorem (§ 262), 

df(x Q -f h, y 0 + k) __ df (x 0 + h, y 0 ) = d 2 / (x 0 + h, y 0 + 6k) 
dx dx dydx * 

where 0 is some number such that 0 < 6 < 1. The expression on the right- 
hand side is, by ( 2 ), equal to k [A -f a (h, k)\, if A be finite, where 
a (h, k) | < , an arbitrarily chosen positive number, provided h, k are 

each less than some fixed positive number rj, dependent on e 2 . In case 
A = + oo, the corresponding result is that 

df (xp f h, % + k) _ df(x a + h,y 0 ) > kN 
dx dx 9 

for 0<h<r),Q<k<r), where N is an arbitrary positive number, and 77 
depends on N. 
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Since the function / (x, y 0 f k) — / (x, y 0 ) is a continuous function of x, 
in the closed interval (.r 0 ,rr 0 -[ h), and since it possesses a differential 
coefficient with respect to x , at all points x in that interval, we have, by 
applying the mean value theorem, 

F (h, k) = h { 0/ ( *“ + 2’ y ° ±k) - 
where 0' is a number such that 0 < O' < 1. 


From the two results obtained, we have F (h, k)jhk ~ A -f- a (h, k ), 
when A is finite; where | a ( h, k) | < e x , provided h, k are both less than rj. 

In case A ^ oo, | F ( h, k)jhk | > N, if h, k are both less than 7?. 

The case A = — oo can be treated similarly. In either case F (h, k)/hk 

has the double limit A. Since f f both exist at (x 0 , y 0 ), as finite 

ox o ey 0 

d 2 f d 2 f 

numbers, it follows that -- i. , _ both exist, and have the value A. 

ox 0 cy 0 dy 0 dx 0 

The sufficient conditions in the foregoing theorem are somewhat 
simpler than those stated by Schwarz*, who assumed the additional con- 
0/* (x U ) 

dition that - \-- - exists, and is finite, for all values of x in the neigh- 

bourhood of x — x 0 , for the constant value y Q , of y. 

A somewhat different set of sufficient conditions has been given| by 
W. H. Young, in the following theorem : 

d 2 f 

If ( 1), ^ exists, and is finite, at all points of the neighbourhood (e) of 

the point (x 0 , y Q ), except those points for which x = x 0 or y = y 0 , its existence 

d 2 f 

on these straight lines not being assumed, and if (2), these values of 
have a unique double limit A ( finite, or infinite with fixed sign), and if (3), 
exists on the two straight lines x — x 0 , y — y 0 in the neighbourhood (e) of 
(x 0 , yf), including that point itself, and is continuous with respect to y, when 
V = Vq > # 4= %o f an d if (4), ^ exists when y — y 0 , for all values of x in the 

neighbourhood (e) of x 0 , including x Q itself , then ^ ^ both 

ox 0 oy 0 oy 0 ox 0 

exist, and have the value A. 


* OesammeUe Abh. vol. ii (1890), p. 275; see also Peano, Mathesis , vol. x, p. 153, and also 
Stolz, Orundziige d. Dijf. Rech. vol. i, p. 147. Sufficient conditions were also investigated by 
Hobson, Proc. Lond, Math. Soc. (2), vol. v (1907), p. 225. See also Fubini, Qior. di Battaglini , 
vol. xxxvm (1900), p. 72, and Dini, Lezioni di Analyst infinitesimale , vol. i (1907), p. 164. 
t Proc. R.8.E. vol. xxix (1908-9), p. 136. 
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Only a slight modification of the proof of the last theorem is required 
in order to prove this theorem. 

From (1) it follows that exists, and is a continuous function of y, 

when x ^ y + t/ 0 ; this, combined with (3), shews that ^ exists every¬ 
where in the neighbourhood (e) of (# 0 , y 0 ), and that it is everywhere con¬ 
tinuous with respect to except possibly at the point ( x 0 , y 0 ). It follows 
that / (x, y) is everywhere continuous with respect to x. The proof then 
proceeds as before. 


then 


313. If ^ ^ ^ have total differentials * at the point (x 0 ,y 0 ), 


f/JMo) = ^ 2 /^p,^o) 

~dx 0 dy 0 c)y 0 dx 0 


of V 


In accordance with the condition stated, the two functions f ~ are 

dx oy 

continuous with respect to (x, y) at the point (x 0 , y 0 ), and they accordingly 
exist in a neighbourhood of (a: 0 ,y 0 ). Hence f(x,y) is continuous both 
with respect to x , and with respect to y. We have 

df(xo'\-h,y 0 + k) df(x 0 ,y 0 ) h d 2 f(x 0i y 0 ) h d 2 f(x 0 ,y 0 ) , 

"~~dx - " dx 0 ^ h ~dx~^ + L ^i 0 d^ 0 + ,lp + ka ’ 

where | p |, | a | converge to zero as h, k do so in any manner. 

We have 

F ( h, k) = { f(x 0 + h, y„ + k) -f(x 0 + h, y„)} - {/ (x 0 ,y 0 + k) -f(x 0 , y 0 )} 

, (*o + Oh, Vo + k) of (x 0 + 6h, y 0 )) 

— t a*. 8* j ’ 

where 0 is such that 0 < 6 < 1. From this, we see that 

f ( h, k) = h \en d y ( f 0,yn) + k d X {x<s ' y ° ] + ehp + w 

{ Sx„ 2 dy 0 dx 0 


f, h d 2 f(x 0 ,y 0 ) ,) 

~ 0h av ~ p )’ 

where | />' |, | o' |, | p" | converge to zero, as h ~ 0, k ~ 0. 

We now have 

F (h, k) _ d 2 f (x 0 , y 0 ) h _ h „. 

hk dy 0 dx 0 k p k^ 5 

hence, if h and k converge to zero in any manner such that kjh is greater 

02 f Ly. y \ 

than a fixed positive number e, F (h , k)/hk converges to ^ V ~ • 

oy Q ox 0 


* See W. H. Young, loc . cit. 
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02 f ( X y \ 

Similarly, it can be shewn that F (h , k)/hk converges to , if h 

oXq oy 0 

and k converge to zero so that h/k > e. 

By letting h and k converge to zero, so that both h/k and k/h are > e. 


we see that 


3 2 / (*o. Vo) _ 3 2 / (x 0 , y 0 ) 


dx 0 dy 0 


dy 0 dx 0 


It has been shewn, in § 309, that the conditions of the above theorem 
are satisfied if the four partial differential coefficients of the second order 

0 2 f 02 f 

all exist at the point ( x 0 , y 0 ), and if ~~ exist in a neighbourhood of 

that point and are continuous with respect to (x, y), at the point ( x 0 , y 0 ). 
The following theorem has thus been proved: 

If the four partial differential coefficients of f (x, y) exist at the point 

d 2 f d 2 f 

(x 0 , y 0 ), and if g™, exist in a neighbourhood of (x 0i y 0 ), and are con¬ 


tinuous relative to (x, y ), at the point (x 0) y 0 ), then 


3 2 / (*o. Vo) _ 9 2 / (*o. y») 


dx n dy 0 


dy 0 dx 0 


314 . The partial differential coefficients of higher order n of a function 

B n f (x y ) 

f(x, y) are of the form dxPdyQ ^ xT ’’ dx * dy i> where P,q,r, ...l are positive 

integers, including zero, such that p + q + r -f ... + l = n. Here,/is first 
differentiated l times with respect to y, then k times with respect to x, 
and so on. The total number of possible partial differential coefficients of 
order n is 2 n ; the number of those in which r differentiations with respect 

n f 

to x , and n — r with respect to y are involved is ^ } . 

Sufficient conditions for the existence of all the partial differential 
coefficients of order n may be obtained by extending the theorem of § 311, 
which refers to the case n = 2. The following criteria*, which can be 
proved by induction, will be sufficient for the purpose: 

If then- \ differential coefficients d fS dy , d J-{ dyt . - ^ve 

definite finite values for all points in a two-dimensional neighbourhood of the 
point (# 0 , y 0 ), and are continuous at the point (# 0 , ?/ 0 ), with respect to (x, y), 
then all the other mixed partial differential coefficients of order n exist at the 
point (x 0 , y Q ); and each one of them has the same value at the point as that 
one of those given above in which the same number of differentiations with 
respect to x , and with respect to y, occurs , as in the one considered. 


See Stolz, Grundziige , vol. I, p. 153. 




313-315] Higher Partial Differential Coefficients 


429 


EXAMPLE 

Let* the function / ( x , y) be defined by / (a*, y) - 


■ xy 


for all values of x and y 


x“ +y* 

except when x -0, y- 0; for which / (0, 0) =0. At the point (0, 0), the partial differential 
d 2 f d 2 / 

coefficients both exist, and have different finite values. 

ox dy dydx 

The function / {x, y) is continuous at the point (0, 0); for, writing x =r cos y = r sin 0, 
the function becomes \r 2 sin 46; and this is numerically less than e, provided r <2 \ e. 


We find 


df (x>y) 

dx 


jx 2 

= y w 


4 x 2 y 2 
+ y 2 " {x 2 +y 2 ) 


y- 




at any point except (0, 0); at which point 


% is lim^*’ 0) 0) , which is = 0. 

dx X 


The value of ^ ^ is - y, and that of —^ — is x . 

dx dy 


We then find 


d 2 f (°» °) _ l* 1 f 3/(0. y ) 

dy dx y ^ o y I dx 


d f(Q, 0) 1 __ _ 1 
' da; / ' ’ 


and 


a 2 / (o, o) = ]jm i fifjx, o) _ a/(o, o) 


dxdy 


-o x ( 


d 2 f (x, y) 

The value of , as also that of 


dy 

d 2 f (x, y) 
dydx 9 


dy 

x 2 - 


:{ 


Hx 2 y 2 | 

(1?+W J 


, at every point 


dxdy ’ "" . dydx ’ x £ +y £ 

except (0, 0). This value is cos 20 (1 +2 sin 2 20), which is constant for a constant value of 0, 
but has different values for different values of 6 ; and thus the partial differential coefficients 
are discontinuous at the point (0, 0). The conditions of the theorem giving sufficient con- 
d 2 f d 2 f 
dxdy dydx ' 


315. Let </) (h) be a function of h , such that </> (0) — 0, and such that 
in an open neighbourhood of the point h = 0, the first n — 1 differential 
coefficients of <f> (h) all exist, and all have the value zero at h — 0. 

We have then, from the theorem of § 263, 

<f> (h) ^ v (K) = <rjh 2 ) ^ ^ Am) 

h n nh^- 1 n (n — l) h/ 1 - 1 n\ A n _j * 

where 0 < | ^ n _ x | < | h n _ 2 1 < ... < | | < | h |. 

Now let <f> (h) = F (h , x, y ); and let us assume that, for all points (x, y) 
in a given open plane domain D, and for all values of h in a given open 
neighbourhood of h = 0, the first n - 1 partial differential coefficients of 
F ( h , x y y) with respect to h exist, and are all zero, when h = 0; and also 
that F (0, Xy y) = 0. We have then 

F(hyX, y) ^ 1 d*-'F(h n _ liX , y) 

h n n\h n _i Shl\ 

where h n _ x is such that 0 < | h n _ x | < | h |; the value of h n _ x depending on 
the values of h, x , and y . 


Peano, Calc. Diff. p. 174. 
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If 


d"- 1 F (h, x, y) 

a*"- 1 


have partial differential coefficients with respect to 


h, x, y, that are continuous in the open domain for which ( x , y) is in Z), 
and h in the given open interval, then, employing the theorem of § 310, 
as extended to the case of a function of the three variables h, x } y y we 
see that 

\d n F(h,x, y)> 

1 dh- L„ + jK J’ 

where R converges to zero, as h n _ x converges to zero, uniformly for all 
points ( x , y) in a perfect domain I \, contained in Z>, and for all points 
h n _ y that correspond to a point h in a closed neighbourhood of h — 0, 
interior to the given open neighbourhood. 


d»-\F(h n _ lr x,y) 

?h”, ! 


Therefore, if h be sufficiently small, and | h | //, we have 

I F (h, x, y) _ 1 j d n F(h,x,y)} I 

I h n n\ | dh » l*.o | 

for all points ( x , y) in D l . 


Now let 


F (h, x, y) = / (x + h, y) - / (x, y) 

_ T 3 / (x, y) _ A 2 3 2 / (x, y) _ _ A"- 1 3 n_1 / (x, y) _ 

2! r)a; 2 (w — i)! ’ 


this satisfies the conditions that F and its first n — 1 differential coeffi¬ 
cients with respect to h all vanish when h — 0; we have therefore, pro¬ 
vided / ( x , y) and its partial differential coefficients of the first n orders 
are continuous in I), 


n, + Ky) , n *,y ) + KV%y) + + 


h n \d n f (x. y) 
n ! I 8x n 


where R' converges to zero, as h ~ 0, uniformly for all points ( x , y) in the 
perfect domain D l , interior to I). 


Similarly, we find that 

f(x,y + k) =■/ (x, y) + k 

where E" converges to zero, as k 


3 / (x, y) , k n (<ff 

dy r n ! j dy n 

~ 0, uniformly in D x . 


+ R" 


In the first of these equations, we can write y -f k for y, where k is so 
small that ( x , y -f k) is interior to Z), for all points (x } y), of D x . We have 
then 


f(x + h,y + k) -f(x,y + k) 

... h + k ) i , h ' 3 f / (x, y + k) h n 

dx ”' r! 3x r "' n ! 


(3 n / (x,+ k) 

\ .3x" ' 1 R 


where R'" converges to 0, as h — 0, uniformly in D x , and uniformly with 
respect to k , for all values which do not numerically exceed a fixed number. 
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Since all the partial differential coefficients of the first n orders are 
continuous in /), we have, provided | h |, | Jc, | are so small that, when 

(x, y) is in D x , the points (x | h , y -f k) are all interior to D, 

d r f (», V + k ) _ d r f (*, V) 7 . dr il f (*, 2/) , , p”/ 0*4 2/) , o } 

0# r 0x r 1 dydx T 1 ”* T (n — r)! \dy n ~ r dx r r ) 

where it? r converges uniformly to zero, as & — 0. 

Substituting this expression, for r — 1, 2, 3, ... ?/, in the expression for 
/ (x + /#., y f A:) — f (x,y f A:), and adding the expression for 

/ (*, y 4- Jc) - / (x, y); 

we obtain the formula 

/ (* + h ’ y + *> - f <*> y) “ A I + * | H h [ h L +* 4)7 

+ - 1 r!(*a* + *0y) f 

+ - + i\\{ h L^ k dy) f i R \’ 

where /£ *= « nl /* n 4- a n2 h n l k 4- ... 4- ct nTl Jc n ; and a wl , a w2 , ... a„„ all converge 
to zero, as //, Jc, converge in any manner to zero, uniformly for all points 
(x, y) of the perfect domain D l , interior to D . 


’ dx 


4- Jc 


dy 


.3 r f 


j / is used to denote 

0 r / , r (r - 1) 


7* r + rJi?~ l Jc a - - a - 1 

0# r dydx r 1 


2! 


-* drf + + *»/ 

dy*d&-* + + dtf' 


That 


0 n / 


0 W / 


^ ^ -~ follows from the condition of the continuity 

dx r dy n ~ r dy n ~ r dx r J 

of the partial differential coefficients of the first n orders of the domain D. 

That this is the case is verified by the fact that the expansion obtained 

above may also be found by exchanging the parts which x and y play in 

the process by which the expression was obtained. 

It has thus been shewn that: 


If f ( x , y ), and all its first n partial differential coefficients with respect to 
x and y are continuous in an open domain D, of (x, y), then for any pair of 
points (x -f h y y 4- Jc), (x,y), in D , 

f(x\h,y + k) - / (x, y) I (h l + k !)/ + 2f (* + k f 

+ -- + n\]{ h L + k dy) f+R }’ 

where R = a nl h n 4- a n2 h n ~ l Jc 4- ... 4- a nn Jc n ; and a nl , a n2 , ... a nn converge to 
zero , as h, Jc converge in any manner to zero , uniformly for all points (x, y) 
in a perfect domain interior to D. 
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This theorem is the generalization of that of § 310, which corresponds 
to the case n = 1. 

The expression (h ^ 4- k ^ f (x>y) is called the rth differential of 

f(x,y), and may be denoted by & (r) f (x, y); the theorem may then be 
written in the form 
f(x + h, y + k) - f(x , y) 

= «<«/(*, y) + 8<»/(*, y) + ... + ?/) + H). 


If ( x , y) be a fixed point (x 0 , y 0 ), and 
f(x„ + h,y 0 + k)-f(x 0 ,y 0 ) 

=■ 3 (1) /(*o, 2/o) + 2 | 8(2) /( a; o. 2/o) + ... 4 {8 <n) /(* 0 , i/ 0 ) -!-«}, 

where has the form and properties given above, the function / (.r, y) is 
said to possess an nth total differential at the point ( x Q , y 0 ). In accordance 
with the above theorem this nth total differential certainly exists in case 
there exists a plane neighbourhood of (.x 0 , y Q ) in which all the partial 
differential coefficients of the first n orders exist, and are continuous. But, 
as has been shewn in § 309, for the case n =- 1, less stringent conditions 
than these are sufficient to ensure the existence of the nth total differential 
at the point (x 0 , y 0 ). It has been shewn* by W. H. Young that, if / (x. ?/) 
have an (n — l)th total differential at ( x Q , y 0 ), while the partial differential 
coefficients of order n — 1 all exist, and are independent of the order of 
differentiation, in a closed neighbourhood of the point ( x 0 , y 0 ), and further, 
if they all have first differentials at that point, then the function has an 
nth total differential at (# 0 , y 0 ), so that the above expansion theorem holds 
for the number n. The subject has also been treatedf by Rademacher. 


FUNCTIONS DEFINED IMPLICITLY 

316. A function y, of the variable x , is said to be defined implicitly 
when there is given a functional relation between the variables of the 
form F (x, y) = 0; provided that this relation suffices to determine 
uniquely, in some domain of x , the corresponding values of y. 

The first general theorem as to the existence of the function y = (f> (x) } 
determined implicitly by a relation F (x, y) = 0, was given by Cauchy, 
for the case in which F ( x , y) can, in a neighbourhood of a fixed point 
(a, j3) at which F (a, [2) = 0, be represented by a convergent series pro¬ 
ceeding by powers of x — a, y ~ p. 

The theorem was freed by DiniJ from the restriction that F ( x , y ) 
must be analytic, and his theorem may be stated as follows: 

* Proc. Land. Math . Soc. (2), vol. vn (1908), p. 171. 

t Math. Annalen , vol. lxxix (1919), p. 340. X Analyst infinitesimale, vol. I, p. 102. 
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If F (x } y) is continuous in a neighbourhood of the point («, ft), and if 
exists , and is continuous in this neighbourhood , and does not vanish at 


dF 

dy 


the point (a, ft); then, corresponding to a sufficiently small positive number 8, 
another such number e can be so determined that , to each value of x in the open 
interval (a — e, a + e), there corresponds a unique value of y in the open 
interval (ft — 8, ft f- 8), so that these corresponding values of x and y satisfy 
the condition F (x, y) — 0. The function y — </> ( x ), so determined , is con¬ 
tinuous in the open interval (a — e, a I e). Moreover , if the further condition 
dF 

is satisfied that ^ exists , and is continuous , in the neighbourhood of (a, ft), 

the function </> (x) has a continuous differential coefficient in the open interval 
(a --€,« + e). 

dF 

Since is continuous in a neighbourhood of («, ft), and does not 

vanish at that point, a positive number h can be so determined that, if 
a - h < x < a 4- h, ft - h < y < ft -f h, 


dF 

dy 


does not vanish at (x, y), and has the same sign as at («, ft). Let it be 


assumed that the sign is positive. The number h can be so chosen that 
the closed neighbourhood (h) of the point (a, ft) is interior to the neigh- 

dF 

hour hood in which F and . are continuous. 

dy 

dF 

Since ^ is positive for ft — h < y < ft -f h, x a, and F (x, y) vanishes 

when x a, y ft, F (a, y) must be negative for ft - h s y < ft, and it is 
positive for ft < y ^ ft -f h. Since F (a, ft — h) is negative, a positive 
number k x (< h) can be so determined that F ( x, ft — h) < 0, for 

a — k x < x < a + k x . 

Similarly, a positive number k 2 h) can be so determined that 

F (x , ft + h)> 0, 

for a — k 2 < x < a f- k 2 . If k be the smaller of the two numbers k x , k z , 
we see that, for a — k < x < a -f~ k, F (x, ft -f h) > 0, and F (x, ft — h) < 0. 
dF 

Since > 0, for a ~-k<x<a + k, there must be one value of y , 

and only one such that | y — ft | < h, at which F (x , y) vanishes, for each 
value of x such that | x — a | < k. Thus the function y — </> (x) is deter¬ 
mined for | x — a | < k. When h and k have been determined, we may take 
8 ^ h\ then a value of c, ^ k, can be determined, corresponding to 8. 


If ^ are both continuous, when I x 
dx dy 1 


« I s h', | y - /S | £ k', 
where h' < h, k' < k, the difference AF of the values of the function F at 
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any two points (x )- Ax, y + Ay), (x, y) in that neighbourhood of (a, /}), 
is given by . _ ,SF \ . S F \ . 


(dx + p ) Ax + [dy + a ) Ay * 


where p, <7 are continuous functions that converge to zero, as Ax, Ay do so, 
uniformly for all the points (x, y). 

When y = <f> (x), y -f Ay = <f> (x + Ax ), we have AF = 0; thus 

a! = + + a )’ 

and the expression on the right-hand side converges uniformly to 

_ dF f dF 

dx/ dy ' 

Thus the differential coefficient <f>' (a;) exists, and is continuous in the open 
interval a — k < x < a + k. 


In case the function F (x, y) has the form / (y) — x, we have the case 
in which the function </> (x) is determined as the inverse of the function 
x = / (y). The theorem then takes the following form: 

Iff (y) and its differential coefficient /' (y) are continuous in a neighbour¬ 
hood of y — and if f (y) does not vanish when y = j8, then , corresponding 
to a sufficiently small positive number 8, another such number e can be so 
determined that , to each value of x in the open interval (a — e, a + c), there 
corresponds a unique value of y in the open interval (fi - 8, ft -f 8), such that 
the corresponding values of x and y satisfy the relation x ^ f (y). The function 
y — </> (x), so determined , is continuous , and has a continuous differential 
coefficient in the open interval (a — e, a -f e). 


316 1 . The following more general theorem was given* by W. H. Young: 

Let F (x f y) be continuous in a neighbourhood of the point (a, 6), at which 
F (a, b) — 0, both with respect to x and with respect to y, and let it be assumed 

dF 

that , in that neighbourhood , exists and is everywhere finite , and that it 

does not vanish at the point (a, b); then (1), there exists a function y = <f> (x) 
such that b = <f> (a), and such that , when this value of y is substituted in 
F (x, y), the condition F (x, y) == 0 is satisfied everywhere in a certain neigh¬ 
bourhood of (a, b). Also (2), if it be the case that , for each fixed value of x , 
F (x, y) is , in some closed neighbourhood of (a, b), a monotone function of y 
which is not constant in any interval , then the function y = <f> (x) is unique . 
Further (3), if F (x, y) be a continuous function of (x> y) in the dosed neigh¬ 
bourhood of (a, 6), then the unique function <f> (x) is a continuous function 


Proc. Lond. Math. Soc. (2), vol. vn (1909), p. 404. 
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of x. Lastly (4), if F ( x , y) possesses at (a, b) a first differential , then the 
function <f> (x) has, at x = a, a first differential coefficient <f>' (a), given by 


SF (a, b) 
da 


+ it (a) 


Si 11 (a, 6) 
db 


= 0 . 


We may assume that 


0F (a, 6) 
db 


is positive; then a positive number 


k 


can be so determined that F (a , y) is positive when 6 < y £ b -f k, and that 
it is negative when b — k ^ y < b. We may choose h so small that the 
points (a, b -f k), (a, b — k) are both interior to the neighbourhood of 
(x, y) in which F (x, y) is continuous with respect to x and with respect 
dF 

to y, and for which exists and is finite. A rectangle of which the corners 

are (a — h t b — k), (a + h, b — k ), (a — h, b 4- k), (a + h, b + k) may then 
be constructed, so as to be interior to this neighbourhood, and to be such 
that F (x, b + k) > 0, for a — h^x^a + h, and that F (x, b — k) < 0, for 
a — hsx&a + h, since F (x, y) is continuous with respect to x. The 
rectangle so constructed is such that the conditions of (1) are satisfied. 
For on each ordinate parallel to the y-axis, and contained in the rectangle, 
F (x, y) has both positive and negative values, and from the continuity of 
the function with respect to y , it follows that the function vanishes at one 
or more points which form a closed set. Taking on each ordinate the lower 
boundary of this closed set we obtain a function <f> (x) which satisfies the 
condition (1). 


In condition (2), F (x, y) is monotone along each ordinate and is not 
constant in any interval, the function accordingly vanishes at only one 
point on that ordinate, and the function <f> (x) is then unique. 


If further (3), F (x, y) is continuous with respect to (x, y) in the rect¬ 
angle, the plane set of its zeros is a closed set. Taking a sequence {x n } 
of values of x, which converges to the value x, the set of zeros (x n ,y n ) 
of F (x, y) converges to the zero (x, y), so that, for every such value of x, 
<f> (x) is the limit of <f> (x n ), and thus <f> (x) is a continuous function. 


If lastly (4), F (x, y) has a first differential at (a, 6), there is a closed 
neighbourhood of the point such that 

r dF (a, b) 


F (x, y) = (x- a) + (y - b) d 


db 


+ a 


where p and o converge to zero with x — a and y — b. Taking (x, y) to 
be a zero of F , we have ^ % ^ 


y-b 
x — a 


da 


+ P 


dF (a, 6) 


db 


+ a 
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As x approaches a in any manner, y being continuous has the limit b 9 
and thus p and o have the limits zero; we have then 

dF ^ b) + <t>' (a) 8F ^ b) -0. 
da db 

In the theorem either of the following conditions may be substituted 
for (2): 

dJF 

(2)', if exists throughout a closed neighbourhood of (a, b) y and has at 
no point therein the value zero , then the function y — <f> (x) is unique . 

tinuous at the point ( a , b) with respect to (x } y) y then the function y — <f> (a;) 
is unique. 

dF 

For, on an ordinate in the rectangle, g— must everywhere have the 

same sign, since, if it had both positive and negative values, it must take 
every value between its upper and lower boundaries (see § 278), and thus 
it must at least at one point have the value zero, which is not the case. 
It then follows that, for each value of x y F (x y y) is a monotone function 
of y y and thus the condition in (2) is satisfied. Therefore (2)' may be 
substituted for (2). 

Again, if the condition in (2)" is satisfied, it follows that, in some 

dF . 

closed neighbourhood of (a, b ), g- is everywhere different from zero; and 
thus the condition in (2)' is satisfied. 


317. With a view to the extension of Dini’s theorem of § 316, the 
following lemmas* will be required: 

If f (x i, # 2 , ... x v ) is continuous in an open p-dimensional set O , and is 
either (1), positive , or (2), different from zero , at every point of a closed set G 
contained in O y then a neighbourhood of G y open or closed y can be so deter¬ 
mined that the function has the same property (1), or (2), in this neighbourhood , 
as in G itself. 

Let K be a closed set that contains G , and is contained in O. Those 
points of K (if any) for which, in case (1), f & 0 , or in case (2), / = 0 , 
form a closed set k ; this follows from the continuity of /. The two closed 
sets G y k have no point in common, and therefore a neighbourhood of G 
can be determined (see § 113) so as to contain no points of k \ this neigh¬ 
bourhood can also be so determined as to be interior to K, and therefore 
to 0. In this neighbourhood the condition (1), or (2), is satisfied. 

* These lemmas were established by Bolza, see Vorlesungen iiber Variationsprinzip, pp. 155-- 
158. The proofs in the text were given by Hobson, Proc. Lond. Math. Soc. (2), vol. xiv (1914), 
p. 151. 
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If f , x 2 , ... x v ) is continuous in an open set O , and if it have the value 
zero at all points of a closed set G, interior to 0, a neighbourhood of G can be 
determined, interior to (), such that at every point of that neighbourhood 

\f(Xi,x 2 , ... x p ) I 

is less than an assigned positive number e. 

Let a closed neighbourhood H , of G , be determined, that is contained 
in O. The points of H (if any) at which | / | ^ e form a closed set L , which 
has no points in common with G. Determine a neighbourhood of G that 
contains no points of L, and is interior to II. At every point of this neigh¬ 
bourhood the condition | / | < e is satisfied, and it is interior to II , and 
therefore to O. 


318. Let us now suppose that x , in F ( x, y ), denotes a point 

(**1 > #2 > * * * *^ m ) 

of an ra-dimensional set, and further that F (x, y) involves p parameters 
Cj , c 2 , ... c P . We may then state the following generalization of the 
theorem of § 31G. 

Let c 1 , c 2 , ... c p have the systems of values corresponding to points in a 
p-dimensional open domain X v , and let the m -b 1 variables x lf x 2i ... x m , y 
have the systems of values representing the points in an (m -b 1 )-dimensional 
open domain D m , 1 ; the p -f m f 1 variables having accordingly the systems 
of values representing the points in an open domain 1) P fm fl . Let 

F(xj,x 2 , ... x m ,y\ Cj,c 2 , ... c„) 

be defined for all points in D p + m +i , and be such as to satisfy the following 
conditions: 

dF 

(1) . F is continuous , and ^ exists, and is continuous, in D v + m + x . 

(2) . At a fixed point (a } ,a 2 , ... a m , /9) in D m+l , F has the value zero, and 
dF 

4 0 , for all points (c x , c 2 , ... c p ) in X p . 

If 8 be an arbitrarily chosen positive number sufficiently small, another 
such number e can be determined, such that, to each value of y for which 
| y — | <8, there corresponds one and only one set of values of 

(*ii ^2 > ••• *m), 

for which \ x r — a r | < e,for r = 1 , 2, 3 , ... m, and for each point of any given 
closed domain X v interior to X v ; and these values of x l9 x 2 , ... x m , y are such 
that F (x } , x 2 , ... x m ,y\ c 1 ,c 2 ,... c p ) = 0. Thus a unique function 

y = <f> {x ly x 2 , ... z m ; Cj, c 2 , ... cf) 


is determined. 
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,, . x dF dF dF jdF dF dF „ . . . 

Moreover , if ^ ~- , ... =— , and * -, 6 , ... all exist , and are con- 

9 J dx 1 9 dx 2 dxj dc 1 i dc 2 dc v 

tinuous in D p+m+1 , the function <f> has continuous partial differential coeffi¬ 
cients with respect to the m + p variables x 1 , x 2 , ... x m , c 2 , c 2 , ... c P . 

The proof is similar to that in § 316, but it is necessary to employ the 
lemmas in § 317. 

The function F (x ly x 2 , ... x my y \ c Jy c 2 , ... c P ) has the value zero, when 
Xi — , x 2 — , ... x m — ct m , y — p y 

dF 

whatever set of values c 1 ,c 2f ... c P may have in X p . Since ^ - is, for the 

same sets of values, always different from zero, and is continuous in X py 
it has everywhere the same sign in X P \ let it be assumed that that sign is 
positive. A neighbourhood (h) of the set of points 
(«i, a 2 , ... a m , P; c lt c 2 , ... c P ) y 

where c 1 , c 2 , ... c v is in X P , a closed set interior to X P , can be so determined 

dF 

that, for all points in that neighbourhood, is positive. Thus for 

a r - h<x r < a r + h, (r =■= 1, 2, 3, ... m), ft —h<y<p + h, (Cj,c 2 , ... c P ) 

- dF 

in X py is positive. If x lf x 2f ... x m have the values a ly a 2y ... a mf since 
dF ^ 

g- > 0, as y is increased from p — h to p ~f A, F is increased, and therefore, 

since it is zero when y = p y it nrnst be negative when p — h £ y < p, and 
it must be positive when p < y < p + h. This holds for every point of X P . 
A neighbourhood of the (m -f p )-dimensional set of points 

( a l 3 a 2> ••• a m \ C 1 } C 2i *•• C J>) 

at which, for y = p — h y F is negative, can be so determined that F is 
negative in that neighbourhood. Thus, for a properly chosen positive 
number k x h) y 

F (x ly x 2 , ... x my p — h; c ly c 2 , ... c p ) < 0 
for a r — k Y < x r < a r + k ly (r = 1, 2, 3, ... m). 

Similarly, a positive number k 2 (^h) can be so determined that 
F (x ly x 2 , ... x my p + h] c lJ C 2 , ... c p ) > 0 
for a r — k 2 < x r < a r -f k 2y (r = 1, 2, 3, ... m). 

If k be the smaller of the two numbers k ly k 2y we see that, for 
a T — k < x r < a r -f k y (r = I, 2, 3, ... m), 

JP is negative when y = p — h y and positive when y = p -f A; whatever 

point (c t , c 2 , ... c„) may be of X P . Since ^ > 0, for all points such that 
a r - k< x r < a, + k, (r = 1, 2,... m), - h< y < p + h,(c lt c 2 , ...c p )inX p , 



318,319] Functions Defined Implicitly 439 


we see that, for each point of X P there is one point y, and only one, such 
that | y — | < h, at which F vanishes, for each set of values of 

X\ f %2» • • * • 

Thus a function y — <f> (#,, x 2 , ... x rn ; c lt c 2 , ... c„) is determined, for 


| x T - a | < lc, (r = 1 , 2 , ... m), 
and (c,, c 2 , ... c„) in X„, 

such that F vanishes when y has the value of <f>. 

The difference of the values of F at two points 
(x u x 2 , ... x m , y\ Cl, c 2 , ... c P ), 

(*i + Axj, x 2 + Ax 2 , ... x m + Ax m , y + Ay, c, f Ac lt c 2 + Ac 2 ,... c p + Ac„) 
is given by 




r ~PfdF 

L I ~ -f- T r 

r = i \dc r 


A c r ; 


where all the m -f p -f 1 letters oy, p, T r denote continuous functions that 
converge to zero, uniformly for all values of the variables in the closed 
set j5 m + P+1 interior to D m+P+1 . 


When y = <f> (x lf x 2 , ... x m ; c lf c 2 , ... c P ) 

and y -f Ay = (f> (x x + A^, x 2 -f A^ 2 , ...; c x + A c x , c 2 -f Ac 2 , ...), 
we have AF — 0. Thus if A^, Ac 2 , ... Ac„ all vanish, and 
Ax x , Aa; 2 , ... Ax r _ x , Aay. n , ... Ax m 


also vanish, we have 



and since oy, p converge uniformly to zero, we have 

d<f> __ __ dF idF 
dx r dx r / dy * 

In a similar manner, we see that 

d<t> = __ bf /dF 

dc r dc r / dy ' 

Thus the second part of the theorem has been established. 


319. The theorem of § 318 may be extended to the case in which y is 
replaced by n variables y 1 , y 2 , ... y n , and there are now n functions 
F Xi F lt ... F n which involve the m + w+ p variables. The theorem then 
takes the following form: 

Let c x , c 2 , ... c p have the system of values corresponding to points in a 
p-dimensional open domain X v , and let (x x , x 2 , ... x mt y l9 y 2 , ... y n ) have 
the system of values corresponding to the points in an (m -f nydimensional 
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open domain D m fn ; the m -f n -f- p variables having accordingly the values 
represented by points in an open domain D m+n + v . Let n functions 

Fr (p^l > *^2 f •** Vn i ^1 > ^2 > ^j>)> (r — 1, 2, 3, ... 7?/) 

be defined for all points in J) m tn fl) , awi be such as to satisfy the following 
conditions: 


(1) . The functions F r are continuous , and all the functions 

dF 

and are continuous , in D m+n ^ v . 

(2) . ^4/ a pom/ (^, a 2 , ... « TO , &, 0 2 , ... fi n ) in D m+n , all the r 

functions vanish , and the Jacobian 

d (F t , F 2 , ... jP w ) 

3 (.Vn y 2 , • •• y») 

a value which is not zero , /or all points (c x , ... cj in X v . 

If 8 be an arbitrarily chosen positive number , sufficiently small , another 
such number e can be so determined- that , /o each set of values of y x , y 2 , ... y 7J 
/or which | y r — fi r | <8, (r ■= 1, 2, 3, ... n), there corresponds one , and only 
one , 56/ o/ values of Xj, x 2 , ... /or which \ x r - a r \ < e, (r — 1, 2, ... m), 
/or each set of values of (c t , c 2 , ... c P ) m a closed domain X v , interior to X v ; 
and these values of x and y are such that the n functions F r are all zero. Thus 
unique functions y r --- <f> r (x l9 x 2 , ... # m ; c,, c 2 , ... cj, (r 1, 2, 3, ... n) are 
determined. 

,, .. 0/V C)F y . - 

Moreover , if ~ for 


r ~ 1, 2, ... w; .9 — 1, 2, 3, ... m; t -= 1, 2, 3, ... p, 

and are continuous in i) m+nfl) , the functions <f> r are all continuous and have 
continuous partial differential coefficients with respect to the m + p variables. 

The theorem of § 318 is the particular ease of this theorem which arises 
when ~ 1. Assume that the theorem holds when n ^ v ~ 1; it will then 
be shewn to hold when w, -= y, and thus it will hold generally. 


As before, if 


d(F 1 ,F 2 ,...F v ) >() 

d (y^ Vi* ••• y») 


where x r - a ri (r - 1 , 2, ... m), y r - fi r , (r = 1 , 2, 3, ... v), 

for all points (a l5 ... a v ) in X v , there exists a positive number h , such 
that the same inequality holds, provided | x r — a r | < h, \y r - fi r \ <h, 
for all points of X p a closed domain contained in X v . 

At a point at which the Jacobian is not zero it is impossible that all 
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dF dF dF 

the partial differential coefficients „ x , ^ — 1 , ... - 1 can vanish. Choosing 

fyi dy* dy* 

a number li (< h ), at each point of the closed domain E given by 
I Vr - Pr | ^ h', (r - 1, 2, 3, ... v), 

| x r — a r | g h', (r — 1 , 2, ... m), and (c l5 e 2 , ... c P ) 

in A",,, one at least of the above partial differential coefficients is different 
from zero. For any point of the closed set E, a closed neighbourhood of 
the point can be determined, in the whole of which one and the same 
differential coefficient is different from zero. By employing the Heine- 
Borel theorem, we see that a finite number of the cells that constitute 
these neighbourhoods exists, such that each point of E is interior to one 
at least of the cells of this finite set. We obtain in this manner a division 
of the set E into a finite number of parts, such that, in each part, one of 

the differential coefficients ^ 1 , ... 1 is different from zero in the 

tyi oy* oy* 

dF 

whole of that part. Let us assume that, in one of these parts, ^ 1 > 0, 
and that this part contains the points 


(«j, a 2 , ... a m9 ft, ft, ... ft; Cj, c 2 , ... c,p) 

in its interior, provided that (Cj, c 2 , ... c v ) belongs to a certain closed part 
of X p which we may denote by X p °. 

We now consider F 1 x 2 , ... a: m , y l9 y 2i ... y v \ c l9 c 2 , ... c P ) as a func¬ 
tion of x lf ;r 2 , ... x m , y l , and of the parameters y 2 , y 3 , ... y v , c lf c 2 , ... c v . 
Applying now the theorem of § 317, we see that positive numbers 
Aj, k x (u h') can be so determined that one and only one value of y l exists, 
such that \ yi — f$i \ < hi corresponding to each set of values of 


x i, x 2 , ... x m 

such that | x r — a r | < ft, (r — 1, 2, 3, ... m) 9 and provided 
(.y 2 > y 3 , ••• 2/.<> ci, c 2 ,... c P ) 

is in a certain closed domain of v | p — 1 dimensions. This value of y x 
may be denoted by 

lh =/i (* 1 > ••• x m, 2 / 2 , 2 / 3 > ••• 2/.d c 1# c 2 , ... c„). 

On substitution of this value of y } in the equations 
F 2 - 0, F s - 0, ... F v - 0, 

we obtain a set of v — 1 equations which may be denoted by 
<f> r (i x l9 x 2 , ... x m9 y 2 , y 3 , ... y v \ c l9 c 2 , ... c p ) - 0, 
where r 2, 3, ... v. These hold when | x r — a r | < k l9 and provided 
(2/2,2/3, 2 h\ Ci, c 2 , ... c P ) 
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is in the specified domain. Assuming the theorem to hold for n = v — 1, 
we see that there exist values of y 2 , t/ 3 , ... y v , given by equations 
y r = <f> r (x ly x 2 , ... x m ; c 1} c 2 , ... c p ) (r = 2, 3, ... v), 
and that these values are unique for every set of values of 

x 1} x 2i ... x mi Cj, c 2 , ... c py such that | x r — a r | < k 2 (r = 1,2,... m), 
and such that (<q, c 2 , ... c„) is in X v ° ; where ft is some positive number. 
The values of y 2 , y 3 , ... y v are such that | y r — ft | < h 3 > some fixed positive 
number; this must be so small that the points (y 2 , y z , ... y ¥) c x , c 2 , ... c*,) 
are all in the specified closed domain. This holds, subject to the condition 


that 


3 (<f> 2 > ft) 


+ 0 


3 (2/2,2/3, -•* y *) 
at the points (a ly a 2 , ... a m , ft, ft, ... ft.; c,, c 2 , ... c P ), 
where (c 1? c 2 , ... c p ) is in Jft 0 . 

The function f x has continuous partial differential coefficients with 
respect to all the variables contained in it. Thus 

% = - fiF 1 ( r== 2 3 y) 

dyjdyi^ ’ ” j 

8/, ! o 

w,~~ dxj >.*.•••») 

a/, _ w, /a^\ 

dc„ 


Now 




It (_ Wi /dFi\ 
V, V dyj dyj' 


dF r dF r 
3y s dy. 


On substituting the values of the partial differential coefficients in the 
Jacobian, we obtain, after a slight transformation, 

0 (F v ... *\) dF^ 
d (2/i, 2/ 2 > ••• 2/*) / fyi 9 
and thus the condition is that 


0(^,^ 2 ,...^) 

3 (2/1, 2/2, ••• 2/*) T 

at the points (<q, cq, ... a m , ft, ft, ... ft; c 2 , c 2 , ... c P ), where (c 1? c 2 , ... c*) 
is in X p °. 

It has now been shewn that there is a unique value of y 1 , such that 
I Vi — ft I < ft, for each set of values of 


y 2 > 2 / 3 , Vv, x ly x 2) ... a? m , (q, c 2 , ... c*, 
such that (y 2t y 3 , ... y v , c lt c 2 , ... c P ) is in a certain closed domain which 
includes a set of points for which 

y% = ft, 2/3 = ft, ••• V* = ft; and that | a: r - a r | < ft. 

It has also been shewn that a unique set of values of t/ 2 , y z , . .. y* , 
such that | y r — ft | < ft, (r = 2, 3, ... v) can then be determined for each 
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set of values of , x 2 , ... x m , c x , c 2 , ... c P ) such that | x r — a r | < k 2 and 
that (Cj, c 2 , ... c p ) is in X P °. We can choose any value of h ly or of Ji 2> 
smaller than the value fixed; then k x and k 2 will in general have to be 
diminished. By altering the value of one of the numbers h lf we may 
make them have equal values, say h\ and for k we then take the smaller 
of the corresponding values k lt k i . It now appears that there exists a 
unique set of values oiy lf y 2i ... y vy such that | y r — f$ r | <h f (r~ 1,2,... v), 
corresponding to each point 

(# 1 , x 2 , ... X m , Cj , c 2 , ... c p ) 

such that | x r — a r | < k, (r — 1, 2, ... m), and that fo, c 2 , ... c v ) is in X p °. 

As every point of X v is in one or more of a finite number of such sets 
as X P °, corresponding to each of which one of. the partial differential 
dF 

coefficients 1 is different from zero for the whole of that set, the above 

Zy r 

reasoning may be applied to each such set X v °. 

We have, in each case, a pair of positive numbers h , k, determined as 
above; moreover we may take, instead of h, any number less than h; 
k being correspondingly diminished. If we take for 8 the smallest of the 
numbers h, it is clear that a corresponding number e may be determined 
so that 8 and e can be used, instead of h , k, for all the parts of X p . It 
has thus been shewn that the theorem holds in the case n = v, if it is 
assumed to hold in the case n = v — 1. The theorem was proved in § 318, 
for the case n = 1; and therefore it holds generally. 

It is easy to see that the above general theorem may be modified so 
as to apply to the establishment of unique values of yi,y 2 , ... y n which 
satisfy the n equations 

F r (x 1 ,x i , ... x m , y t ,y 2 ,... y n \ c,,c 2 , ... c„) -/ r (c^c^ ... c„), 
where the functions f r (c x , c 2 , ... c P ) satisfy the conditions that they have 
continuous partial differential coefficients in the domain X v ; the functions 
F r are to satisfy the same conditions as before. 

We have, in fact, only to consider the functions F r — f rf instead of 
the functions F r . It is then assumed, as before, that F r — f r = 0 , when 
x r = a r , y r — p r , for all values of (c lf c 2 , ... c P ) in X P . 

An important case of the general theorem is that in which n = m, 
and in which F r is of the form <f> r (y lt y 2 > ••• Vn\ c i> c z> ••• c p) — x r - We 
have then an extension of the theorem of inversion. 

The case of the general theorem which arises when c 1 ,c 2i ... c p are 
absent is the well-known theorem of Dini relating to implicit functions*. 

* See Jordan's Cours d'Analyse, vol. I, §§ 91, 92; also Osgood’s Lehrbuch der Funktionentheorxe, 
vol. I, pp. 47-57. On the general theory of functions defined implicitly, see W. H. Young, Proc. 
Lond. Math. Soc. (2), vol. vii (1909), p. 397; Hobson, Proc. Lond. Math. Soc. (2), vol. xiv (1914), 
p. 147, and Hedriok and Westfall, Bull, de la eoc. math, de France y vol. xlvi (1916), p. 1. 
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MAXIMA AND MINIMA OF A FUNCTION OF TWO VARIABLES 

320. Let us suppose that a function / ( x , y) is defined at all points in 
a two-dimensional neighbourhood of the point ( x 0 , y 0 ). 

If the function be such that / (a; 0 -f h, y 0 + k) — f (x 0 , y Q ) < 0, for all 
values of h , k which arc not both zero, and are such that | h |, | k | are 
both less than some fixed positive number 8, then the function / (x,'y) is 
said to have a proper maximum at the point (# 0 , y Q ). 

In case the fixed number 8 can only be so determined that the con¬ 
dition / (x 0 + h 9 y 0 -f- k) — f ( x 0 , y 0 ) 0 is satisfied, the function is said to 

have an improper maximum at the point (x 0 , y 0 ). 

If the conditions contained in these definitions be replaced by 
/ (3b + Vo + k) - f (x 0 , y 0 ) > 0, and / (x 0 + h, y 0 + k) ~ f (x 0 , y Q ) > 0 
respectively, the function / (x, y) is said to have, in the first case, a proper 
minimum , and in the second case, an improper minimum , at the point 
(* 0 > Vo)- 

A proper or improper maximum or minimum may be spoken of as an 
extreme of the function. 


At an extreme (x 0 , y 0 ), 

/ (*0 + A, y 0 ) -f(x 0 , y a ), f (x 0 - h, y„) -f(x„, y u ) 
both have the same sign, or are zero, for all sufficiently small values of h; 
8/* (x u ) 

it follows that, if - ■— exist, it must be zero. A similar remark applies 


to 


3 /(* 0 > Vo) 

tyo 


These conditions are necessary, under the hypothesis of the existence 
of the two partial differential coefficients, but not sufficient, for the 
existence of an extreme at the point (x 0 , y 0 ). 


If we write x = x 0 ~f r cos 6 , y — y 0 + r sin 6, f (x, y) -= </> (r, 6), it is 

clearly necessary for the existence of an extreme of f(x,y) at (x 0i y 0 ), 

that (f> (r, 6 ), for each constant value of 6 , should have an extreme at 
r — 0. Thus, for an assigned value of 6, a positive number a& can be 
determined, such that one of the four conditions 

4> (r, 0) -f (x 0 , y 0 ) < 0, 4> (r, 0) - f ( x 0 , y 0 ) i 0, 

<f> (r, 0) ~f(x o, y 0 ) > 0 , <j> (r, 0) ~f(x 0 , y 0 ) a 0 , 

according as the point is a proper maximum, an improper maximum, a 
proper minimum, or an improper minimum, shall be satisfied for all values 
of r, different from zero, and such that | r | < a&. Thus an extreme of a 
function is necessarily an extreme for values of the function on eaoh 
straight line drawn through the point. 
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This condition, though necessary, is however not sufficient; for a B may 
have a definite value for each value of 0 , and yet the lower boundary of 
for all values of 6 , may be zero. In this case, no value of 8 can be deter¬ 
mined, as required in the definition of the extreme in the two-dimensional 
domain. It has thus been shewn that, in order that (# 0 , yf) may be an 
extreme point for the function f ( x , y ), it is necessary and sufficient (1), that 
r = 0 should be an extreme point of <f> (r, 6) for each value of 0, and (2), that* 
the number ao which is so determined for each value of 0 that , for | r | < a#, 
the condition as to <f> (r, 6) — f (x 0 , y 0 ) may be satisfied , should have a finite 
lower boundary , when all values of 6, (0 ^ 6 ^ tt) are considered. If the lower 
boundary of ao be zero, the point is not an extreme point of the function. 


When the lower boundary of a B is d (> 0), the neighbourhood of (x 0f y 0 ), 
which must exist in accordance with the definition, is the square of which 


the corners are the four points 


d d. \ 

± V2’ 2/ “ *V2'- 


EXAMPLE 

As an example of a function which possesses no minimum at a point, although the point 
is a minimum for each straight line through the point, we may take the function 
(y - ax 2 ) (y - bx 2 ) = y 2 - y (ax 2 + 6a: 2 ) + abx*, 
where a and b have positive values. 

The function is positive outside the two para- 
bolas y _ ax 2 — 0 , y - bx 2 — 0 , 

and in the space interior to the inner parabola; in 
the space between the parabolas, the function is 
negative. Along any straight line QAR through 
A (0, 0), the function exceeds / (0, 0) at all points 
interior to AP, and everywhere in PA produced; 
thus for the line QAR the function has a minimum 
at A. The point (0, 0) is not a minimum of the 
function, since the lower limit of AP for all posi¬ 
tions of QAR is zero; and thus there exists no two- 
dimensional neighbourhood of A, in which the function is never less than at A. 

321. We may, without loss of generality, take the point at which the 
conditions for the existence of an extreme of the function / (x , y) are to 
be investigated to be the point (0, 0). It will be assumed that, at all 
points in the neighbourhood of (0, 0), / (x } y) is continuous with respect 
to x , and also with respect to y. The following theorem contains a criterion 
for the existence of a proper maximum (minimum) at the point (0, 0). 

* The necessity for this condition has been disregarded in many text-books. The insufficiency 
of (1) was first pointed out by Peano, Calcolo diff., Turin, 1884, p. 29, in connection with the 
example given in the text. See also Dantscher, Math. Annalen , vol. xlii (1893), p. 89, and 
Scheefifer, Math. Annalen , vol. xxxv (1890), p. 541. 
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The necessary and sufficient conditions that the point (0, 0) may be 
a point at which / (x, y) has a proper maximum (minimum) are the 
following*:— (1). A positive number 8 must exist which is such that, if x be 
any number different from zero, and numerically less than 8, the upper (lower) 
limit of f (x , y), for such constant value of x, and for all values of y for which 
— x £ y £ x, being f (x, <f> (x)), this upper (lower) limit is, for every value of 
x (-- 8 < x 4= 0 < 8), less (greater) than f (0, 0). 

(2). A positive number 8' must exist which is such that , if y be any number 
different from zero, and numerically less than 8', the upper (lower) limit of 
f ( x > y)> f°r such constant value of y, and for all values of x for which 

- y ^ x y, 

being f (if (y), y), this upper (lower) limit is, for every value of 

y(-8'<y*0<8'), 

less (greater) than f (0, 0). 

It will be observed that, since / (x, y) is assumed to be continuous with 
respect to x, and also with respeot to y, the limit / (x, <f (x)) is actually 
attained for some value (x), of y, in the interval (— x, x), and the limit 
f (if (y), y) is actually attained for some value if (y), of x, in the interval 
(— y, y). It is clear that, unless both the conditions stated in the theorem 
be satisfied, / (0, 0) cannot be a proper maximum (minimum) of the 
function. If, for example, no such number as 8 in (1) can be determined, 
there are points in every neighbourhood of (0, 0) at which / (x, y) is 
*(*)/( 0 , 0 ). 

The conditions are sufficient. For, if 8, 8' exist, the value of / (x, y) at 
every point, except (0, 0), within the neighbourhood, the comers of which 
are the four points (± 8", ± 8"), is less (greater) than /(0, 0), where 8" 
is the lesser of the two numbers 8, 8'. 

The necessary and sufficient conditions that the function / (x, y) may 
have an improper maximum (minimum) at (0, 0) are similar to the above. 
In this case / (x, <f (x)) must be less than, or equal to (greater than, or 
equal to) / (0, 0), for all the values of # in the interval, and / (if (y), y) 
must be less than, or equal to (greater than, or equal to) / (0, 0), for all 
values of y in the interval. Further, corresponding to every positive 
number 8 < 8, there must be a value of x (< 8), for which 

f(x, <f (x)) = /(0, 0); 

or else a similar condition must hold for / (if (y), y) \ or in both cases, the 
condition may be satisfied. 

Other methods of determining whether (0, 0) be a point at which there 
is a maximum or minimum of / (x, y) will be dealt with in Vol. II. 

* See Stolz, Wiener Sitzungsber ., vol. on a (1891), p. 1167; also OrundzUge, vol. I (1893), p. 213. 
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PROPERTIES OF A FUNCTION CONTINUOUS WITH RESPECT TO 
EACH VARIABLE 

322. Let a function / (x, y), defined for all values of x and y in a con¬ 
tinuous domain, be everywhere continuous with respect to y, and be also 
continuous with respect to x for every straight line parallel to the x-axis, 
belonging to a set cutting the y-axis in an everywhere dense set of points. 

Let A be the point (x, y), and let BC be drawn with A as its middle 
point, parallel to the y-axis, and of length 2 p. If cu ( p ) be the fluctuation 
of / (x, y) in the interval BC , then oj (p) is a continuous function of p\ and 
lim a j (p) = 0 , since / (x, y) is everywhere continuous with respect to y. 

p~o 

Let a be a fixed positive number, and let (x , y) denote the upper limit 
of those values of p for which t o (p) < a: thus a > (p) £ a, if p ^ j 8 a ( x , y); 
and w (p) > a, if p > (x, y). 

The function p a (x, y), thus defined for every point (x, y), is everywhere 
positive; and it will be shewn to be an upper semi-continuous function 
with respect to the two-dimensional domain (x, y), in accordance with 
the definition in § 230 and § 234. 

Take B 0 A 0 ~ ^4 0 ^o “ (x 0 , y () ); and also ~ where e is 

a fixed positive number. The fluctuation of f (x, y) 
in B 1 C 1 is greater than cr; let it be a + k. If k x be 
a fixed positive number < k , two points M , N can 
be found in B 1 C 1 , such that 

\f(M)-f(N)\><r + k 1 . 

Moreover, these points M, N can be so chosen as 
to lie on two straight lines parallel to the x-axis, 
which belong to the set along each of which / (x, y) 
is continuous with respect to x; this follows from 
the fact that this set of straight lines cuts B 1 C 1 in 
an everywhere dense set of points. Since / (x, y) is 
continuous with respect to x, at each of the points 
M , N, two segments M'M", N'N", with M and N 
as their middle points, can be determined, so as to 
have equal lengths 28, and to be such that 

\f(P)-f(M)\<\k l , 

\f(Q)-f(N)\<lk 1 , 

provided P be any point in M'M ", and Q be any 
point in N'N". Fig. 5 

From these inequalities and the former one, we deduce that 

I/(P)-/(«)|> a. 
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Take the square of which A 0 is the centre, and of which the sides are 
parallel to the axes, and are at a distance from A 0 less than the smaller 
of the two numbers and 8. If A be any point in this square, the distance 
of A from each of the straight lines M'M", N’N" is less than (x 0 , y 0 ) t e. 
Through A let a straight line be drawn parallel to the y- axis, and mark off 
on it the segment of which A is the centre, and of which the half-length 
is P*(x 0i y Q ) -f- this segment will cut M'M " and A 7 '.V", and therefore 
contains two points P, Q which are such that 

\f(P)-f(Q)\>o. 

Therefore the fluctuation of f (x, y) in this segment is > <r, and hence, at 
the point A (x, ?/), we have ( x , y) < (x 0 , y 0 ) -he. A square having 
been determined with its centre at A 0 , such that for every point in this 

S( i uare P.(*,y)<P*(*o,yo) +€, 

it follows that (x, y) is an upper semi-continuous function at A 0 , with 
respect to the two-dimensional domain (x, y). 

323. Let us now consider the linear set G , of points (x, y) defined by 
y = <f> (&), where </> (x) is a continuous function of x . At each point of C, 
the function ( x, y) is defined, and it has at every point a minimum 
relatively to the set C, the term being used in accordance with the defini¬ 
tion given in § 224. 

If, at a point A 0 (x 0 , y 0 ), of G , the function (x, y) have its minimum 
with respect to C positive, then we shall prove that the saltus of / (x, y) 
at A 0 , with respect to the two-dimensional domain (#, y), is 2cr. Let y 
denote this minimum, and let y x be a positive number < y. 

Let an interval (x 0 — 8, x 0 |- 8) on the line y — y 0 be so determined 

that in this interval 

I <f> (*) - <f> (xo) I < h\ ■ 

This interval may, if necessary, be so 
reduced, that for all values of x in it, 

P* {*. <t> (*)} > Yi ■ 

Describe the rectangle B, with A 0 as 
centre, the sides parallel to the axes 
of x and y being 28 and y x respectively. 
On every segment PQ , of R, parallel 
to Oy , the fluctuation of the function 
is r, <t; for there is on PQ a point A 
of the set C , and the segment with 
centre A , and length (A) > 2y ly 
contains the whole segment PQ, 

Taking a fixed positive number €, an area surrounding A 0 can be 
determined, in which the fluctuation of / (x, y) is £ 2a + c. To effect this. 
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take a point A x on the ordinate through A 0 , and in the rectangle R, such 
that A 1 is a point of continuity of / (x, y) with respect to x ; then on the 
straight line y — y x , take a segment B 1 G\, with centre A x , and of length 
28' 28, such that the fluctuation of / in B l C\ is < e. Consider the rect¬ 

angle R' contained in R , such that the sides of R' are of lengths 28' and y q 
parallel to the axes, its centre being at A 0 . The fluctuation of / ( x, y) in 
this rectangle is < 2a + e. For if M, N be any two points in it, let M x , A { 
be their projections on B 1 C 1 \ then 

I / (M) ~ / (M x ) I s or, | / (A 7 ) — / (A-,) | £ a, \f(M 1 )~f(N l )\<e; 
from these inequalities we deduce that 

I f{M) — / (N) | < 2<r • c. 

Since this holds for every e, the saltus of f (x, y), at A {) , is 2a. If, at a 
point A 0 , the saltus of f (x, y) be > 2a, then, at A 0 , the minimum of 
with respect to C must be zero. 

Since is positive at every point of 6/ and is an upper semi-continuous 
function of (x, y), it follows from the theorem of § 232, that, in every arc I) 
of the curve G, there exists an arc I) l in which the minimum of is 
positive. Let us take a sequence a,, a 2 , ... a n ... of positive decreasing 
numbers, of which the limit is zero. It is then clear that, in every arc D, 
there exists a point whore f3 an has its minimum, with respect to C , positive, 
for every a„. At this point the fluctuation of / ( x , y) with respect to the 
two-dimensional domain ( x , y) is 2a n , for all values of n, and is therefore 
zero. This point must be a point of continuity of / (or, y) with respect to 
(*, 2 /). 

The following general theorem* has thus been established: 

If f ( x > V) be a function of the two variables x , y, which is everywhere 
continuous with respect to y, and is continuous with respect to x along straight 
lines parallel to the x-axis , which cut the y-axis in an everywhere dense set 
of points, then in every portion of a curve y — </> (x), where (f> (x) is a con¬ 
tinuous function, there exist points at which f (x, y) is continuous with respect 
to the two-dimensional domain (x, y). 

It follows from this theorem that points of continuity exist in every 
area, that is / (x, y) is at most a point-wise discontinuous function. 

The whole of the reasoning above is applicable, if only those points 
of ( x , y) are taken into account which belong to a perfect set G. It thus 
appears that, under the conditions stated in the above theorem, / (x, y) is 
a point-wise discontinuous function relatively to every perfect set G f of 
points in (x, y). The points of continuity of / (x, y), on the curve y ^ <f> (x), 
are everywhere dense with respect to every perfect set of points on the 
curve. 


* Baire, Annali di Mat. Ser, m a , vol. in (1899), p. 27. 
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EXAMPLES 

1 . If* / (x, y) be a function which is everywhere continuous with respect to each of the 
variables x, y , then the points at which the saltus of / (x, y) f with respect to the two- 
dimensional continuum (x, y), is ^ a, form a set of points such that the projection of the 
set on either axis, by lines parallel to the other axis, is a non-dense set. 

2. Iff a function f ( x , y. z ) of throe variables x, y, z be everywhere continuous with 
respect to each variable, then / (x, y, z) is at most a point-wise discontinuous function rela¬ 
tively to the three-dimensional continuum (x, y, z). Further, on every surface x—<f>(y y z ), 
where <f> is continuous with respect to (y, z), the function / (x, ?/, z) is at most a point-wise 
discontinuous function with respect to ( y , z). The set of points at which the saltus of 
/ (x, y, z) ^ a may contain all the points of a continuous curve. 

3. Letf 4> (z> y) be a function which is continuous with respect to each of the variables 
x and y y and let (0, 0) be a point of discontinuity of <f> (x, y) with respect to (x, y). Define 
/ (x, y, z) by the condition / (x, y, z) = tf> (x, y); then the function / (x, y, z) is continuous 
with respect to each of the three variables, but every point on the z-axis is a point of dis¬ 
continuity with respect to (x, y, z). 

4. Letf fi x > V > z ) be a function which is constant along any straight line parallel to the 

xy (x _ y) 

straight line x = y-z, and is such that / (x, y, 0) = —-/(0, 0, 0) = 0. This function 

(X 2 + ?/ 2 )5 

is discontinuous at every point on the straight line z = y = z. 

5. LetJ /(0, 0) = 0; at all points at which y is positive and x 2 /y ^ 1, let /(x, y) = x 2 /y, 
and when x 2 /;j/ £ 1, let / (x, y) = y/x 2 . Also let / (x, -y) = / (x, y). The origin is a point of 
discontinuity, at which the saltus is 1; elsewhere the function is continuous with respqpt 
to (x, y). The function is however continuous with respect to every straight line; for it is 
continuous along the axis of x, where it has the value 0. Consider now the straight line 
y — rnx, where m + 0; it lies entirely in the part of the plane in which 

I/(*. y) |=*V I y 1 = 1 */*» |. 

so that, as the point (0, 0) is approached, / (x, y) converges to 0. 

(i. LetJ ( a i> &i)» ( a 2 > ^)> ••• ( a n>b n ) ••• denote an enumerable set of points, and let 
F{x, y) = } 2 f(x-a 1 ,y-b J ) + ~f{x-a i , y - b 2 )+...+^ h f (x-a nt y-b n ) + ..., 

where / (x, y) denotes the function defined in Ex. 5. 

The function F (x, y) is continuous with respect to any straight line, and is continuous 
with respect to (x, y) at all points except those of the enumerable set. The enumerable set 
may be everywhere dense in the plane. 

324. The methods developed by Baire of dealing with functions of 
two or more variables, in relation to the distribution of the points of dis¬ 
continuity, have been applied by him to the consideration of the following 
three problems: 

(1). What must be the nature of a function <f> ( x ), defined for a & x £ 
in order that a function f ( x , y) can exist which is defined for all points 

* Baire, loc. cit. p. 94. t Baire, loc. cit. p. 99. 

% W. H. and G. C. Young, Quarterly Journal of Math. vol. xli (1910), p. 87, where an example 
ia also constructed of a function that is continuous with respect to every straight line but is 
discontinuous with respect to (x, y) at points of an everywhere dense unenumerable set. 
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in the square a & x & /?, a & y & ft, and is continuous at every point with 
respect to x and with respect to y , and moreover is equal to <f> {x) on the 
straight line x = 

(2) . What must be the nature of a function <f> (x), defined for a ? x s 

in order that a function / {x,y) can be defined for all points in the square 
a £ x £ ft, a £ y £ ft, which shall satisfy the conditions that it is continuous 
with respect to ( x , y) at every point for which y > 0, is continuous with 
respect to y at the points of y — 0, and is equal to <f> (x) when y — 0? 

(3) . A function / (x, y) is defined in the rectangle a ^ x ft, y % y £ 8, 
and is everywhere continuous with respect to y. Further, there is a set of 
parallels to the #-axis, along each of which / (x, y) is continuous with 
respect to x; these parallels intersecting the straight line x ^ a in a set of 
points which is everywhere dense in the interval (y, 8). What is the nature 
of the function / (x, y) on a continuous curve drawn in the rectangle ? 

The problems (1), (2) are particular cases of (3). It has been shewn 
above that a necessary condition satisfied by / ( x , y ), in (3), is that it 
should be a point-wise discontinuous function relatively to every perfect 
set of points. That this condition is also sufficient, has been demonstrated 
by Baire in his memoir quoted above. A proof of this will be given, for 
the case of problem (2), in Vol. II, in connection with the theory of 
functions representable as the limits of sequences of functions. 

THE REPRESENTATION OF A SQUARE ON A LINEAR INTERVAL 

325. Let a point of a square whose side is unity be denoted by ( x, y) y 
where 0 £ x £ 1, 0 £ y £ 1; and let t denote a point of a linear interval 
(0, 1). An account has been given in § 62 of Cantor’s method of estab¬ 
lishing a (1, 1) correspondence between the points of the square and those 
of the linear interval. Such a correspondence denotes functional relations 
x = / (t), y = <f> (t) between x> y as dependent variables, and t as an inde¬ 
pendent variable. It will be shewn however that no (1, 1) relation between 
the two sets of points can be a continuous representation*; i.e. it is im¬ 
possible that the functions/ (t), </> ( t) can be both continuous. 

Let us assume that such a continuous representation can be defined. 
To any closed set of points {£}, in (0, 1), there will correspond a closed set 
in the plane area. For if ... t nf ... be a convergent sequence of 

points t , of which t u is the limiting point, then the point / (£ w ), <f> (£ w ) is 
the limiting point of the set of points (x lt y x ), (x 2 , y 2 )> ... (# n > Vn), ••• which 
correspond to t 2t ... t n , ... respectively; therefore to a closed set {£} 

♦ See Netto, “ Beitrag zur Mannigfaltigkeitslehre,” CreUe's Journal, vol. lxxxvi (1879), p. 263, 
also Loria, (Horn, di Mai. vol. xxv (1887), p. 97. In the proof given by these writers it is assumed 
that a closed curve corresponds to a linear sub-interval of (0, 1); this is not necessarily the case, 
for a non-dense closed set may correspond to the closed curve. 
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there corresponds a closed set {(, x , y)}. Again, to a convergent sequence 
(a?i, y x ), ( x 2y y 2 ), ... of points in the plane area, there corresponds a set of 
points t Xy t 2y ... in the linear interval, the latter of which has a limiting 
point which must correspond to (x u , yj ); and since only one value of t 
corresponds to one set of values of ( x , y), there can be only one such 
limiting point t M . Thus, to a closed set in the plane, there corresponds a 
closed set in the linear interval. Take two points t x , t 2 in the interval 
( 0 , 1 ); these points correspond to two points P x , P 2 in the square area. 
To the closed linear interval (t x , t 2 ) there corresponds a closed set 8 which 
contains the points P Xl P 2 . It can be shewn that there are points other 
than P l9 P 2 on the frontier of 8. Denote by C (8) the set of those points 
of the square area which do not belong to 8. Two points Q , R in the 
square can be determined, such that Q lies on the straight line P X P 2 , and 
R does not lie on this straight line; such that neither Q nor R coincides 
with P x or P 2 , and such that one of the two belongs to 8 and the other 
to C (8). The closed set consisting of the straight line QR contains points 
both of 8 and of G (8 ); those points of 8 which lie on it form a closed 
set, and there must be one such point of S at least which is on the frontier 
of 8; such a point may, or may not, coincide with Q or R. Since then 8 
contains points on its frontier besides P x and P 2 , we can take a point t> 
within the linear interval (^,4), such that the point T in the square 
which corresponds to it is on the frontier of 8. Since T is the limiting 
point of a sequence of points of C (S), it follows that t must be the limiting 
point of a sequence of points all of which are external to the interval 
(4, t 2 ); and this is impossible. It has thus been established that: 

No continuous (1,1) correspondence can exist between all the points in a 
square and all the points in a linear interval. 

In particular, the correspondence shewn by Cantor to exist must be 
discontinuous. 

326. The reasoning of § 325 would be inapplicable if the correspondence 
x = / (t), y <f> (t) were such that, to a given point (x y y) more than one 
point t may correspond, the functions / (t), <f> (t) being still one-valued 
continuous functions, so that if t be assigned, (x y y) is uniquely determined. 
In this case, the limiting point of the set of points external to the interval 
(ty , t 2 ) would be not t y but another value of t which also corresponds to 
the point T. 

Peano* gave the first continuous correspondence of the kind just 
indicated, thus defining what, by a considerable extension of the use of 
geometrical language, may be called a continuous curve which passes 
through every point of the square at least once. 


“Sur une courbe, qui remplit toute une air© plane,” Math. Ann. vol. xxxvi (1890), p. 157. 
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Let the points in the interval ( 0 , 1 ) be expressed in the form 
t ^ ^3 • •. • *» , 

in radix fractions in the ternary scale, so that each a is either 0 , 1 , or 2 . 
Let k (a) denote the number 2 — a, so that k ( 2 ) = 0 , k ( 1 ) = 1 , k ( 0 ) = 2 ; 
and let k n (a) denote the result of performing this operation n times, so 
that k n (a) is a or 2 — a, according as n is even or odd. 

Let x , y be defined, for a prescribed t , by 

x~\b t b 3 ..., y - -CjCgCj 

the ternary scale being again employed; the numbers 6 , c being defined 
by the relations 

= a 1; 6 2 = (a 3 ), ... 6 n +««-. (a 2 „_j), 

Cl = (a 2 ), c 2 = £«■ *-«» (o 4 ), ... c„ = a.+...+o»-. (a 2n ); 

thus 6 n is equal to a 2n _ x or to 2 — a 2n , according as a 2 -f- a 4 -f ... -f a 2 n -2 
is even or odd. 

The numbers t may be divided into two classes: 

( 1 ) . Those, other than 0 or 1 , which are capable of a double represen¬ 
tation j •a 1 a 2 a 3 ... a n 222 ... = ... a n -f- 1000 _ 

( 2 ) . Those which have a single representation only. 

If t be a number of the second class, x and y are uniquely defined. If 
t be a number of the first class 

t = 'a x a 2 ... a n 222 ... = -a x a 2 ... a n + 1000 ..., 
let * 6 1 6 2 6 3 ..., \'b 2 b 2 • •• denote the numbers obtained by applying the 
definition of x to the two modes of representation of t. If n is even, say 
2 m, it is clear that 

6 i = , ft 2 = ^ 2 , ... b m = b m ; 

also b m+1 ~ k a * la * f -■ f atm ( 2 ), b' m+1 = k a * +a *+- +atm + 1 ( 0 ), 

bm+2 = 1 a2w 1 2 ( 2 ), 6' m+2 = k a '+ a *+-- ^ a*m+l ( 0 ), 


hence ^ m+l> ^m+2 ^ m+2 > *•* » 

and thus x has the same value whichever of the two forms for t is em¬ 
ployed; the case in which n is odd may be similarly treated. 

The same result can readily be shewn to hold for y. Therefore, corre¬ 
sponding to any assigned t, x and y are uniquely determined. 

Next, let us suppose x and y to be assigned. We have 

«i = b l9 a 2 -= ^ (cj, Os - 4* (6 a ), a 4 = k b >+ b > (c 2 ), ... 

»2n-l = + (f>n)y «2n = (C n ) ; 

for, if p = & r (q), then p + q is an even number. 

In case x , y are both of the second class, t is uniquely determined. 
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If x is of the first class, and y of the second; let 

x = 'b x b t ... 6 n 222 ... = ^b x b 2 ... b n -f 1000 ..., 
y — ‘CjCj ... c n c n+1 ..., 

and let the two values of t be denoted by -a x a z a z ..., -a/ a 2 '_ 

It is clear that 

= a x , a a = a 2 , ... ®2n-l ~ ® 2«-l> 

also 

a 2n = * ( a ' 2n )> «2n+l = * C, + C, + - 1 < ’" (K), = &>+'• + - + * (6„ + 1); 

thus a 2n+1 , &' 2 «+i are n °t identical, although a 2n , a 2n ' will be so if each is 
unity. It is thus seen that t has two distinct values corresponding to one 
point ( x , y), when x is a number of the first class, and y is of the second 
class. It can be shewn in a similar manner that there are four points t 
corresponding to a single point (x, y) such that x, y are both numbers of 
the first class. 

The correspondence is continuous. For if t , t' are identical as regards 
the first 2n figures, x and x' are identical as regards their first n figures, 
and the same is true of y and y'. 

The curve which has thus been defined is a continuous curve which 
passes through each point in the square at least once; there is an every¬ 
where dense enumerable set of points through each of which the curve 
passes twice, and another everywhere dense enumerable set of points 
through each of which it passes four times; through each point of the 
remaining unenumerable set of points, the curve passes once only. 

The plane measure of an arc of Peano’s curve which corresponds to 
an interval (t 0 , t x ) is not zero, i.e. the area which a number of rectangles 
enclosing all the points of the arc have in common has a lower limit 
greater than zero. 

The two continuous functions f (t), <f> (t), which define x , y as functions 
of t , do not possess, for any value of t , definite differential coefficients, and 
are perhaps the simplest examples of continuous non-differentiable func¬ 
tions. 

327. It might at first sight appear that a curve having the same pro¬ 
perties as that of Peano might have been defined by restricting t = 'a x a z ... 
to be such that an infinite number of digits other than 0 are present, and 
then defining x, y by 

X = 'CliCL^d^ ..., y = *& 2 ®4^'6 

If however the double representation of x , y were not restricted, as in the 
case of t , there would be no value of t corresponding to, say, 

x = -1000 ..., y = *2000 _ 
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If (x, y) were on the other hand so restricted, there would be no values of 
(Xy y) corresponding, for example, to 

< = •111010101 .... 

It thus appears that some such rule as that given by Peano is necessary 
to obviate the difficulty caused by the double representation of a certain 
class of rational numbers, in a given scale. 

The method may easily be extended to obtain a continuous corre¬ 
spondence between the points in a cube and those in a linear interval. 

« 

A somewhat different method of establishing correspondence between 
the points of the square, and those of the linear interval, is the following*: 
Let t x denote one of the perfect set of points defined by 


<i 


a x a 2 
3 f 3 2 


-f 


3 3 


+ 


when every a is either 0 or 2. For such a point t x , x and y may be defined by 


( a i , a 3 

2 V 2 2 2 

_ /a* , «4 
2 \ 2 2 2 


+ 


2 3 


4 - 


2 3 




A point t which does not belong to the perfect set is interior to one of 
the complementary intervals (t x , t x ") of the set; in such an interval we 
may define x, y as linear functions of <, thus 

* x ' + v ^ ~ l ^’ 
y ~ y' + y r y r , (t - O, 

h ~~ h 

where (x f , y') t ( x n , y") correspond to t Xi t x " respectively. 


328. A method of constructing a continuous curve which fills a square 
has been given in a geometrical form by Hilbertf. 

Divide the interval (0, 1) into four equal parts, and number them in 
order as 1, 2, 3, 4. Then divide the square into four equal parts, as in 
Fig. 7, and number them 1, 2, 3, 4, to correspond with the segments of the 
linear interval. Next divide each segment of the straight line into four 
equal parts, and each of the four squares into four equal parts as in Fig. 8. 
The sixteen squares so formed are then numbered in order, so that each 
square has one side in common with the one next in order; the squares 
then correspond with the segments numbered in the same way. At the 
next stage there are (Fig. 9) 64 squares corresponding to 64 segments of 
the interval (0, 1). Proceeding in this manner indefinitely, any point of 

* See Lebesgue, Lemons 8ur V integration (1904), p. 44. 
f See Math. Annalen, vol. xxxvm (1891), p. 459. 
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(0, 1) is determined by the intervals of the successive sets of sub-divisions 
in which it lies. The corresponding point in the square area is determined 
by the succession of squares, each containing the next, in which it lies. 



Fig. 8 

The curve is thus determined as the limit of a sequence of polygons 
denoted by the thickened lines in the figures. The curve thus obtained is 
continuous, but has no tangent. Hilbert remarks that, if the interval 
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(0, 1) be taken as a time interval, a kinematical interpretation of the 
functional relation between the curve and the segment is that a point 
may move so that in a finite time it passes through every point of the 
square area. 

Continuous curves of this kind can be constructed by any method by 
which an everywhere dense enumerable set of points in the square can be 
made to correspond with a similar set of points in the linear interval; 
provided the functional relation x — / (/), y </> (t), in such correspondence, 
is uniformly continuous. For, when this condition is satisfied, the func¬ 
tions obtained by the method of extension of f (t), </> (/) to the remaining 



Fig. 9 


points of (0, 1) as secondary points (see § 287) will yield a correspondence 
of all the points of the square with those of the linear interval, of the 
required character. 

Another method, differing from that of Hilbert, has been given by E. H. 
Moore* and by Schoenfliesf. 

Let m be an uneven number (in the figure, m = 3); divide the linear 
interval (0, 1) into ra a equal parts, and also the square into m 2 equal parts. 
Let these squares be passed through by a polygonal line, of which the 
sides are diagonals of the squares, as in the figure; in this manner the 
squares are arranged in order 1, 2, 3, ... m 2 , and are placed into corre¬ 
spondence with the segments bearing the same numbers. At the same 

* Trans. Amer. Math. Soc. vol. i (1900), p. 77. 
f Bericht iiber die Mengerdehre t vol. I, p. 121. 
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time the end-points of a diagonal so traversed are made to correspond 
with the end-points of a segment of the linear interval. Thus m a + 1 
points in the linear interval are placed into correspondence with points 
in the square, so that to each of the m 2 4- 1 points of the linear interval 
there is one point in the square; but the converse is not the case. Next, 
divide each of the m 2 linear intervals into m 2 equal parts, and the corre¬ 
sponding squares into m 2 equal parts; then construct as before a polygon 
traversing diagonals of all the ra 4 squares, making their end-points corre¬ 
spond to the end-points of the corresponding m 4 parts of the linear interval. 



Proceeding in this manner, we gradually place points in the square, con¬ 
sisting of an everywhere dense enumerable set, into correspondence with 
a set in the linear interval which possesses the same property ; and the 
functional relation so set up is uniformly continuous. The definition of 
the functions for the whole linear interval is then obtained, as explained 
above, by the method of extension. The case m = 3 corresponds to Peano’s 
analytical method. In the method of Moore and Schoenflies, the curve is 
determined as the limit of a sequence of polygons inscribed in the curve. 
In Hilbert’s method the polygons which approximate to the form of the 
curve are not inscribed in the curve, but are otherwise determined. 




CHAPTER VI 


THE RIEMANN INTEGRAL 

329. The fundamental operation of the calculus, known as integration, 
regarded from one point of view, consists essentially in the determination 
of the limit of the sum of a finite series of numbers, as the number of terms 
of the series is indefinitely increased, whilst the numerically greatest of 
the individual terms of the series approaches the limit zero. The laws 
which regulate the specification of the terms of the series must be sup¬ 
posed, in any given instance, to be assigned, and to be of such a character 
that the limit in question exists. It is in this form that the problem of 
integration naturally presents itself in ordinary problems of a geometrical 
or physical character, such as the determination of lengths, areas, volumes, 
&c. The method of integration, so regarded, has its origin in the method 
of exhaustions employed by the Greek geometers, and was developed later 
ip. forms whose theoretical exactitude depended at various epochs upon 
the stage which the development of Analysis in general had reached. In 
the hands of Cauchy, Dirichlet, and Riemann, the definition of the definite 
integral gradually attained to an exact arithmetic form which fitted it 
for the purposes of modern analysis; and in fact the definition given by 
Riemann leaves nothing to be desired as regards precision. Riemann not 
only formulated a rigorous arithmetical definition of the integral of a 
bounded function, but also established a necessary and sufficient con¬ 
dition for the existence of the definite integral of the function. This 
definition of the definite integral was, in the latter part of the nineteenth 
century, the one that was employed in all rigorous mathematical analysis, 
but in the present century it has, for the purposes of theoretical investi¬ 
gations, been largely superseded by the more general formulation of 
Lebesgue. In accordance with Lebesgue’s definition, integrable functions, 
i.e. functions which possess a definite integral, form a class which is 
markedly wider than, and includes, the class of functions that are integrable 
in accordance with Riemann’s definition. Although the definition, of 
which the most precise formulation is that of Riemann, is primarily appli¬ 
cable only to a bounded function over a bounded interval, it was extended 
by Cauchy, Harnack, de la Vall6e Poussin, and others, to cases in which 
the function to be integrated is unbounded, and to integration over un¬ 
bounded intervals. It was also extended to the case of double, or multiple, 
integration. 

The process of integration, leading to the indefinite integral, is also 
regarded as the operation inverse to that of differentiation; and the rela- 
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tion of this mode of regarding integration with the one referred to above 
has been formulated in what is known as the fundamental theorem of the 
Integral Calculus. Many important investigations are concerned with the 
relation between the two modes of regarding integration, and with the 
establishment of the fundamental theorem, including an examination of 
the limitations to which it is subject, it is in this connection that the 
advantages of the Lebesgue Integral and its generalizations, over the 
Riemann Integral, are most apparent. 

The Riemann Integral is not only of interest from an historical point 
of view, but it still possesses great intrinsic importance in Analysis, and will 
probably continue to be the basis upon which practical applications of the 
Integral Calculus rest. Accordingly an account of the theory of Riemann 
integration will be given in the present Chapter. An account is also given 
in the present Chapter of the properties of the Riemann Integral, and of 
some of its extensions to the case of unbounded functions, or of integration 
over unbounded domains. A more complete account of these extensions 
is however included in the more general theory of Lebesgue integration, 
and its extensions, which will be discussed in Chapters vn and viii. 

THE RIEMANN INTEGRAL IN A LINEAR INTERVAL 

330. Let / (x) be a bounded function, defined for the elosed linear 
interval ( a, b ), where b>a ; so that there exist an upper boundary (J, 
and a lower boundary L, of the functional values in the closed interval. 
Let a system of nets, with closed meshes, be applied to the interval (a, 6), 
and let 8 2 (n) , S 2 (n) , ... 8 Wn (n) denote the breadths of the m n meshes of the 
net D n , of the system. Let M (8 r (n) ) denote any number so chosen as to 
be not greater than the upper boundary of the function/ (x) in the closed 
mesh 8 r (n) , and so as not to be less than the lower boundary of / (x) in the 
same mesh, and consider the sum 

Sn = V n) M (V*>) + s 2 ( "> M (8 2 <*>) + ... + 8 mn <»> M (8 win <">). 

If the sequence 8 t , S 2 , ... S n , ... be convergent, and have the same number 
S for its limit , whatever system of nets , applied to (a, b), be employed , and 
however the numbers M (§ r (n) ) be chosen , subject only to their limitation in 
relation to the upper and lower boundaries of f (#) in the meshes 8 r (n) , then 
the function f (x) is said to have a Riemann integral in the interval (a, 6), 
and the number 8 defines the value of its integral. The integral , when the 

rb 

limit S exists , is denoted by / / (x) dx. 

J a 

It will be observed that M (8) is not necessarily the value of / (#) at 
any point in the interval 8; for all that is necessary is that it should not 
be greater than the upper boundary, nor less than the lower boundary, 
of / (x) in the closed interval 8. In this respect the definition has a slightly 
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different form from that given by Riemann*, who restricted M (8) to have 
the value of f (x) at some point in the interval. 

It can easily be seen that this alteration in the form of the definition 
is unessential. Let V (8 r (T,) ), L (8 r (n) ) denote the upper and the lower 
boundary of f (x) in the closed mesh 8 r (n) ; and let {e rt } denote a monotone 
sequence of positive numbers converging to zero. A point £ r in) in the 
mesh 8 r <"> exists such that IJ ($ r {t,) ) — f (£ r in) ) < and we then have 

r nt n r - m n 

£ 8 r <"> V (S r (n) ) - £ V" 1 f (£r <n> ) <*„(&- a). 

r 1 r 1 

From this it follows that if either of the limits 

lim' £"8 r «"> U (& r in) ), lin/s\<">/(£.<">) 

r 1 n^oo r 1 

exists, then the other exists, and the two have the same value. Similarly, 
it can be seen that a corresponding result holds when L (8 r (n) ) is substi¬ 
tuted for V (8 r (yi) ), the points £ r (n) being then so chosen that 

f(£r in) ) 

r t/i n 

If then lim L 8 r (n) /(£ r (yj) ) exists, and is independent of the mode in which 

n ~oo r 1 

the points £ r (n) of the intervals 8 r (n) are chosen, and of the system of nets, 
it follows that the limits of S 8 r (y,) U (8,. <n) ) and H S r (y<) L (8 r (r,) ). and 

r 1 r~l 

r 7n» 

consequently that of L S r (n) M (8 r (w) ), exist and have the same value, 

r -=1 

independent of the particular system of nets. 

A Riemann integral will be spoken of as an 72-integral, and a function 
/ ( x ) which has an R -integral, in (a, />), will be said to be integrable (R) 
in ( a , b). 

The definition of a definite integral, of which Riemann’s definition is 
a development, was given by Cauchy for the case of a continuous function, 
Cauchy’s definition! is in fact what arises when M (8) is in every case 
restricted to be the functional value at one end of the interval 8. Thus 
it may be expressed by 

rb 

I f(x) dx = lim [(*! - a)f(a) + (x 2 - x 1 )f(x l ) + ... + {b - x n ^)f (*„_,)], 

. a 

where a, x l9 x 2 , ... b are the end-points of a set of sub-divisions of 
(a, 6), and the limit is determined under the conditions stated above, 
which involve the convergence to zero of the length of the greatest of the 
sub-intervals (a, x x ), (x 1 , x 2 ), ... b). 

* See Riemann’s Oesam. Werke, 2nd ed. p. 239. 
f Journal de Vicole polytechnique , cah. xix (1823), pp. 571 and 590. 
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THE UPPER AND LOWER RIEMANN INTEGRALS 

331. The investigation of the necessary and sufficient conditions that 
the bounded function / (x) may have an ^-integral in (a, 6), in accordance 
with the above definition, is considerably simplified by the introduction 
of the upper and lower i?-integrals of the function / (x) in the interval 
(a, b). 

If (a, ft) be any interval contained in (a, b), the upper and lower 
boundaries of / (x) in the closed interval (a, /?) may be denoted by Uf, L/ 
respectively. The upper and lower boundaries of / (#) in (a, b) are accord¬ 
ingly denoted by U a b , L b \ or simply by U and L. If the interval (a, ft) 
be denoted by a single letter 8, we may write U (8), L (8) for UJ *,. LJ * 
respectively. 

If (a, 6) be divided into any number of parts denoted by , 8 2 , ... 8 m , 

r -?n 

taken in order from left to right , the sum 2 8 r U (8 r ) has a definite lower 

r» 1 

r ~m 

boundary , and the sum 2 8 T L (8 r ) has a definite upper boundary , when all 

r *-1 

possible modes of dividing (a, b) into parts are taken into account. These 
lower and upper boundaries are defined to be the upper , and the lower *, 

R-integral of f(x) in (a, 6), and are denoted by [ f (x) dx , f / (x) dx respec- 

. a - a 

lively. 

In each of the sums, 8 f denotes the length of the interval described by 
the same letter. Since 

r-m r ~m 

2 8 r U (8 r ) £ L (6 - a), 2 8 r L (S r ) £ U (b - a), 

r“1 r-1 

it is clear that the first sum has a finite lower boundary, and that the 
second sum has a finite upper boundary. The upper and lower integrals 
consequently always exist. 

The following theorem will be established: 

If € be an arbitrarily chosen positive number , a number d can be so deter¬ 
mined that , for any net with closed meshes such that the breadths of all its 

r-m 

meshes are < d, the sum 2 8 r U (8 r ) exceeds the upper integral of the function 

r-l 

by less than €. 

* The upper integral and the lower integral are named by Jordan ‘Tintegral© par excto” 
and ‘T integrate par defaut” respectively; see his Court d' Analyse, vol. I, p. 34. They were intro¬ 
duced by Darboux, Anncdes de Vicole normale, (2), vol. rv (1875), p. 72, and also by Thomae, 
Einleitung (1875), p. 12; and by Asooli, Atti dei Lincei , (2), vol. n (1875), p. 863. 
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r - in 

f (x) dx is the lower boundary of the sum S 8 r U (8 r ), for all 

a r «1 

r m rb 

possible nets, a net D exists, such that 2 8 r U (8 r ) < / (x) dx 4- Je ; 

r - 1 -a 

where the summation is taken for the m meshes of the net D. Let d be 
chosen to be such that 2 m Ud< and consider any net for which all 
the meshes have breadths < d. Those meshes of this second net which 
contain, in their interiors or at their ends, end-points of the meshes of D 
are at most 2m in number, and the part of the sum 2 8 £7 (8), taken for 
the second net, which corresponds to these 2m meshes is < |e. All the 
other meshes of the second net are interior to meshes of />, and the part of 

7b 

2 877 (8) which corresponds to these meshes is less than j / (x) dx 4- 
Therefore the sum 2 8?/ (8), for all the meshes of the second net, is 


< 


rb 

f (x) dx -f e; and this is the case for every net the breadths of whose 

J n 


meshes are all < d. Also we have 2 8U (8) £ /(x) dx. 

•' a 

For any system of nets { D n }, the breadths of all the meshes of D n are 
< d, for all values of r?, from and after some fixed integer. We thus have 
the following theorem: 

If a system of nets, with closed meshes, befitted on to (a, b), and D n denote 
the sum 

{n) U (V n) ) + 8 2 (n) f/ (S 2 (n) ) - 4 - ... + &mn {n) U 
ivhere 8 1 (n) , 8 2 (n) , ... 8 mn (n) denote the breadths of the m n meshes of the nth 
net D n , of the system; then Sj, S 2 , ... 2 n , ... wa sequence of numbers which 
7b 

converges to f (x) dx, whatever be the system of nets employed . 

a 

rb 

The corresponding theorem for / (x) dx may be proved in a similar 

- a 

manner; or it may be deduced from the above by observing that 

| {~f(x)}dx=~l f (x) dx. 

j a a 

It can be at once deduced from these theorems that 

( f (x) dx a I f(x) dx. 

• a Jji 

For we need only consider a single system of nets. Since, in any net of 
the system, Z n £ 2) n ', where 2/ denotes the sum 

8 1 < n > L ( 8 1 (n >) + 8 2 <"> L ( 8 2 <">) 4 - ... ; 

it follows, by letting n be indefinitely increased, that the upper integral is 
not less than the lower integral. 
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If, in the sum E n , we had taken, instead of U (8), the upper boundary 
of / ( x) in the open interval 8, we should have obtained the sum 


v 


r vim _ n 

= 2 V"’ tC.'o 

r 1 


where the interval 8 r (n) is (x r „ lf x T ), and U* r r ^ 0 denotes the upper boundary 
of / (x) in the open interval 8 r (n) . 

We have E n > E n ; hence E and E, the lower boundaries of S„, 
satisfy the condition E ;> E. 

Let E rH2) be compared with E n . In any mesh 8\ of D n}p , that is con¬ 
tained in (x r _j, ity), or 8, the value of 17 (8') is - (7^'_, 0 r 0 ; unless 8' has an 
end-point at x r-l or £ r , in which case the upper boundaries of / (.r) in the 
closed and in the open interval 8 differ from one another by not more 
than U — L. Hence we have E wf „ - E„ 2 (U L) 8m w , where 8 is the 
length of the greatest mesh in As p is increased indefinitely, 8 is 

diminished indefinitely, and therefore E E n . As this holds for each 
value of n , we have E E; and since also E ^ E, we have shewn that 
E and £ are equal. It is clear that the corresponding result will hold in 
the case of the lower integral. It has thus been shewn that: 


In the definitions of the upper and lower integrals of a function the upper 
and lower boundaries of the function in an open interval 8 may be employed 
instead of the upper and lower boundaries in the corresponding closed interval . 


332. It has thus been shewn that a bounded function / (x), defined for 
the interval (a, 6), alw T ays possesses an upper, and a lower, integral in that 
interval. The necessary and sufficient condition that / (r) should have an 
jft-integral in (a, b) is that the upper, and the lower, integrals in the interval 
should be equal. It has been shewn that the upper and lower integrals 

r m-n r m n 

are the limits of E 8 r (n) U (8 r (w) ), E 8 r (n) L (8 r (n> ), for a particular system 

r * 1 r - 1 

of nets, and are independent of that system. 

That the condition is necessary follows from the fact that all the 
numbers if (8), in the sums S l9 S 2 , ..., may be made identical with U (8); 
or that all may be made identical with L (8). That the condition is suffi¬ 
cient follows from the fact that S n lies between 

r ■= ran r m„ 

E 8 r (n) U (8 r (n) ) and E 8 r <n) L (8 r (n) ); 

r -1 r 1 

and thus that, when the two latter sums have the same limit, that limit 
is also the limit of 8 n . We have therefore obtained the following theorem: 

The necessary and sufficient condition* that the bcmnded function f (x) 
may be integrable (R), in (a, 6), is the following: Let F (8 <n) ) denote the 


* See Riemann’s Gesam. Werke t 2nd ed. p. 240. 
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fluctuation U (S (r,) ) — L (8 (r?) ) o/ the function in the interval 8 (n) , which may 
he either closed or open; then it must, be possible to define a system of nets 
fitted on to (a, b), such that, for the net I ) n , the sum 

cV">i^(V r,) ) I- 8 a <»>F(8 s <»>) I ... j- S mn (n) F (& mn lti) ) 
has the limit zero , as n is increased indefinitely. 

That this limit exists, and is equal to zero, is equivalent to the state¬ 
ment that, corresponding to an arbitrarily chosen positive number a 
net D n belonging to a given system, exists, such that the absolute value of 

r m n 

2 8 r ( ">F (8 r (M) ), 

r 1 

for that value of n, and for all greater values, is less than e. 

rb 

The necessary and sufficient condition for the existence of / (x) d.r 

•' a 

may be stated in the following somewhat more convenient form: 

If any system of nets be fitted on to the interval (a, 6), and Ic be an arbi¬ 
trarily chosen positive number , the sum of the lengths of those meshes of D n 
in which the fluctuation of f (x) is - k must converge to zero , as n is inde¬ 
finitely increased. 

To see that the condition so stated is sufficient, we observe that, if s {n) 
be the sum of the lengths of those meshes of D n in which the fluctuation 

r tltn 

oi f (x) is Ic , then 2 8 r (n) F ( 8 r (n) ) * s ltl) (U I) j k (b - a — s (n) ). 

r 1 

Since s {n) ~ 0, as n ~ oo, the upper limit of the sum on the left-hand 
side is ^ Ic (b — a). Since k is arbitrary, the sum converges to zero. 

To shew that the condition is necessary, we observe that 

r m* _ 

2 K ln) F (8 r <«>) ^ ks<"> I- (b a - *<">) F > hs (n) ; 

r 1 

where F is the least of the fluctuations in all the intervals S (n) . Unless s in) 
converges to zero, it is impossible that 2,$ r {t,) F (S r (n) ) can have the limit zero. 

The necessary condition for the existence of the j?-integral may, in 
view of the first theorem of § 331, be stated as follows: 

The necessary and sufficient condition that the bounded function f (x) may 
be integrable (R) in (a, 6), is that, corresponding to an arbitrarily chosen 
positive number e, a positive number d can be so determined that , for every 

r ~m 

net, such that the maximum breadth of its meshes is < d, the sum 2 8 r F (h r ) 

r l 

is less than e. 

333. The most succinct form of the necessary and sufficient condition 
that a bounded function is integrable (R) is the following*: 

The necessary and sufficient condition that a bounded function may be 
* Lebesgue, Annali di Mat. (3) s , vol. vn (1902), p. 254. 
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integrable (R), in the interval for which it is defined , is that the points of dis¬ 
continuity of the function form a set of measure zero. 

It is convenient to express this condition in the form that the function 
must be continuous almost everywhere in the interval. 

To shew that the condition is necessary, let us consider the closed set 
G k at which the saltus to (#), off ( x ), is ^ k, where is a positive number. 
If an interval 8 contain a point of G k within it, the fluctuation of f (x) in 
8 is £ k. If a point of G k is the common end-point of two intervals, of 
equal length, the fluctuation of f (x) in one at least of these intervals is 
^ \k\ hence the part which these two intervals contribute to the sum 
X8F (8) is 2 ? |&8. If we have a net with equal meshes fitted on to (a, 6), 
the contribution of all those meshes which contain, within them or at an 
end-point, a point of G k , is not less than the product of \k into the sum 
of the breadths of these meshes. Unless the content of G k is zero, the sum 
of the breadths of these meshes is greater than some fixed positive number, 
for all the nets of a symmetrical system. It is therefore necessary for the 
existence of the ^-integral that the content of G k should be zero; and this 
must be the case for every positive value of k . The set of points of dis¬ 
continuity of the function is the outer limiting set of {G> n }, where {k n } is 
a sequence of diminishing values of k that converges to zero. It follows 
that the set of points of discontinuity of the function must have measure 
zero. 

To shew that the condition is sufficient, we observe that, if the content 
of G k is zero, all the points of G k are contained within intervals of a finite 
set the sum of whose lengths is < c. The intervals complementary to this 
finite set have a total measure > b — a — €, and at every point in each of 
them co (x) < k. In accordance with the theorem of § 234, each of these 
complementary intervals can be divided into a number of parts, in each 
of which the fluctuation is < 2k. Let this be done for each of the comple¬ 
mentary intervals. We have now a net fitted on to (a, 8), such that the 
sum of the breadths of those meshes in which the fluctuation is £ 2k is < c. 

For this net S8F (8) < e (U — L) -f 2k (b — a — e); and since k and e 
are both arbitrarily small, a net can be determined for which S8F (8) has 
an arbitrarily small value. The condition of integrability is therefore 
satisfied if, for every value of k, G k has content zero, that is, if the set of 
points of discontinuity of the function has measure zero. 

334. The following theorem* will now be established: 

* This theorem was given by de la Vallee Poussin, see his Cours d* Analyse, 3rd ed. vol. I, 
p. 254. A proof of the theorem, other than that given above, was provided by Pollard, see 
Messenger of Math . vol. xliv (1915), p. 141, where a criticism is given of de la Vall5e Poussin’s 
proof. 
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If f (x) be bounded in (a, b), and a> (x) be the saltus of f (x), at x, then 

7b rb 7 b 

f (: x) dx - / (x) dx - to ( x) dx. 

J a .'a .'a 

If f be any point in (a, 6 ), there exists an interval (f — 17 , £ {-*?') con " 
taining £ as an interior point, such that, in any interval interior to that 
interval, the fluctuation of / (;r) is less than to (£) -f e. If d be an arbitrarily 
chosen number, and A be the smallest of the three numbers \d, |ry, £77 
the interval (f — A, £ + A), or the part of it in (a, 6 ), is of length not 
greater than ri, arid the fluctuation in it is < to (f) -f e. Taking fixed 
values of the positive numbers d, e, a single such interval corresponds to 
each point £, of (a, b). Employing the Heine-Borel theorem, it follows 
that there exists a finite set of overlapping intervals, covering (a, 6 ), such 
that, if 8 be any one of them, the fluctuation of / (x) in 8 is < to (f) +«, 
where £ is a definite point interior to 8. 

Consider two overlapping intervals (a, p), (a, P'), where p' > /?, of this 
finite set. Let £, (' be the two definite points interior to these intervals. 
If neither £ nor £' is in the part (a, p), common to the intervals, we can 
replace the two intervals by the non-overlapping intervals (a, ft), (ft, ft'). 
In the first of these the fluctuation of / (£) is < to (£) -f- e, and in the second 
it is < to (£') + €. If (a, p) contains one of the two points £, f', say 
we replace the intervals by (a, a') and («', P'), which have the same pro¬ 
perties as in the last case. If (a, P) contains both the points £, f', let 
to (£) > co (£'); we then take (a, f), (£, p') in place of (a, P) and (a', p'). 
In (a, £) the fluctuation is < to (£) -f e, and in (£, /?') it is also < to (£) + e. 
We can proceed in this manner with any pair of intervals that overlap 
one another, and we finally obtain a net, fitted on to (a, b), such that, in 
any mesh 8 , the fluctuation of the function is < to (£ s ) 4 - e; where is a 
definite point within, or at an end of, the mesh 8 ; moreover the breadths 
of the meshes do not exceed d. If w ( 8 ) be the upper boundary of to (x) 
for all interior points of the mesh 8 , there is such a point at which* 
to (x) > to (8) — p ; where p is arbitrarily chosen. Therefore the fluctuation 
in the mesh is > to ( 8 ) — p, and hence is £ to ( 8 ), since p is arbitrary. It 
follows that, for the net, S 8 E ( 8 ) ^ E 8 &> ( 8 ). Taking a sequence of values 

7b 

of d that converges to zero, 28co ( 8 ) converges to to (x) dx; hence 

J a 

[ / (x) dx — [ f (x) dx ^ l to (x) dx. 

J a J a J a 

Again, the fluctuation in 8 is < co (f) + e; and hence, for the net, 

28 F (8) < 28co (£) + e (6 - a). 
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7b 

It follows that lim D8F (&) cannot exceed oj (x) dx + e (b — a); and 
d~0 J a 

since c is arbitrary, we have 

7b rb 7b 

\ f(z)dx - f f ( x) dx s to (#) dx. 

J a -a J a 

It has thus been proved that 

T b . [ b 7 b 

f (x) dx — / (x) dx = to (x) dx. 

J a Jja J a 

The theorem of § 333 can be deduced from this result. For the con¬ 
dition t to (x) dx = 0, is equivalent to the condition f to (x) dx — 0; and 
J a J a 

unless the set of discontinuities of the function has measure zero, for some 
value of k (> 0) the set O k must have positive content, in which case it 
is impossible that the integral can have the value zero. 

PARTICULAR CASES OF FUNCTIONS THAT ARE 1NTEGRABLE (E) 

335. The following classes of bounded functions satisfy the condition 
of integrability (J?) which has been expressed in various forms above. 

(1) . All functions which are continuous in the intervals for which they 
are defined. 

(2) . All functions with only a finite number of discontinuities, or with 
any enumerable set of discontinuities. 

(3) . Monotone functions, and all functions with bounded variation. 

For, as has been shewn in § 243, the points of discontinuity of a 
function with bounded variation form an enumerable set. 

(4) . Generally, every point-wise discontinuous function which is such 
that the closed set of points for which the saltus is ss k has content zero, 
whatever positive value & may have. 

Dini* has given the theorem that a function is integrable ( E ), if at all 
points where the discontinuity is of the second kind , it is so for all such points 
only on one and the same side of the point; and at these points the function 
may be continuous on the other side , or may have ordinary discontinuities on 
that side. In particular, any function which has only ordinary discontinuities 
is integrable (E). 

To prove this we observe that it has been proved in § 239 that, for 
such a function, the set of points for which the saltus is £ k has content 
zero, whatever positive value k may have. Therefore the condition of 
integrability is satisfied. 


* See Grundlagen, p. 335. 
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Riemann’s definition of an integral, and the condition for the existence 
of the integral, are applicable, without essential change, to the case of a 
function which, for particular values of the variable, has indeterminate 
functional values lying, in the case of each such point, between finite 
limits of indeterminancy. At each point of indeterminancy of the function, 
it is immaterial whether the function be capable of having all, or only 
some, values between the limits of indeterminancy; thus there is no loss 
of generality, if the function be regarded as having two values only at 
each such point, viz. the two limits of indeterminancy at the point. In 
estimating the fluctuation of the function in a prescribed interval, the 
upper boundary is found by taking the upper limits of indeterminancy of 
the function at the special points as functional values at those points, 
whilst the lower boundary is found by taking the lower limits of indeter¬ 
minancy at the special points as the functional values at those points. 
As in the case of a function which is every^where single-valued, the saltus 
at any point is defined as the limit of the fluctuation in a neighbourhood 
of the point, when that neighbourhood is diminished indefinitely. The 
conditions of integrability are exactly the same as for a function which 
is everywhere single-valued, viz. that the function be bounded in its 
domain, and that the set of points of discontinuity of the function have 
measure zero; or, in other words, that it be bounded, and continuous 
almost everywhere, in the domain. 


EXAMPLES 

1. Riemann’s function f (x) ------ + ••• + + ..., where (x) denotes the posi¬ 

tive or negative excess of x over the nearest integer, and (a;) — 0 when x is half-way between 
two integers, has been shewn in Example 2, § 240, to be point-wise discontinuous, with all 
its discontinuities ordinary ones, and everywhere dense in the interval (0, 1). Since all the 
discontinuities are ordinary ones, and the function is bounded, f {x) is integrable in (0, 1). 

2. Let / (x) bo defined for the interval (0, I) as follows: —If x be irrational, let/ ( x ) — 0; 
if x — yjq, where y/q is in its lowest terms, let f (x) — \jq; also let / (0) —/(1) =0. This 

function is an integrable point-wise discontinuous null-function; thus j f {x) dx = 0. There 

is only a finite number of points at which the functional value exceeds an assigned positive 
number. 

3. Let/(x) — 0, for all rational values of x; and/(x) = 1, for all irrational values of x. 
This function is not integrable (R) in any interval, for it is totally discontinuous. 

4. Let / (x) be defined* for the interval (0, 1) as follows:—For ^ < x ^ 1, let / (x) = 1; 

for ™ let/(x) = for ^ < x , let/(x) = and generally, for 

2»+i < * S let /(*> = and /(°) = °- 


Dini-Luroth, Qrundlagen, p. 344. 
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This function is integrable, and 


f 

J o 


f(x)dx 

1 


fvm and . • 
2 m 2 ml 


2 w- i 3 2 2m ~ 2 

1 


, where x is between 


GEOMETRICAL INTERPRETATION OF RIEMANN INTEGRATION 

336. Let / (#) be a bounded function, defined for the interval (a, b), 
of which the values are all £ 0. Associated with the function there exists 
a plane set of points (#, y) consisting of all the points of which the co¬ 
ordinates satisfy the conditions a :i x & b, 0 &y&f(x). In accordance 
with Jordan’s theory of measure of sets of points (see § 142), this set has 
an exterior extent, and an interior extent; and the set of points is measur¬ 
able (J) when the two have the same value, in which case their common 
value is the extent, or measure ( J ), of the set. The extent of a two-dimen¬ 
sional set of points may be regarded as a generalization of the conception 
of area; thus in general, the exterior extent and the interior extent may 
be spoken of as the exterior area and the interior area of the space bounded 
by the axis of x, the two straight lines x — a, x = b, and the “curve” 
defined by y = / (a;). This set of points G has an area, in the ordinary 
sense, when the exterior area, and the interior area, are equal; in which 
ease the function y — f (x) is said to be quadrable in the interval (a, b). 

If a fundamental rectangle be taken, which contains the plane set in 
its interior, we may fit on to this rectangle a system {/)„} of nets, with 
closed meshes. Consider those meshes of D n which are such that every 
point of each mesh is an interior point of G. If the segment (a, 6), on the 
rr-axis, be divided by means of the boundaries of the meshes of D n into 
intervals 8 2 , § 2 , ... , we see that the sum of the measures of those meshes 

» 

of D n which consist entirely of interior points of G is SSL (S) -f rj n , where 

rj n is a number which converges to zero, as n is indefinitely increased. 
Similarly, the sum of those meshes of D n each of which contains at least 

n 

one point of G> or of the boundary of G is (8) + p n) where p n ~ 0, 
as n — oo. 

7b 

It thus appears that ) / (x) dx is the exterior extent of G, and that 

. a 

fb 

I / (x) dx is the interior extent of G. 

J a 

~ rb 

If | f (x)dx exists as a definite number, the plane set G is measurable 

a 

(J), and the value of the integral measures the area bounded by x = a, 
x = y = 0, y= / (x). 
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It has been shewn, in § 142, that the condition for the measurability 
(«/), of the set G , is that the frontier of G should have the plane measure 
zero. It is clear that, in any case, the plane measure of the three rectilinear 
portions of the boundary of G is zero; thus the condition for the existence 
of the integral is that the set of boundary points which consists of points 
on the “ curve ” y = / (x), or of limiting points of points on the curve, shall 
have plane measure zero. This condition is equivalent to the condition 
that the linear measure of the set of points of discontinuity of the function 
/ (x) in the linear interval (a, b) is zero. 

In case the bounded function / (x) is not everywhere < 0 in (a, b ), we 
may take / (a;) -- (x) — / 2 (x) ; where f x (x) f (x), for all values of x for 

which/ (x) _• 0, and/i (x) —- 0, for those values of x for which f(x) is negative. 
In case the two sets of points (x, y) for which a ^ x s b, 0 % y s ( x ), 

rb 

and a < x ^ b, 0 ^ y < f 2 (x) are both measurable ( J ), the integral I / (x) dx 

J a 

is the excess of the measure of the first set over that of the second set; 
and this may be interpreted as the excess of that part of the area defined 
by x a, x — b, y f (x) which is above the x-axis, over that part which 
is below it. 

If the two plane sets of points be not measurable (./), the exterior and 

interior extents of the first set are [ /, ( x)dx , [ /, ( x)dx ; and those of the 

• « 

7° f b 

second set are / 2 {x) dx , I f 2 (x) dx respectively. The upper integral 

• a L. a 

f / (x) dx is then the excess of the exterior extent of the set a s x ^ 6, 

a 

0 s y ^ f Y (x), over the interior extent of the set a < x £ 6, 0 %. y £ f 2 (x); 

rb 

whilst the lower integral I / ( x) dx is the excess of the interior extent of 

la 

the first set over the exterior extent of the second set. 

The condition that / (x) may be integrable (E) is that the frontier 
which consists of the set of points a ^ x :^b, y = / (x), when dosed by 
adding the limiting points, has its plane measure zero. 

If a linear set of points H be defined on the x-axis, and lie in the 
interval (a, 6), a function </> (x) may be defined by the rule that <f> (x) = 1, 
if x be a point of //, and </> (x) — 0, if x be a point of C (H). The set H 
has an exterior linear extent, and an interior linear extent, which are 
7i> rb 

given by I <f> (x) dx, / </> (x) dx, respectively, as may be seen by referring 

a mm 0 

to the definitions. For it is easily seen that the exterior and interior linear 
extents of H are numerically equal respectively to the exterior and interior 
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plane extents of the plane set a x z 6, 0 ^ y ^ <f> (x). When H is measur¬ 
able (J) the function <f> (#) is integrable ( R) in (a, b), and f </> (x) dx is 

J a 

the measure (J) of H. This measure may be regarded as a generalization 
of the notion of the length of a linear interval. The condition that the 
linear set H is measurable (J) is that its frontier, which consists of those 
points of H that are limiting points of C (//), and of those points of C (H) 
that are limiting points of H, should have the measure zero. 

PROPERTIES OF THE DEFINITE RTEMANN INTEGRAL 

337. We proceed to consider the properties of the integral f f (x) dx , 

J a 

of a bounded function / (x), defined for the interval (a, b), which is such 
that the condition for the existence of the /£-integral is satisfied. 

(1) . The integral [ f (x) dx is defined as the value of — j f (x) dx. 

(2) . If f (#) be integrable (R) in (a, b), so also is | f (x) |, and 

[ / (x) dx j s f I / (*) I dx. 

For the fluctuation of | / (x) | in any interval 8 cannot exceed that of 
/ (x) in the same interval; hence, if 8F (8), for a system of nets, have the 
limit zero, when F (8) is the fluctuation of / (#) in 8, it has also the limit 
zero when F (8) denotes the fluctuation of | / (x) [; and thus the latter 
function is integrable (R). Again U 9 the upper boundary of / (x) in 8, 
cannot numerically exceed £/', the upper boundary of | / (x) j in the same 
interval; thus | Sf78 | £ Zf/'8, and hence the absolute value of the limit 
of Zf/8 is < that of 1,U'8. 

(3) . If f (x) be integrable (R) in (a, b), f (x — h) is integrable (R) in 

rb rb+h 

(a — h, b + h), and f (x) dx — / (x — h) dx. 

■a .a+h 

This follows at once from the definitions of the integrals as the limits 
of sums. 

(4) . If the values of the integrable function f (x) be arbitrarily altered at 
each point of a measurable set of points G, the new function (x) so obtained 
is integrable (R), provided it be bounded , and also the measure of the deri¬ 
vative G' of the set be zero. 

For the only points of discontinuity of (x) which are not points of 
discontinuity of / (x) are points of G, or of G', and therefore form a set of 
measure zero; hence all the discontinuities of <f> (x) form a set of points of 
measure zero, and <f> (x) is therefore integrable (i?), provided it be bounded. 
In particular, the theorem holds for any reducible set G. 
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Also , if <f> (x) = / (x), at all points belonging to a set which is everywhere 
dense in (a, 6 ), then , provided cf> (x) be integrable (JR), its integral is identical 
with that of f (x). 

For, in the finite sum X 8 iif ( 8 ), we may take the value of M ( 8 ) in any 
interval 8 to be one of the values which the two functions / (x) f </> (x) 
have in common in that interval; hence the sums may all be chosen so as 
to be the same for the two functions. Thus, if the functions be both 
integrable (R), their integrals are identical. 

(5) . A function f (x) which is integrable (R), in (a, b), is also integrable 
(R) in any interval («, )5) contained in (a, b). 

For the measure of the set of points of discontinuity of f (x) in (a, 6 ) 
being zero, the measure of the set of those points of discontinuity which 
are in (a, ft) is also zero, and thus the function is integrable (R) in (a, /3). 

rf c is any point in (a, b ), we have 

[ / (x) dx — I / (x) dx + f / (x) dx. 

J a J a J c 

For the two integrals on the right-hand side both exist; also a system 
of nets can be so chosen that the point c is always an end-point of two 
of the meshes of each net. If this be done, the sum DSJf ( 8 ), for (a, b), 
may be divided into two parts, one of which contains all the intervals on 
the left of the point c, and the other all those on the right of that point; 

thus 2.8M ( 8 ) = S, 8 Jf ( 8 ) + S 2 8 j»/ ( 8 ). 

The limits of the three sums are the three integrals off (x) in (a, b), (a, c), 
and (c, b) respectively; thus the theorem is established. 

( 6 ) . If* /u/ 2 ,/ 3 , ... / n be a finite number of bounded functions , each of 
which is integrable (R) in (a,b), and if F (/ l5 / 2 , ... f n ) be a continuous func¬ 
tion with respect to (/ 1? / 2 , .../ n ), then the function F is integrable (R) in 
(a, b). 

For the only points of discontinuity of the function F (x) are those 
of the functions f x (#), / 2 (#), ... f n (x); hence the set of points of discon¬ 
tinuity of F (x) has measure zero; and thus F (x) is integrable ( R ), since 
it is also a bounded function. 

Important particular cases of the general theorem are the following: 

(a). If / (x) = f x (x) -f / 2 (#) + -.. f f n (%), where all the functions f r (x) 
are integrable (R), then / (x) is integrable (R). 

f b r-n rb 

f (x) dx D f r (x) dx. For, in any 

a r - 1 a 

* Du Bois-Reymond, Math . Annalen, vol. xx (1882), p. 123. See also W. H. Young, Quarterly 
Journal of Math. vol. xxxv (1904), p. 190. 
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interval 8, the upper boundary of / (x) cannot exceed the sum of the upper 
boundaries of the functions f r (x). From this we see that 

I / (x) dx s 2 1 f r (x) dx, 

■'a 1 - a 

rb n rb 

and similarly we have / (sc) dx s £ f r (x) dx. 

Ja l a 


( b ) . If f (x) (sc)./ 2 (x) ...f n (x), where all the functions f r (sc) are 

integrable (It) in (a, b), then / (x) is also integrable (B) in (a, b). 

(c) . If / (sc), <f> (x) be integrable (B) in ( a , b), and | cf> (x) | always exceeds 

f (sc) 

some fixed positive number A, so that , \ \ is a continuous function of 

4 > (*) 

(/,<£), then is integrable (P) in (a, b). 

9 (x) 

(7). If two functions /+ (a;), / (x) be defined as follows: — Let /+ (x) — / (sc) 
for all values of x such that f (x) > 0, and let /+ (x) = 0, wAen / (sc) s 0; Zetf 
/- (a:) = — / (sc) /or all values of x such that f (x) < 0, and f~ (x) — 0, when 
f (sr) 5- 0; then if f (x) be integrable (B) in (a, b), the functions / + (sc), f~ (sc) 

rb rb rb 

are integrable (B) in (a, b), and f (x) dx — / f (sc) dx — / (sc) dsr. 


For the fluctuation of /+ (sc) in any interval 8 cannot exceed that of 
/ (sc) in the same interval; hence, since 2)8 F (8), for/ (sc), has the limit zero, 
the corresponding sum for /+ (sc) has the limit zero; and thus /+ (sc) is 
integrable. In a similar manner it can be shewn that f~ (sc) is integrable. 

Since/(sc) =/+ (x) — f~ (sc), we see from (6) (a) that 
f / i x ) dx - f /+ (x) dx - [ /- (sc) dsc. 

It should be observed that it is not in general true that, if / (x) be 
integrable in (a, b), and be expressed as the sum f x (x) + / 2 (sc), of two 
bounded functions, then f x (sc), / 2 (sc) are also integrable in (a, 6). For it is 
clear that, / (sc) being given, we may take for f x (sc) any arbitrarily defined 
non-integrable function, then/ 2 (sc) is also determinate and non-integrable. 

(8). Iff (sc), (f> (sc) be both integrable (B), and be such that \ f (x) \ < \ <f> (x) \ 

I f b I 

for every value of sc, then I / (sc) dx < | <f> (sc) | dx . 

• j a I J a 


In particular, if (x) is constant , and, equal to P, the upper boundary of 
rb 

| / (sc) | in (a, 6), then j / (sc) dx ^ P (b — a). 

b a 

For / {| </> (sc) j — | / (sc) J} dsc is ^ 0, since, in every interval 8, no value 
J a 

of | ^ (sc) | — | / (sc) | is negative, and thus the sums of which the integral 
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is the limit are all s 0. Also from (2), we have I f (x) dx ■» | \f{x)\ dx, 

. a J a 

rb 

and this is ^ | <f> (x) | dx. The particular case follows by assuming 

<f>(x)~P. 

If U, L denote the upper and lower boundaries of f (x) in (a, b), then 

L (b — a) <, f f (x) dx c U (b — a). 

J a 

For £8 £7 (8), D8L (8) each lies between £/Z8 and L£8, or between 
U (b — a) and L (b — a) ; the same must hold of the common limit, which 

rb 

is the integral f (x) dx. 


(9) . If rj ... r) n , ...bean enumerable set of non-overlapping intervals 
contained in (a, b), in descending order of length , then the sum of the integrals 
of f ( x) taken over ^ , r) 2 , ... rj n , converges to a definite finite limit , as n is 
increased indefinitely; f (x) being a function which is integrable (R) in (a, b). 

Let us denote by S n the sum of the integrals of / (x) taken over the 
intervals rj l9 y] 2i ... r) n . Since 171 + 172 + ••• + increases with n , and is 
always less than b — a, it has a definite limit as n is increased indefinitely; 
we can therefore choose n so great that t 7„ h1 + rj n ^ 2 + ... + 7] n+m < e, for 
every value of m, where e is an arbitrarily chosen positive number. With 
this value of n, we see that | S n+m — S n | < e.P, where P is the upper 
boundary of | / (x) | in (a, 6 ). If 17 be an arbitrarily chosen positive number, 
we can choose € such that e < 17 jP; thus n can be so chosen that 

I S n | < 17 , 

and hence S n has a definite limit, as n is increased indefinitely. 

( 10 ) . If /i (x),f 2 (x), ... f n (x), ... be a sequence of functions, defined in 
the interval (a, b) y and / (x) be such that, for all values of x in (a, 6 ), 

I / (*) - fn (x) | < e, 

where «• is an arbitrarily chosen positive number, provided n is ? a fixed 
integer n e , dependent on c; the sequence {/ n (x)} is said to converge uni¬ 
formly to f (x) y in the interval (a,b). 

If a sequence of functions {f n (x)}, all integrable (R) y in the interval (a y 6 ), 
converges uniformly, in that interval , to the bounded function f (x), then f (x) 
is integrable (R) y and 

( f (x) dx — lim I f n (x) dx. 

J a n~oo . a 


At almost every point of (a, b) f that is, almost everywhere in (a, 6) 
(see § 333), all the functions of the sequence {/ n (x)} are continuous; for 
the set of all those points at which any of them are discontinuous has the 
measure zero. Let f be a point at which all the functions f n (x) are 
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continuous; if (f — h, £ -f- h) is a neighbourhood of the fluctuation of (x) 
in (f — h, £ -f h) is < e, if h be sufficiently small; the integer n< having 
been so chosen that \f(x) —/„ e (x) | < e, in (a, b). It follows that the 
fluctuation of / (x) in (£ — h, £ + h) is < 3e. Since e is arbitrary, £ is a 
point of continuity of / (x) ; therefore / (x) is continuous almost everywhere 
in (a, b), and is therefore integrable (i?). Also 

If f(x)dx-\ fn e (x) dx\< € (b - a); 

I.a -a I 

and therefore f / (x) dx = lim f f n (x) dx. 

J a n~oo - a 


iK-INTEGRALS OF FUNCTIONS OF TWO OR MORE VARIABLES 

338. The Riemann definition of the integral of a bounded function, 
defined for an interval (a, 6), may be extended at once* to the case of a 
bounded function/(x (1) , x (2) ) defined in a rectangular cell (a (1) , a l2) ; 6 (1) , b (2) ), 
or more generally to the case of a bounded function/ (x U) , x i2) , ... x ip) ), of 
p variables, defined in a ^-dimensional cell (a {1) ,a {2) ,... a (p) ; b a) , b (2) , ...b (p) ). 

In the definitions given in §§ 330, 331, we have only to consider 
systems of nets in two, or in p -dimensions, instead of linear systems. 

Thus the upper and lower integrals! of / (x (1) , x {2) ) in the cell (a (1) , a [2) ; 
b (l) , b {2) ) may be denoted by 

b <2> ) 

f(x^\x^)d(x^\x^), 

J a< 2 )) 

r(6< l >, 6<*)) 

f(x"\ X (2) )d(x (1) , x (2) ), 

a<*>) 

which represent the limits of the sums of the products of the areas of the 
meshes of a net D n into the upper, or the lower, boundaries of the function 
in the corresponding meshes. That these limits exist, and are independent 
of the particular system of nets, is proved exactly as in § 332. 

When the upper and lower integrals have equal values, f(x a) ,x {2) ) 
is integrable (E) in the fundamental cell, and their common value is 
denoted by r(&< i >, *><*)) 

f(x^\x^)d(x^,x^). 

J (a* 1 ), a <*>) 

* This extension was made by H. J. S. Smith, Proc. Land. Math. Soc. (1), vol. vi (1875), 
p. 152, and also by Thomae, EirUeitung in die Theorie der bestimmten Integrale (1875), p. 33; 
also Schlomilch's Zeitschr. vol. xxi (1876), p. 224. 

f See Jordan’s Coura cTAnalyse , vol. i, p. 34. An elaborate treatment of double integration 
has been given by Stolz, Orundziige , vol. in, where the triangle or polygon is employed in relation 
to the measure of sets of points, instead of the rectangle. On this matter see Schoenflies, Bericht, 
vol. i, p. 179. 
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The necessary and sufficient condition that / (x (1> , x {2) ) is integrable ( R ), 
viz. that the plane measure of its points of discontinuity should be zero, 
or in other words, that the function should be continuous almost every¬ 
where in the cell, is established in the same manner as in § 333, the proof 
requiring only a slight modification. 

The integral 

r(feO>. bM) 

f (x (1 >, x< 2) ) d (x (1) , x< 2 >), where / (x* 1 *, x< 2) ) z 0, 

J (av\ a<*>) 

in the cell is the measure (J) of the three-dimensional set of points 
(x (1) , x (2) , x (3) ) defined by 

a a) s x (1) > b {l) ; a < 2 > s x< 2 > ^ 6 (2) ; 0 5 x< 3) zf (x (1) , x< 2) ), 
it being assumed that this measure (J) exists. 

A function / (x (1) , x (2) ) which takes both signs in the fundamental cell 
is the difference of two functions fa (x (1) , x (2) ), f 2 (x (1) , x (2) ), both of which 
are ^0. If the frontiers of both the three-dimensional sets 

« (1) x (1 > s 6 (1) ; a (2) s x (2) * b i2) ; 0 s x (3) ^ fa (x (1 >, x (2) ), 

a (1) s x (1) s b «>; a (2) ^ x < 2 > 6 < 2 >; 0 s x (3) s fa (*«>, x< 2 >), 

have their three-dimensional measures zero, the function / (x (1) , x {2) ) is 
integrable (12) in the cell, and the integral is the excess of that of fa (x (1) , x {2) ) 
over that of fa (x (1) , x {2) ). 

The integral 

f(6(0, *>(*), ...*><*») 

f{x (1 \x' 2 \ ... x^)d(x"\x' 2 \ ... X<*>) 

<*<*>, ■ aW) 

may be considered in a precisely similar manner, and the condition for its 
existence is an extension of that for the cases j> =• 1,2. 

339. Next, let G be a bounded set of points in plane space; it is 
therefore contained in the interior of a fundamental rectangular cell 
(a (1) , a (2) ; 6 (1) , b (2) ). Let us suppose that a bounded function / (x (1) , x (2) ) 
is defined for all the points of G. Let / (x (1) , x (2) ) be defined in the funda¬ 
mental cell by the conditions 

/ (x (1) , x (2) ) = / (x (1) , x (2) ), at all points.of G , 
and / (x (1) , x (2) ) = 0, at all points of C (G). 

If we consider, for the function f (x (1) , x (2) ), the sum S3 {U (3) - L (3)}, 
for those meshes of the plane net D n , fitted on to the fundamental cell, 
which contain a point of G and also a point of C (G) y we see that, for all 
such meshes, U (3) — L (3) > 0, unless U (3) and L (3) are both zero, since 
there is at least one point in the mesh at which / (x (1) , x (2) ) — 0. Thus 
S3 {{7 (3) — L (3)}, taken for all such meshes, is > 0, unless / (x (1) , x (2) ) = 0 
at every point of all these meshes. 
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C<2)) 
r (2)\ 


Unless / (x ll) y x i2) ) = 0, at every point of the frontier of 6r, with the 
possible exception of a part of G of which the measure (J) is zero, the 
limit of £8 {U (8) — L (8)}, as n ~ oo, will not be zero unless E8, taken for 
all meshes that contain points on the frontier of G f converges to zero, as 
n ~ oo. Thus it follows that the upper and lower integrals of j (,z (1) , x v ‘ 
in the fundamental cell will have unequal values, and thus that f (x a) y x Vt 
is not integrable (R), unless, either the set G is measurable (J), or 

/(«<!>, *< 2) ) - 0 

at all the points of its frontier with the possible exception of a part, of 
which the measure (J) is zero. 

We define the R -integral 


I /(%' 

ha) 


i d (x ( 


1) r (‘2)\ 


f 

J (a 


off (x U) , x (2) ) over the set G , to be the value of 
r(b( l \ 6 < 2 >) 

/ (x n) , X {2) ) d (# (1 \ x (2) ), 

a<*>) 

when the latter integral exists, which we have seen is only, in general, 
the case when the set G is measurable (J). 

In the definition of the .^-integral of a bounded function / (x a) , x {2) ), 
defined for a bounded set G, it will accordingly be assumed that G is 
measurable ( J ); and then the integral exists when the function that is 
equal to / (x {1) , x (2) ) at all points of G y and is elsewhere zero, has an 
R -integral in a cell that contains G. 

b<v) 


An integral 


/, 


(a* 1 ), 


/(:r (1) , x< 2 >)d(x {1 \x (, ‘ 


or 


f /(* 

ho) 


(1) t { 2 ) 


) d (x {] 


% 


is, in accordance wdth tradition, spoken of as a double integral, although 
it is defined as the single limit of a finite sum, and accordingly the sign 
of integration is here employed only once. The term ‘‘double,” in the 
name double integral, must be taken to have reference to the two-dimen¬ 
sional set of points for which the function is defined. A similar remark 
applies to the case of a p-fold integral. The above notation is such that if 
(x (l) , # (2) ), or (a; (1) , x {2 \ ... x ip) ), 

be denoted by a single letter x, a double, or ^-fold, integral may be 
denoted by r b 

f (x) dx . 
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independently of the number of dimensions of the space in which the 
points a, 6, x lie. This is in accordance with the parity of the properties 
of the integral in any number of dimensions with those of the R -integral 
in linear space. 

340. With slight adaptation, the properties established in § 337 are 
applicable to the case of functions of two or more variables. The notation 
will be the same as for functions of one variable, provided the single 
letter x is employed as typical of (x (1) , x i2 \ ... x (p) ) in space of p-dimensions. 

The definition in (1) of ( f (x) dx 

h 

rb 

as the value of — / (x) dx 

. a 

holds for a cell (a, b). The proof of the property (2) is unaltered by em¬ 
ploying 8 for a cell, instead of a linear interval. The property (3) is 
unaltered essentially; it becomes, for the case p — 2, 
r(b< l \ &<*>) 

f (x (1 \ x {2> ) d (x {1) , x l2) ) 

f (x^ - h a) , x (2) - h (2 >) d (x™ 9 x < 2 >). 

J (aC)+W»), aW+hW) 

The property (4) holds with its form unaltered. The property (5) holds 
for a cell (a, ft) contained in the cell (a, b). 

If c is a point in the cell (a, 6), the cell (a, b) is divided into 2 P cells by 
the (p — 1)-dimensional planes through the point c, and 

f f(x)dx 

' a « 

is expressed as the sum of the integrals over the 2 V cells into which (a, 6) 
is so divided. 

The properties (6), (7), (8) remain unaltered, and (9) holds for an 
enumerable non-overlapping set of cells in the cell (a, b). 

A more general form of (5) which holds for jo-ple integrals is the 
following: 

If O be a bounded set that is measurable (J), and if Q be the sum of two 
sets O x and Cf 2 , both measurable (J), then 

f / (x) dx = I / (x) dx + I f(x)dx. 
ho) J(O t ) .'(O,) 

To prove this theorem, let K be the set of points of O at which the 
saltus of f (x) is ^ k; then K has two components, K x a part of O l9 and 
K 2 a part of 0 2 . The only points at which the saltus of f (x), considered 
as defined in Q x only, and zero elsewhere, is £ Jc, consist of those points 
of K l that are interior to O l9 together with points forming a set K x on 
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the frontier of G x and G 2 . Since the content of this frontier is zero, the 
measure (J) of K x ' is zero; and the measure (J ) of the set of points of K x 
interior to G x is also zero, since K x has the content zero. Since the measure 
(J) of the set of points at which the saltus of / (x) regarded as defined 
in G x , and elsewhere zero, is zero, it follows that / ( x) is integrable ( E) 
in G x ; and similarly it can be shewn to be integrable ( E) in G 2 . The integral 

I f ( z ) dx 
j (V) 

is, by definition, the limit of the finite sum 

8 X M (8 X ) 4- 8 2 M (8 2 ) -I- ... 4- 8 m M (8J 
taken for a net fitted on to the fundamental cell, or interval. The meshes 8 
consist (1), of those which contain interior points of G x only, but no points 
on the frontiers of G x and G 2 , (2), of those which contain interior points 
of G 2 only, but no points on the frontiers of G x and G 2 , and (8), of those 
which contain points of G x and G 2 which are not interior points of either 
set, but are points on the frontiers of G x and G 2 . The above sum may be 
divided therefore into three portions containing those meshes 8 which 
respectively belong to (1), (2), and (3). The limit of the first of these sums is 


that of the second is 


I f (*) dx, 

J (04 

f / (*) dx, 

J (Ot> 


and that of the third is at most numerically equal to U multiplied by the 
content of the points on the frontiers of G x and G 2 ; where U denotes the 
upper boundary of | f (x) | in G. Since the contents of the frontiers are 
zero, tfye limit of the third part of the sum is zero. The theorem has 
accordingly been established. 


INTEGRABLE NULL-FUNCTIONS AND EQUIVALENT INTEGRALS 

341. If f (x) be integrable (E) in the interval, or cell, (a, 6), and be 
such that in every interval, or cell, contained in (a, 6), its integral is zero, 
then / (x) is said to be an E-integrable null-junction. 

The necessary and sufficient condition that a bounded function f (&) may 
be an E-integrable null-function is that f (x) = 0, almost everywhere in its 
domain . 

Let G k denote the set which contains the points x at which 

I / (*) i * k , 

where k is any positive number. 

To prove the sufficiency of the condition, let I) n be a net with closed 
meshes fitted on to the interval, or cell, (a, 6), and let 28' denote the sum 
of the measures of those meshes of D n which contain at least one point 
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of G k . If the condition of the theorem is satisfied, 218' has the limit zero, 
as n ~ oo. The sum 2184/ (8) employed in § 330 is in absolute value 

< P218' + (A - 28') k, 

where A is the measure of the interval, or cell, (a, b)> and P is the upper 
boundary of | / (x) | in (a, b). If 28' converges to zero 

| 28J/ (8) | - Ak 

is less numerically than the arbitrarily chosen positive number c, for all 
sufficiently large values of n. If k be chosen < e/A, | 2184/ (8) | < 2e, for 
large enough values of ti, It follows that lim 284/ (8) = 0, and therefore 

oo 

f (x) is integrable (R) in (a, b), and its integral has the value zero. The 
same argument applies to any integral, or cell, (a, ft) contained in (a, b). 

To shew that the condition is necessary, let it be assumed that / (x) 
has, in every interval, or cell, contained in (a, b ), an A*-integral that 
vanishes. At any point x 1? at which / (x) is continuous, / (x x ) must be zero. 
For let / (Xj) have, if possible, the positive value c ; then a neighbourhood 
of x x can be determined such that, at every point of it, / (x) lies between 
c -t e and c - e, where e is an assigned positive number < c. The integral 
of f (x) over this neighbourhood, which we take to have the measure A, 
is s A (c — e) > 0, contrary to the hypothesis. In a similar manner it can 
be shewn that/(x x ) cannot be negative; therefore/ (x x ) = 0. Since f (x) 
is continuous almost everywhere in its domain, it follows that / (x) has 
the value zero, almost everywhere, that is, with the possible exception of 
points belonging to a set of measure zero. 

342. Two functions / (x), </> (x) both integrable (A), which have their 
integrals equal, when taken over any interval, or cell, contained in the 
fundamental interval, or cell, (a, 6), must differ from one another by an 
A-integrable null-function. 

If / (x) be integrable (R), and consequently point-wise discontinuous, 
and ip (x) be that function defined, as in § 241, by extension of the function 
which is defined only at the points of continuity of / (x), and has at those 
points the same functional values as / (x) itself, then ip (x) is integrable (R), 
although it is in general multiple-valued at the points of discontinuity of 
/ (x). It has been explained in § 235 that Riemann's definition is applicable 
to such a function as ip (x). That ip (x) is integrable (R) follows from the 
fact that its points of discontinuity form a set of which the measure is 
zero. The function / (x) — i p (x) is zero at the points of continuity of / (x), 
and is discontinuous only at the points of discontinuity of / (x), which 
form a set of measure zero. Consequently / (x) — ip (x) is an A-integrable 
null-function, and the two functions have equal integrals in any interval, 
or cell, for which/ (x) is defined. It has thus been shewn that: 
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A function f ( x) that is integrable (R) in an interval , or cell , is the sum of 
an R-integrable null-function and of the function i/j ( x) obtained by extension 
of the function defined by the values of f (x) at its points of continuity . 


THE FUNDAMENTAL THEOREM OF THE INTEGRAL CALCULUS 


343. The fundamental theorem of the Integral Calculus asserts that 
the operations of differentiation and of integration are in general inverse 
operations. Before we proceed to consider the conditions under which 
this is the case, the following theorem will be established: 

If f (x) be a bounded function which is integrable (R) in the interval (a, 6), 
then f f (x) dx is a continuous function of x, for the whole interval (a, 6), 

a 

and it is a function of bounded variation in (a, b). It is also absolutely con¬ 
tinuous in (a, b). 

It has already been shewn that f / (x) dx exists, for any point x of 

J a 

the interval (a, b ); denoting its value by F (x), we have 
F (x ± h) — F (a;) =-= f / (x) dx; 

J X 

hence by (8), of § 337, | F (x ± h) — F (t) | Ph , where P is the upper 
boundary of | f (x) | in (a, b). If e be any arbitrarily chosen positive number, 
and we take h x < c/P, then for all values of h which are £ h l9 we have 
| F (x ± h) — F (x) | < e; 

but this is the condition of continuity of F (x) at the point x. In case x 
be one of the end-points of (a, 6), h must be restricted to have one sign only. 

To prove that F (x) has bounded variation in (a, b), let (a, 6) be divided 
into n sub-intervals by the points o, x lf x 2 , ... x n _ x ,b. The sum of the 
absolute differences of the values of F ( x) at the ends of these sub¬ 
intervals is 


[ f(x)dx + \ X% f(x)dx + ... + f f(x) 

J a J J Xn-l 


dx 


rb 

and this is, in accordance with the theorem (8) of § 337, ^ \f(x)\dx; 

J a 

and therefore the sum is less than a fixed positive number. Since the 
total variation of F (x) in (a, b) is bounded, it follows from the theorem 
of § 246 that the total fluctuation in the interval is also bounded. 


If (flh, fcj), b 2 ), ... (a n , 6 n ), ... be a finite, or infinite, set of non- 
overlapping intervals contained in (a, 6), of which the total measure is 
<r/, we have for the finite, or infinite sum 


2 I F (b r ) - F (Or) I s 17S (6, - a T ) < V U, 

r -1 
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where U is the upper boundary of | f (x) | in (a, b). The number rj being 
taken to be c/U, where e is an arbitrarily chosen positive number, we see 
that the condition, given in § 218, for the absolute continuity of F ( x) 
is satisfied. 

When/(x) is integrable in (a, 6), the function j f(x)dx, which has 

J a 

been shewn to be absolutely continuous, and of bounded variation in 
(<z, b), is said to be the indefinite integral corresponding to f (x). 

If f (x) be any function defined in (a, 6), a function </> (x) which, at 
every point x of the interval, possesses a differential coefficient equal to 
/ (x), is said to be a primitive of / (x). 

The definition is, however, frequently extended to cases in which 
<f>' (x) either does not exist, or is not equal to f (x), at points belonging to 
an exceptional set, of measure zero; the condition <$>’ (x) — f (x) being 
satisfied at all points not belonging to the exceptional set. 

Taking the function F (x) = j / (x) dx , as the indefinite integral corre- 

J a 

sponding to/ (x), the following properties will be established: 

(A) . Under certain restrictions F (x) possesses a differential coefficient 
which is equal to / (or), and thus F (x) is a primitive of / (x). 

(B) . Also it will be shewn that, if <f> (x) be a function which possesses 
a differential coefficient / (x), then / (x) may have an indefinite integral 
F (x), in an interval (a, x), which integral differs from </> (x) by a constant 
only; and thus that the indefinite integral of / (#) is determinate except 
for an additive constant. 

It will appear that there are cases of exception to both theorems. 
When F (x) is an indefinite integral, it may happen that at certain points 
F ( x ) does not possess a differential coefficient; and when <f> (x) is a function 
which possesses a differential coefficient, it is not always the case that 
the latter is integrable (B), and when integrated yields the function <f> (#) 
except as regards an additive constant. 

344. If f (x) be continuous in the interval (a, 6), and F (#) denote the 

indefinite integral f f (x) dx , then, at every point in (a, 6), F (x) possesses a 

J a 

differential coefficient which is equal to f (x). 

For since / (x) is continuous, an interval (x — h lf x -f hf) can be found 
such that \f(x± 0h x ) ~ f (x) | < c, for all values of 6 numerically less 
than 1. It follows that 

# rx±h 

F (x dt h) — F (x) = / / (x) dx 

J X 

± h [/(*) + e] and ± h [/ ( x) - e], 


lies between 
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provided h < h l . Hence, since 

F (x dt h) — F (x) 

±h 

lies between / (x) - f c and / (x) — e, 

for h<h li it follows that f (x) is the differential coefficient of F (x). At 
the points a , b, the function F (x) possesses derivatives on the right and 
on the left respectively, and their values are / (a), / (6). 

If (f> (x) be a function which at every point of (a, b) has a differential 
coefficient , which is a continuous function f (x), then 

<t> (x) ~ <f> (a) = [ f (x) dx . 

J a 

For let / (x) dx be denoted by F (x), then the function </> (x) — F (x) 

a 

has at every point a differential coefficient which is zero, and therefore, 
by the theorem of § 267, the function </> (x) — F ( x ) is constant; it is clear 
that this constant must be </> (a); and thus the theorem is established. In 
this theorem and elsewhere, a derivative at a on the right, and a derivative 
at b on the left, are included in the term “differential coefficient.’’ 

345. If a given bounded function / (x) that is integrable (B) be not 
everywhere continuous in the interval (a, 6), the proof given above is 
applicable to prove that, at any point of continuity of f (x), the function 

j f (x)dx has a differential coefficient equal tof{x), and thus has a differential 
J a 

coefficient almost everywhere in (a, b). 

That F (x) has a differential coefficient almost everywhere has been 
shewn, in § 298, to follow from the fact that F (x) is of bounded variation. 
It has here been proved independently, for the special case of an in¬ 
definite integral. 

At a point of ordinary discontinuity of / (x), the same proof, when 
modified by taking only positive values of h , or only negative values of h, 
and using / (x + 0) or / (x — 0), in the two cases, instead of / (x*), will 
shew that F (x) has at such a point derivatives on the right and on the 
left, and that these are / (x 4- 0), / (x — 0) respectively. At a point at 
which / (x) has a discontinuity of the second kind, the proof fails alto¬ 
gether; at such a point therefore F (x) need not possess a differential 
coefficient, nor definite derivatives on the right and on the left, but may 
have all its four derivatives D + F (x), D + F (x), D~F (x), D_F (x) of 
different values. 

If / (x) be integrable ( B ), and consequently point-wise discontinuous, 
and (x) be the function formed by extension of the functional values of 
/ (x) at its points of continuity, as explained in § 342, we have 

f(x)= x (*) + 0 (*)> 
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where y (#) is an integrable null-function; and therefore / ( x) and if* ( x) 
have the same indefinite integral F (a:). The derivatives of F (x) are 
independent of the function x ( x )j and depend only upon if/ (z), which is 
determined by the values of / ( x ) at its points of continuity. 


rxih 

Since F (x -f h) — F (x) -- ip (x) dx, and since the values of if/ (x) 

T 


in the interval (x, x -f h) all lie between ip (x -f 0) + € 1 and ip (x -f 0) — e 2 , 

, . 7 j , F (x + h) — F (x) 

where e 1? e 2 converge to zero as h does so, we see that - - - 

lies between ip (x + 0) + e 1 and ip (x 4- 0) — € 2 , hence D+F (z), D+F (x) 
both lie between* ip (x + 0) and ip (x ~f 0). By taking h negative, we see 


that D~F (x), D__F (x) both lie between ip (x — 0) and ip (x — 0). In case 
ip (x) have a unique limit on the right, F (x) has a derivative on the right, 
ip (x 4- 0); and in case ip {x) have a unique limit on the left, F (x) has a deriva¬ 
tive ip (x — 0) on the left. It may happen that ip (x) is continuous at a point 
of discontinuity of f (x); at such a point F (#) has a differential coefficient 
equal to the value of ip (x). Even when ip (x) has a discontinuity of the 
second kind, it is possible that F (x) may have a differential coefficient, 
or a derivative on the right or on the left, or both. This case has been con¬ 
sidered! by G. Prasad. 


Iff (x) be integrable (R), and F (x) be the corresponding indefinite integral , 
any one of the four derivatives DF (x), of F ( x ), is integrable (R ), and has 
F (x) for its indefinite integral. 

For DF (x) differs from / (x) only at a point of discontinuity of / (x) 9 
and at such a point DF (x) lies between the upper and lower boundaries 
of ip (#); thus / (x) — DF (a:) is an integrable null-function. Therefore 

f ip (x) dx = f D+F (x) dz = f D + F (x) dx = f D~F (x) dx 
J a J a J a J a 

— f D^F (#) dx — [ f (a:) dx — F (x). 

J a J a 

It has been shewn that the indefinite integral of a function that is 
integrable (R) has a differential coefficient at the everywhere dense set of 
points of continuity of the discontinuous function; there may however 


* It is stated by Schoenflies, see Bericht uber die Mengenlehre , p. 208, that the derivatives of 
F (x) are equal to ^(z + 0), *ft(x + 0), ip(x-Q), ip(x-Q). This, however, is not necessarily the 
case. It has been shewn by Hahn, MoncUshefte der Math. u. Physilc, vol. xvi (1905), p. 317, that 
/ (x) may be so chosen that the corresponding integral function has, at a particular point, deri¬ 
vatives on the right having arbitrarily given values lying between, or equal to, the values of 
ip (x + 0), tp (x + 0) at the point; and in particular that/(x) may be so constructed as to have, at 
the point, a definite derivative which has an assigned value between the two limits, 
t Bulletin Calcutta Math. Soc. vol. xv (1924), and vol. xvi (1925). 
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also be an everywhere dense set of points at which this continuous function 
does not possess a differential coefficient. 

346. It has been shewn that, if the continuous function <f> (#) possesses 
everywhere a differential coefficient / (x) which is everywhere a continuous 
function, then 

<f> (*) ~ («) = | / (s) dx = F (x). 

J a 

This is a particular case of the following more general theorem: 

If cf> (x) be a function continuous in the interval (a, 6), and if one of its 
four derivatives Z) f /> ( x ), D + <f> (x), D~<f> (x), D_<f> (x) be a bounded R-inte- 
grable function in (a, b), then each of the other three derivatives is also bounded 
and integrable (R) in (a, 6), and cf> (x) — <f> (a) is the integral of any one of 
the f'our derivatives through the interval (a, x). 

If (a, x) be divided into a number of parts (a, xf), (x lr x 2 ), ... (# n _ 3 , x), 

it has been shewn in § 280, that - ( Xr \ $ ^es between the upper 

X r ~ % r ~l 

and lower boundaries of any one of the four derivatives D<f> (x) in the 
interval (x r _ ly x r ). It follows that <f> (x) — cf> (a) lies between two sums 

(x 1 -a) U ( a,x 1 ,D) + (x 2 - x,) U (x lt z 2 ,D) + ... + {x-xV (x„._ 1 ,x,D), 
(x 2 - a) L (a, x lt D) + (x 2 - x 2 ) L (x,, x 2 , D) + ... + {x - x^) L(x„_ lt x,D) 

where U (x r _ x , x r , D), L (x r .^ 1) x r , D) are the upper and lower boundaries 
of D<j> (x) in the interval (x r _ x y x r )\ and it is known that these are the 
same for all four derivatives. The limits of the above sums, when the 
intervals are diminished indefinitely, so that the greatest of them con¬ 
verges to zero, are the upper and lower integrals of any one of the four 
functions Defy (x). If it be known that any one of these derivatives is 
integrable (R) in (a, x), then the upper and lower integrals are equal, and 
the other three are also integrable, the common value of the integral being 
cf> (x) — <f> (a). Thus 

<f>(x) — (f> (a) = I D+<f> (x) dx — I D + (f> (x) dx = j D~cf> (x) dx = I !)_</> (x) dx. 
J a J a J a J a 

It should be observed that, as has been shewn in § 280, the four 
derivatives are all equal to one another at a point at which one of them 
is a continuous function; and thus at such a point there is a differential 
coefficient. If one of the derivatives be integrable, there is therefore a set 
of points of measure equal to that of the interval (a, b), at which all four 
derivatives have equal values, and at which therefore a differential coeffi¬ 
cient exists. 

347. In case D<f> (x) be a bounded function which is not integrable (i?), 

the above proof shews that <f> (x) — (a) lies between the upper and lower 
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integrals, in (a, x ), of any one of the four functions D<j> (x). This includes 
the case in which <f> (x) has a differential coefficient which is bounded but 
not integrable (R); in that case cf> (x) — <j> (a) lies between 

I <f>' (x) dx and I 0' (x) dx. 

J a J a 

Tw I h 

Since I 0' (x) dx is in absolute value less than h. U, where U is the 

J T 

upper boundary of | 0' (x) | in (a, b ), it follows, as in § 343, that [ 0' (x) dx 

■ a 

is a continuous function of x \ similarly it may be seen that f <f>' (x) dx 

fa 

is a continuous function of x. At a point of continuity of 0' (x ), both 

f (/>' (x) dx , f 0 ' (x) dx 
J a fa 

have the differential coefficient 0' (x), as may be seen by a process precisely 
similar to that in § 344. Thus the upper and lower integrals of 0' (x) 
possess properties similar to those of the integral of 0' ( x ), when it exists, 
and both of them may be regarded as primitives of 0' ( x ), in the extended 
sense. 


The function 7>0 ( x ), when not integrable ( R ), may be a non-integrable 
point-wise discontinuous function, or it may be totally discontinuous. 

If / (x) be any non-integrable bounded function, the following theorems 
may be established by proofs similar to those in § 343 and § 345: 

The upper and lower integrals j f(x)dx, f f(x)dx are continuous , and 

J a Jja 

of bounded variation in (a, b). 

At any point of (a, b), at which f (x) is continuous , the upper and lower 

integrals f f(x)dx y | f(x)dx each possess a differential coefficient which is 
J a fa 

equal to f (x). 

348. An important general class of continuous functions for which 
the four derivatives are not integrable (B) } even when a differential 
coefficient exists, or when derivatives on the right and on the left always 
exist, is the class of everywhere-oscillating functions. Those functions 
which become everywhere-oscillating functions when a linear function is 
added have the same property. 

If a derivative DF (x) be such that, in every interval, it has no finite 
upper boundary or no finite lower boundary, it is certainly not integrable 
(R); it is therefore only necessary to consider an interval in which the 
funotion DF (#) is bounded. Let (a, x) be such an interval, and let us 
suppose that F (x) — F (a) is not zero. 
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In every interval (x r _ lf x r ) contained in (<z, a), U (x r _j, x ri D) the upper 
boundary of DF (a:) is positive, and L (# r -i, x r , D) the lower boundary 
of DF (x ) is negative; thus the two sums 
{x t - a) U (a,x lt D) + (x 2 - xf) U ( x lf x 2 , D) + ... 4- (x-x n _ x ) U 
(^i ~a)L (a, Xj , D) -f (x 2 - xj L (x lt x 2 ,D) -f ... -f {x - x^) L(z n _ lt x,D), 
are such that the first is essentially positive, and the second essentially 
negative, the non-vanishing number F (#) — F (a) lying between them. 

It follows that the limits of these two sums, as the number of sub¬ 
divisions of (a, x) is increased indefinitely, must be different from one 
another, since they cannot have zero as their common value; thus 

I I)F ( x ) dx, f DF (#) dx 

J a Jja 

are distinct from one another. 

It has thus been proved that a continuous function which is everywhere- 
oscillating in (a, h) cannot have a derivative which is integrable (R) in (a, 6), 
even if it have everywhere a differential coefficient , or definite derivatives on 
the right and, on the left. 

The function DF (; x ), or / (#), in the case of such function, may be a 
point-wise discontinuous function such that the measure of the set of 
points of discontinuity is greater than zero, or it may be a totally dis¬ 
continuous function. 

A continuous monotone function, which is not reducible to a function 
with an infinite number of oscillations by the addition of a linear function, 
has at every point definite derivatives on the right and on the left, each 
of which is either continuous or is an IMntegrable discontinuous function, 
since either derivative has only ordinary discontinuities. Thus such a 
function has I?-integrable derivatives, provided these derivatives are 
bounded in the interval. 

In case the continuous function F (x) have a differential coefficient, 
or a derivative which is not everywhere finite, or is not bounded in the 
interval, this derivative is not integrable in the sense in which we have 
hitherto defined integration. This case will be considered in connection 
with the theory of improper integrals. 

349. The preceding investigations provide answers to the questions 
which arise as regards the validity of the two propositions (A) and (B) of 
§ 343, which together constitute the fundamental theorem of the Integral 
Calculus, asserting that the operations of differentiation and of integration 
are in general reversible. The definition of a definite integral has hitherto 
been restricted to that of Riemann, and is applicable to bounded functions 
only. The extensions of that definition to the case of unbounded functions, 
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which will be considered later, and also the more general definition of 
integration due to Lebesgue, to be considered in Chap, vii, will lead to 
corresponding extensions of the scope of the fundamental theorem. 

As regards the theorem (A), that the indefinite integral 

F(x) = \ f (x) dx 

J a 

of a bounded B-integrable function possesses a differential coefficient 
equal, at a point x of (a, b), to / (x), it has been shewn that the theorem 
holds without restriction in case / (x) is a continuous function; but that, 
if / (x) be not continuous, the theorem still holds as regards every point 
of continuity of / (x). It follows that the points of (a, b) at which F (x) 
either possesses no differential coefficient, or possesses one which is not 
equal to/(x), form a set of measure zero, which may however be every¬ 
where dense in ( a , b). 

The theorem (B) that, if </> (x) possess a differential coefficient / (x), 
then the corresponding indefinite integral F (x) = j f (x) dx differs from 

<f> (x) only by a constant, holds if / (x) be a continuous function, and more 
generally, if / (x) be bounded and integrable (R). In case <f> (x) does not 
at all points possess a differential coefficient, the more general theorem is 
applicable that, if any one of the four derivatives of <j> (x) be bounded 
and integrable ( B ), then the integral function corresponding to that de¬ 
rivative differs from </> (x) by a constant only. The theorem fails either 
in case <f> (x) be not a function with bounded derivatives, or in case it be 
a function with bounded derivatives, but those derivatives do not satisfy 
Biem anil’s condition of integrability. 

The problem of the determination of a continuous function which shall 
have a given function / (x) for its differential coefficient, at every point at 
which / (x) is continuous, may be considered here in the case in which 
/ (x) is restricted to be bounded in the interval (a, b) for which it is defined. 
This problem of the determination of such a primitive of / (x) is regarded 
as having a unique solution provided functions exist which satisfy the 
condition, and further provided any two such functions differ from one 
another by a constant only, that constant having one and the same value 
for the whole interval (a, b). In the first place, the problem cannot be 
determinate unless / (x) be integrable (B) ; for either of the two functions 

[ / (x) dx, j / (x) dx satisfies the condition of the problem, and these 

J a J[a 

functions do not differ from one another by a constant, as they both 
vanish at the point a, and are elsewhere unequal. Next, if / (x) be inte¬ 
grable ( R ), the function [ f (x) dx satisfies the condition of the problem, 

Ja 
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but the solution is not necessarily unique. In case however the points of 
discontinuity of the J?-integrable function / (x) form an enumerable set, 
the theorem of § 267 shews that the solution is determinate; for any two 
functions which have equal finite differential coefficients at all points of 
(a, b) except those of an enumerable set, differ from one another by a 

constant. In this case f f (x) dx -f- C is the function required. When the 

J a 

points of discontinuity of the integrable function / (x) form an unenumer- 
able set, with a perfect nucleus, although that set must have zero measure, 

the problem has not a unique solution. For, although j f (x) dx is a 

function which has the required property, another solution is obtained by 
adding to it any continuous function which has all the intervals comple¬ 
mentary to the perfect component of the unenumerable set as lines of 
invariability; that such functions exist has been established in § 269. 

There exists however only one function, viz. j f (x) dx, with bounded 

J a 

derivatives, which satisfies the condition of the problem; for it has been 
shewn in § 286, that any two functions which have bounded derivatives, 
one of which derivatives is prescribed at all points not belonging to a 
certain set of measure zero, differ from one another by a constant. 

Similar remarks apply to the more general problem of the determination 
of a function which shall have one of its four derivatives, say the upper 
one on the right, equal to a given function / (x) at every point of continuity 
of / (x). This problem has a solution whenever / (x) is bounded; in virtue 
of the theorem of § 267, the solution is unique when f (x) is integrable ( R ), 
and the points of discontinuity of / (x) form an enumerable set. When 
f (x) is integrable (R), and the set of points of discontinuity is unenumer¬ 
able, there exists, in virtue of the theorem of § 286, only one solution for 
which the derivatives are bounded. As before, if the restriction, that the 
required function is to have bounded derivatives, be not imposed, the 
solution of the problem is indeterminate. 

EXAMPLE 

The following example was given* by Volterra, as the first case of a continuous function 
possessing a bounded differential coefficient, not integrable {R). 

Let O be a perfect non-dense set of points in the interval (a, b), and such that its 
content is greater than zero. Let (a, p) be an interval complementary to the set G, and let 

1 ,1 1 


^ (x, a) — (z — a) 2 sin 


, and therefore 6' (x, a) — 2 (x — a) sin-cos - 

X - a x 


The 


function <f>' ( x, a) vanishes at an infinite number of points in (a, p); let a 4- y be the greatest 
value of x which does not exceed \ (a -f p), for which <f>' (x , a) vanishes. Let F (x) «= 0 at 
every point of G , and in each interval (a, p) complementary to G, let F (x) = <f> (x, a), for 

* Giorn. di Battaglini , vol. xix (1881), p. 335. 
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values of x such that a <i x £ a 4* y; let F (a:) = <j> (a -I- y, a), for values of x such that 
a + y£x£0 — y; and let F (x) =* — <j> (x , 0), for /? — y £ x £ fZ. The function F (*) is 
continuous, and has everywhere a finite differential coefficient which is bounded in the 
interval (a, b). It is easily seen that F' (x) vanishes at every point of O. The function 
F' (x) has a discontinuity of measure > 2 at each point of the set G , which is not of zero 
content, and therefore F' (x) is not an integrable function. 


FUNCTIONS WHICH ARE LINEAR IN EACH INTERVAL OF A SET 

350. The existence of continuous functions which are linear in each 
interval of an everywhere dense set of intervals has been already referred 
to in § 274. It has been shewn in § 269 how a function / (x) can be con¬ 
structed which is continuous, and has as lines of invariability the intervals 
complementary to a non-dense perfect set of points. It is clear that the 

integral function ( / (x) dx is linear in each of the intervals, and being 

• a 

also continuous, it is a function of the type referred to. A more general 
function which is continuous, and is linear in each interval of the set, 

i ~,r 

f(x)dx any continuous function for which 

. a 

the intervals of the set are lines of invariability. 


INTEGRATION BY PARTS 


351. If u , v denote functions of x , defined for the interval (a, 6), and 

du dv ^ interval, the 


which have continuous differential coefficients 


formula 


d (uv) dv 
dx = U dx + V dx 


dx 9 dx 
du 


leads at once, by integration over the interval (a, b), to the well-known 


formula 


rb dv j s 

u 7 dx + 


dx 


b 

v 

J a 


du 

dx 


dx 


uv 


for integration by parts, due to Leibnitz. 

The simplest generalization* of this formula is that, if / (x), g (x) are 
bounded and integrable (E) in the interval (a, b), then 

ff (*) | j X 9 (*) dx^dx+ j b g (x) j J* f (x) dx J dx = jj (x) dxJj ( x) dx. 

To prove this formula, we employ the identity 

r -p r-p r **p r~p 

2 <z r 2 j8 r = L ct r s r _! -f- 2 ft s r , 

r »1 r-1 r «1 r-X 

where s r denotes a* + -f* ... + a r , and s' r -i denotes ft + ft + + ft-i* 

* SeeThomoe, Zeitschr.f ’. Math. vol. xx (1875), pp. 475-478; also Hardy, Messenger of Math. 
vol. xxx (1900), pp. 185-187, and vol. XLVin (1919), p. 90. 
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In a net D n fitted on to the interval (a, 6), let a r = 8,.^ (8 r ), where 
M t (8 r ) is a number not greater than the upper boundary of / (x) in the 
mesh 8 r , nor less than the lower boundary of / (x) in 8 r ; let fi r be 8^2 (8 r )> 
where M % (8 r ) is defined similarly in relation to g (#). We have then, if 
P = rn n , 

r ™nin r t~r- 1 "1 r m n V t'-r ~| 

s I KM, (8 r ) 2 8 t M t (8,) + 2 | 8 T M, (8 r ) 2 8 t M, (8.) | 

J r-1 


r^m n r~m* 

= 2 8 r M x (S r ) 2 8,-Jf, (8 r ).(1). 

r-1 r“1 

r m n 

Let n be so great that | / (x) dx differs from both Z 8 r U 1 (8 r ) and 

. a r •» 1 

r=>m* rb 

Z 8 r L 1 (8 r ) by less than 77; and so great that gr (x)da: differs from both 

r- 1 .'a 

r 

Z 8 r ^ 7 2 ( 8 r ) and Z 8 r L 2 ( 8 r ) by less than 77; where 77 is an arbitrarily 

r-1 r-1 

chosen positive number, f/j (8 r ), L 2 ( 8 r ) are the upper and lower boun¬ 
daries of f (x) in the mesh 8 r , and U 2 (8 r ), L 2 (8 r ) those of g (z). 

Then it is clear that, for every value of r, 


Z 8 t U 1 (8 t ) — [ f (x) dx < 77, and 


[ f(x) dx - Z 8 t L x (8 t ) < 77, 
Ja 1 


rxr t =-r 

hence / (a:) dx and Z 8*^ (8,) differ from one another by less than 77. 

.'a <>=1 

A similar result holds with respect to the function g (x) ; the mesh 8 r 
being {x r _ l7 x r ). 

If M t (8 r ) = / (^ r _x), and M 2 (8 r ) — g (a: r _x), we see that 

r - m* f £-r -1 “| 

2 8 r M, (8,) 2 S,Jf, (8 ( ) 


differs from Z & r f( x r-i) g (x) dx by less than 77 Z 8 r | / (av-*) | or 

r “ 1 J a r - 1 

by less than A (b — a) rj, where A is the upper boundary of | / (x) | in 
(a, b). A similar result holds with respect to g (x). 

As 77 is arbitrarily small, we now see that, when n ~ 00 , the formula 
obtained by proceeding to the limit of the expressions in (1) gives the 
above formula for integration by parts. 

The formula is equivalent to a somewhat more complex formula which 
was given* by Du Bois-Reymond. 

Let a, be fixed points in the interval (a, b). Writing 
F (*) = f / (*) dx, G (x) = [ g (x) dx, 


Abhandlungen <L Munch . Akad. vol. xn (1875), p. 129. 
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the above formula becomes 

f / ( x) G (#) dx -f f g {x) F (x) dx = f f (a;) dx [ g (x) dx. 

J a J a J a • a 

If we subtract from each side the expression 

G (P) | / ( x) dx + F (a) f g (x) dx, 

J a J a 

and express each of the integrals on the right-hand side as the sum of 
two integrals, taken from a to a and from a to b, in the first case, and 
from a to ft and ft to 6, in the second case, we find, after a little reduction, 
Du Bois-Reymond’s formula for integration by parts: 

j 7 (*) j J 0 (x) dxj dx + [ g (x) | J / (x) d.xi dx 

= ( f (x)dx f g (x)dx . 

.J a J 13 J « 

This formula reduces to the earlier one, if we take a — ft — a. 


The case of functions / (# (1) , x l2) ), g ( x {l) , x {2) ), both integrable ( R) in 
a cell (a (1) , a (2) ; b {1) , 6 (2> ), may be considered by applying the formula (1), 
when the meshes 8 are those of a net of a system fitted on to the cell. 
The form of the resulting expression when we proceed to the limit will 
depend upon the order in which the meshes of the net D n are arranged 
in the series. If we count the meshes from left to right, first taking the 
lowest row along the x {1) axis, then the next row also from left to right, 
and so on, it is easily seen that the formula obtained is 


r(b< l >, 

/(*<l ) ,Z (2) ) 

(ad>, a< 2 >) 


f (&d>, .r<*>) 


U (a (1 >, a< 2 >) 


g (x (1) , x (2) ) d (x ll) , x (2) ) 


d (*<», z< 2 >) 


(bw, w) 

+ | g (x a) , x {2) 

(a^), a <*>) 

(b('\ bW) 


( 6 d>, .r< 4 >) 


U (a (1) , a< 2 >) 


f(x"\x ( 2 ) )d{x"\x' 


d (z™, x (2) ) 


/, 

r(bv\bW) [(bw.bw) 

f (x< l \ x' 2 ') d (x (1) , z< 2 >) g (z™, x< 2 >) d (x {1 \ x< 2 >). 

.'(ad), a < 2 >) J(ad> a< 2 )) 


This is one of the formulae analogous to the formula for integration 
by parts in the case of functions of one variable. 


cauchy’s definition of an improper integral 

352. The definition of the 12-integral of a function / (x), in a linear 
interval (a, b), presupposes that / (#) is bounded in (a, 6). A method was 
given by Cauchy of extending the definition to cases in which f (x) is 
unbounded in the neighbourhoods of points of a finite set. 

Let us suppose that a point c, where a < c < b, is such that, in its 
arbitrarily small neighbourhood, the absolute values of the function have 
no upper limit, and let us suppose further that c is the only point of this 
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kind, and that the function / (#) is integrable in any sub-interval of (a, b) 
which does not contain c within it, or at an end. The two integrals 

( / (x) dx, [ f(x)dx 

Ja . c -tV 

both exist, whatever sufficiently small positive values be assigned to e, e'. 

It may hap-pen that , as e, e are diminished independently , so as to con¬ 
verge in each case to the limit zero , the two integrals also converge to definite 
limits; if this be the case, we define the sum , 

lim I f (x) dx -f- lim [ f (x) dx 
e~0 J a f'~!) . cf(' 

to be the improper integral of f (x) in the interval (a, b), and we denote this 
improper integral by b 

| / (x) dx, 

■ a 

using the same notation as in the case in which f (x) is integrable (R) in (a, b). 


rc -e 

The condition that lim / / (x) dx 

€-0 la 

should exist, is that, corresponding to each arbitrarily small number S 
which may be chosen, a number ej can be found, such that 


/; 


f (x) dx < 8, 


C - 

whatever value 0 may have, subject to the condition 0 < 6 < 1. 
A similar condition must be satisfied in order that 

lim f f (x) dx 

may exist. e '~° ' cH e ' 

It may happen that, although the two limits 

[ f(x)dx , [ f (x) dx 

Ja J C-+V 


do not exist, yet if we take e' = e, the sum 

[ f (x) dx + l f (x) dx 
Ja Jc+e 

may have a definite limit; when this is the case, the limit defines Cauchy's 
principal value of the integral of / (x), in (a, b). 

It thus appears that a principal value may exist when the function 
possesses neither an integral nor an improper integral in the interval (a, 6). 

In case the point a itself be a point of infinite discontinuity , then the limit 

f / (*) dx, 

Ja+< 
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for e ~ 0, when it exists , is defined to he the improper integral 

f f(x)dx 
J a 

off ( x ) in the interval (a, b). 

A similar definition applies in case the point h is a point of infinite 
discontinuity of the function. 

If, in the interval (a, 6), there are two points of infinite discontinuity 
Ci, c 2 (where a < c x < c 2 < h ), let c be any point between c x and c 2 . Then 
in case the four improper integrals 

[ / (x) dx, I f(x)dx, \‘f(x)dx, f f (x) dx 

• a J Ci J C C 8 

all exist, their sum is defined to be the improper integral of / (a;) in (a, b), 
and is denoted by / & 

j f (x) dx ; 

• a 

and it is clearly independent of the value of c. The definition, in case one 
of the points c 1} c 2 is an end-point of the interval (a, 6), is of the same 
character; or both of them may be end-points. If c t = a , c 2 = 6, then if 
the two improper integrals 

f f(x)dx , ( f(x)dx 

J a J c 

exist, their sum defines the improper integral 

( / (x) dx. 

J a 

The definition of an improper integral can now immediately be ex¬ 
tended to the case in which there is any finite number of points of infinite 
discontinuity in the interval. If these be c lt c 2 , c 3 , ... c„, taken in order 
from left to right, and the improper integrals 

l‘f(x)dx, \'f(x)dx,...\f(x)dx 

all exist, their sum is defined to be the improper integral 

[ f ( x ) dx. 

J a 

The definition can be extended to the case in which there is an infinite 
number of points of infinite discontinuity of / (x), forming a set O , of 
the first species, and of the first order. In that case G f consists of a finite 
number of points c lt c 2 , ... c n . 

If the improper integrals 

rct-ti rc,~t, rc,-t, 

f(x)dx, f{x)dx, f(x)dx,..., 

JO J Ci + «i' Jc 
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each of which falls under the last case, all exist, and have values which 
converge to definite numbers as , e/, e 2 , e 2 ', ... converge to zero, inde¬ 
pendently of one another, the sum of their limits is taken to define the 
improper integral fb 

f (■*) dx. 

J a 

It is clear that this definition admits of extension to the case in which 
G is of the first species, and of any order. 

It is easily seen that an integral which exists in accordance with this 
definition is such that the integral also exists in the interval (a, #), where 
x is any point in (a, b), and that it is a continuous function of x. Moreover 

I / (z) (lx = I / (x) dx + I / (x) dx. 

. a J a J j 

This definition of an improper integral was given* by Du Bois-Iteymond, 
for the case in which G' is finite. The definition has been extended*)* by 
Schoenflies to the case in which G is enumerable, but possesses transfinite 
derivatives. 


353. When the integral of a function / ( x ), with a finite, or an enumer¬ 
able, set of points of infinite discontinuity, exists, it may, or may not, 
be the case that | / (r) | has an integral in the same interval. 

rb 

The improper integral f (x) dx 


is said to be absolutely convergent in the interval (a, b), in case 

f I / (X) I dx 

<J a 

exists; otherwise it is said to be conditionally , or non-absolutely , convergent. 

The integral is said to be absolutely convergent at a point c, of infinite 
discontinuity of the function, when both the integrals 

[ I / (*) I dx, [ If (x) | dx 

. C f J C 

exist, provided € be sufficiently small. If either of these does not so exist, 
the integral * b 

f (x) dx 

J a 

is non-absolutely convergent at the point c. 

An absolutely convergent integral is absolutely convergent at each of 
the points of infinite discontinuity of the function. The converse also holds. 


* CreZZe’s Journal , vol. lxxix (1875), p. 36. 


f See Btrichty vol. i, p. 185. 
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rb rb 

If \f(x)\dx exists, then f (x) dx exists. For, if c be a point of 

J a J a 

infinite discontinuity of f (x), we have 

If f (x)dx I s ( I / (x) I dx < r ); 

I J C f e' I J Cl «' 

for all sufficiently small values of e, whatever value c' (< e) may have. 
The corresponding condition, on the other side of c, is also seen to be 

rb 

satisfied. The condition of convergence of / (x) dx, at the point c, is 

J a 

therefore satisfied. 


EXAMPLES 


f 1 l.i 

1. The integral I sin dx exists, but is non-absolutely convergent. For consider 
J o x x 

dx, taken over the set of intervals 

( — 1 -—-- -V where n = 1, 2, 3, ... m. 

\2w7r-)-^7T 2rnr-\-{nJ 9 


1 . 1 
- sin - 

X X 


We have 


/■*»»+*•r 1 . 1 _ ^ 1 , / 1 \ 

J 1_* * V2 \ Sn + lJ* 


2nir + 

hence the integral taken over the set of intervals is 


1 n-m f 1 \ 1 (n-m ] ) 

7 2 ° ge n” 1 V + 8»“+d) > V2 l0ge I » - j 5T+TJ ; 


^ V2 

thus, as m increases indefinitely, the value of the integral does so, and therefore | i sin ~ | 

cannot be integrable in the interval (0, 1), in accordance with Cauchy’s definition. On the 

Al | . All 

other hand J sin dx converges to a finite limit, as e ~ 0, so that the integral / sin -* dx 

J « x x J o x x 

exists in accordance with Cauchy’s definition, but it is not absolutely convergent. 

2. Let / (x) denote a function which is integrable ( R ) in every interval (a, 6), where 

d> lx) 

0 < a < b; and let f (x) be, in the neighbourhood of the point 0, of the form — V £- / , where 
k is positive, and <f> (x) is a bounded function, defined for the interval (0, c). 

We have | /*' *-g*\<A j'* < , ^ [c-* - 

where A is some positive number. 

If 0 < k < 1, it is clear that I dx is arbitrarily small, for a sufficiently small value 

J « x 

of € (> e), and therefore the improper integral I f(x)dx exists, being convergent at the 
point x = 0. If k £ 1, the improper integral does not exist. 

d> (x) 

3. Let / (a) be, in the neighbourhood on the right of the point 0, of the form 
where p is positive, and <f> (x) satisfies the same condition as in Ex. 2. 


We have 


4>{x) 


- 9 dx\<~ {[log<]“ p - [log< T p }; 


x [log £] l+p 

and thus the improper integral j f(x)dx exists, being absolutely convergent. 

J o 
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4. ( Z dx t taken through any interval which contains a point of infinite discontinuity 


of tan x t does not exist. 
For 


n , 

/ 2 tan x . . 2 . sin e 

ax > log —,, 

ft X n Sin C 


and this is arbitrarily great, for a sufficiently great value of e/e'; thus the integral does not 
converge at the point x = The integral possesses however a principal value at the 
point £tt. For the sum of the integrals taken through the intervals (£■* — e, — e') and 
(Arr e', -f e) is 


[* 2x 

I cot X . , dx < 7T (4 7T 2 - 

J t' - * 


and this converges to 0, with e. 


1 

5. The function cos (e x ) 4 * - e x sin ( e x ) oscillates between indefinitely great positive and 
negative values, in the neighbourhood of the point x = 0. For every value of x , except 
d ^ 

x = 0, the function = ^ { 2 : cos (e x )}. 


d 1 

Also J f i x cos ( e *)) d x ~ € nos (**) ~ cos ( ee )» 

where e > e' > 0. It thus appears that the integral of the function converges at the point 
x - 0; and therefore the function is integrable in an interval containing that point. 


RIEMANN INTEGRALS OVER AN UNBOUNDED INTERVAL 

354. The definition of the integral of a bounded integrable function 
given in § 330 is applicable only to the case in which both the limits a, b 
are definite points, and in which therefore the interval of integration is 
finite. 

Let x l9 x t9 ... x n9 ... be a sequence of increasing numbers having no 
upper limit; it may then happen that the sequence of R -integrals 

( l f(x)dx, f * f (z) dx, ... I f(x)dx 9 ... 

Ja Ja a 

has a definite limit A, independent of the particular sequence {# n } chosen. 
When this is the case / ( x) is said to have an integral (R) 

[ f (#) dx, 

J a 

in the unbounded interval (a, 00 ), the value of this integral being A. It 
has been presupposed that, in every interval (a, x), the function / (#) is 
integrable (R). 

If the integrals 



where x l9 x 29 ... x n9 ... form a sequence of decreasing values of x which has 
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no lower limit, all exist, and the sequence of integrals have a limit B> 
independent of the particular sequence chosen, the limit B is denoted by 

f (*) dx. 


If the two integrals 


i 


[ / (x) dx, f f (x) dx, 

JO J -oo 

as thus defined, both exist, their sum is denoted by 

i f (x) dx. 

J -oo 

The three numbers 

[ f(x)dx, f / (x) dx, [ f(x)dx, 

J a J -oo J -co 

being the limits of integrals, and not themselves in the proper sense of 
the term integrals, belong to the class of improper integrals. 

In each case it is necessary, but not sufficient, for the existence of these 
improper integrals, that / (x) be integrable (R) in every finite interval con¬ 
tained in the intervals (a, oo), (— oo, b), or (— oo, oo); and it will at 
present be assumed that / (x) is bounded in every such finite interval, and 
has therein a proper integral. 


In case the integral J f (x) dx have a definite limit, as c is indefinitely 
increased, that limit is said to be the principal value of 

[ / (X) dx. 

J -00 

This principal value may exist, even when the integral 

[ / (x) dx, 

J -CO 

as defined above, does not exist; but in case the latter does exist, its value 
is equal to its principal value. 

The necessary and sufficient conditions for the existence of the integral 


[ f(x)dx , 

J a 


are (1), that the integral exists in every interval ( a,x ), where x > a, and 
(2), that , corresponding to every arbitrarily chosen positive number e, a value 
of x, can be found such that 


j f(x)dx 


< €, 


for every value of such that 

A similar condition applies to the case of 
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355. It was shewn in § 332 that the necessary and sufficient condition 
that the bounded function / (x) be integrable (72) in the interval (a, b ) is 
that, for a particular system (D n ) of nets, fitted on to (a, b), the sum 

m 

2 8 m (n) F (8 m (n) ) should converge to zero, as n~ oo. It was also shewn 

VI *» 1 

that, if this condition be satisfied for one system of nets, it is satisfied for 
all such systems. 

We have to enquire how far a corresponding condition applies to the 
case of an 72-integral through an infinite interval. 

fb 

Since f(x)dx, when it exists, is the limit lim f(x)dx , we see that 

J a b ~oo . a 

the integral is given as the repeated limit lim lim 28 m (n) t7 (8 m (n) ), or as 

b ~oo oo 

lim lim 28 m (n) L (8 m (w) ). The question then arises whether, or under what 

b ~oo oo 

conditions, the order of the repeated limits may be reversed without 
altering their values. When this can be done, we have 

\* f{x)dx= lim 2 &m in) U (8 m (n) ) = lim 2 & m ™L (8 m <">), 

J a n~oo m * 1 n- oo m »1 

where the summations are taken for the meshes of a net fitted on to the 
infinite interval (a, oo). 

In the first place it is necessary that the two sums should not be 
divergent. 

The following theorem will be established: 

//, for the function f (x), bounded in the interval (a, oo), a system of nets 
fitted on to the interval exists , such that 2 8 m {n) U (8 m <n) ), 2 8 m (n) L (S m (n) ) 

m * 1 to -1 

both exist , as definite numbers , S n , 2 n , which converge , as n ~ oo, to one and 
the same number A , f (x) dx exists , arid ite rniwe is A . 

J a 

From the condition stated in the theorem it follows that 

S SJ n >F (SJ">) 

m -1 

exists, and converges to 0, as n ~ oo. It will be shewn that this condition 

is sufficient to ensure that f f (x) dx exists in every finite interval (a, 6), 

J a 

where b> a. If b is not an end-point of a mesh of the system, we may 
modify the system of nets by dividing into two each mesh that contains 
the point 6. The first sum will thereby be not increased, and the second 

CO 

sum will not be diminished; accordingly the condition that 2 8 m {n) F (8 m (n) ) 

TO “ 1 

exists and converges to zero, as n ~ oo, will still hold good. It then follows 
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rn - 

that, when we take the finite sum 2 b m (n) F (8 m (n) ), where b is an end- 

m -1 

point of 8 mw (n) , this sum will converge to zero, as n ~ oo; and thus 
rb 

f (x) dx exists as an i?-integral. 

a 

Again we have, for any finite value ra, of ra, 


m » rn 

S 8 m in) U ( 8 m <n >) — 

m-1 


a+26 m ("> 

1 


/(*)<*** S 8 m (n) F (S m ,n) ) 

m - 1 


and thus < 17, where 77 is an arbitrarily chosen positive number, and w is 

CO 

chosen so large that 2 & m {n) F (S m (n) ) < 77. The number ra may be chosen 


so large that 2 8 w (n) £7 (8 m (n) ) < 77; hence 

m - m +1 


s S m (n, U ( 8 m <n) ) 


1 


/ (z) da: < 277, 


for m > mj, some fixed integer. 

We may also suppose n to have been chosen so large that 

S 8 m ^?7(S m ^)-A<77; 

* 1 

r a+i 

we see then that f(x)dx differs from A by less than 377 , for all values 

. a 

m 

of ^ which are equal to a + 2 S w (n) , where m ^ . If X be any number 

m -1 

greater than the least of these values of two values of f exist, between 

which X lies. If & be the smaller of these, j f (x) dx lies between Ud 

and Z/d, where d is the maximum of the meshes d n of D n \ and U, L are 
the upper and lower boundaries of / (x) in (a, 00 ). It now follows that 
[X 

f(x)dx differs from A by less than 377 + £, where £ is the greater of the 

■ a 

two numbers | Ud |, \ Ld\. Since 77 , £ are both arbitrarily small, it follows 

rx 

that f (x) dx converges to A , as X ~ 00 ; and thus that the i^-integral 

. a 

f (x) dx exists. 

J a 

00 

It should be observed that the convergence of 2 S m in) F (S TO (W) ) to 

m-l 

zero, as n ~ 00 , for a particular system of nets, is not by itself sufficient 

rOO rfr 

to ensure the existence of / (x) dx, but only that of / (#) dx, for every 

.0 . o 

finite value of b (> a). In this respect an integral ( R ) over an infinite 
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interval differs from one over a finite interval; since, in the latter case, 
the convergence of the finite sum HS in) F ( 8 (n) ) to zero, as n ~ oo, for a 
particular system of nets, is sufficient to ensure the existence of the 
integral. 

356. The converse theorem will now be proved, that: 

f oo 

f(x) dx have a definite value, a particular system of nets can always 

■ a 

00 

be determined for (a, oo), such that E b m in) U ( 8 m (w) ) exists for each net D n 

m«= 1 

of the system , and converges to the value of the integral , as n ~ oo. 

Let us consider a set of points a,x lt x 2 , ... x n , ... which diverges as 
n ~ oo. A net can be determined for the interval (a, x x ), such that 

SSC7 (8) — [ Tl f(x)dx<ie; 

■' a 

similarly, a net can be determined for (x x , x 2 ), such that 
ESC/ ( 8 ) - [Jf (x) dx < e ; 

and generally, a net can be determined for (x n _ x , x n ), such that 
Z8U($)-[ r " f(x)dz<±*. 

J ffl- J * 

Thus a net can be fitted on to (a, x n ), for which 

zsu (8) ~ [y (z) dx < e (i ~ 

rx n r*> 

Now x n can be taken so large that I f (x) dx — j f (x) dx < rj ; there- 

J a J a 

n™m 

fore a net can be fitted on to (a, oo), such that 2 hU ( 8 ) differs from 
00 “ 1 
f (x)dx by less than an arbitrarily small number, for all sufficiently 

. a 

large values of m. Therefore I! hU ( 8 ) converges to a limit which differs 

00 m " 1 

from / (x) dx by not more than c. By taking a monotone sequence of 

J a 

values of e which converges to zero, we obtain a system of nets, fitted on 
to (a, oo), such as is required. 

It may also be possible to define a system of nets D n such that E8f7 ( 8 ), 
taken over (a, b ), does not converge, as b ~ oo; and thus the theorem 

f(x)dx = Urn E 8J*>Z7 ( 8 w w) « lim E S m ™L ( 8 m <«>) 

J a n ~ oo m ~ 1 n ~ oo n «1 

only holds, provided the nets are such that the sums exist for each net of 
the system, from and after some fixed value of n. 
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It can also be shewn that: 

When the integral f (x) dx has a definite value, then for any system of 

j a 

OO 

nets fitted on to (a, oo) which is such that 2 b m (n) F ( 8 m (n> ) exists, and con- 

m -1 

oo 

verges to zero, as n~ co, the set of numbers 2 8 m (n) t7 (S w (n) ) exists, and 

m 1 

converges to the value of the integral. 

Since, for every value of m, 


ra+2&m ( *> 


f(z)dxs E8 J”>U (8 J n >) f 

1 j a 


a + 2£ m <«> 


/(z)tfz + 2S m <«>F (8 m (n) )> 


we see that the number 2 b m in) U (S m (n) ) has, as m ~ oo, limits of indeter¬ 


minancy between / (x) dx and 


•00 

/(*) 
. a 


dx + E 8 m ln, F (8 m (n) ). There- 


QO rb 

fore as ft ~ oo, 2 h m (n) U (8 m (n) ) converges to f (x) dx. For a system of 

. a 

nets which is not such that 

S SJ*'F (8 m (n) ) 

m “1 
m 

converges to zero, 2 $ m (n) U (8 r w (n) ) does not in general converge, asm - oo. 

m -1 

f* I 6 

357. The definitions of f(x)dx, f(x)dx may be extended to the 

.'a j -ao 

case in which / (a;) has points of infinite discontinuity. If the improper 
rx 

integral / (a:) da; exist for every value of X which is > a, and if it 

J a 

converge to a definite limit, as X increases indefinitely, then that limit 
defines 

/ (x) dx. 

J a 

The integrals f / (x) dx, [ / (x) dx, when they exist, possess many of 

Jx J --co 

the properties of a proper, or an improper, integral [ / (x) dx. These 

■ a 

integrals are continuous functions of the finite limit x. For 
| f {x)dx= \ f(x)dx+if (a;) dx, 

J -oo . ~oo . a 

where a < x; and since f f{x)dx is a continuous function of x, so also is 

J a 
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r 00 rx 

When the integral / {x) dx exists, the integral f (x) dx is con- 
J a J a 

tinuous for all values of x in the interval (a, oo), including x = oo, as it 
is there continuous in the extended sense of the term, when the point oo 
is regarded as belonging to the domain. 

If the integral j / ( x ) dx exist for every finite value of z in the interval 

ifl 

(a, oo), and if <f> (x) be a function which is finite and continuous for every 
such value of x, and be such that 

<f> (x) - <f> (a) - f f (x) dx , 

J a 

then, provided <f> {x) be continuous for x oo, the function f (x) is inte¬ 
rs 

grable in ( a , oo), and </> (oo) — </> (a) =- I f (x) dx. If the function <f> (x) 

J a 

have a derivative, say D+<f> (#), which is integrable in every interval (a, x) 
of (a, oo); then if the integral be a proper one, or be such an improper one 
that the relation 

<f> (x) - (j) (a) — I D + (f> (:r) dx 
J a 

subsists, then, provided the limit (f> (oo) exist, we have also 
<f> (oo) — <f> (a) — f (x) dx. 

J a 

A similar statement applies to each of the other derivatives of <f> (x). In 
the case in which <f> (a?) — </> (a) differs from f D + f (#) dx by an integrable 

J a 

null-function, this holds also for the limit x = oo. 

358. An integral / (#) dx is said to be absolutely convergent when the 

J a 

fOC 

integral \f(x)\dx exists; otherwise it is said to be conditionally or non - 

’ fit 

absolutely convergent. If j \f(x)\dx exists, then also [ f (x) dx exists; 

•“ .'o 

rx t rx, 

for / (x) dx s | | / (x) | dx, and hence the convergence of the latter 

Xi ( . X t 

integral follows from that of the former one. 

U* f (*) andf (x) <t> ( x) be both integrable in every interval (a, x), contained 

f QO 

/ (x) dx be absolutely convergent, and if <f> (x) be, from 
and after some fixed value of x, numerically less than some fixed number, 
then the integral J f (a?) <f> (a?) dx exists, and is absolutely convergent . 

• Riemann’s Ges. Wtrke, p. 229; also Pringsheim, Math. AnncUen, vol. xxxvn (1890), p. 691. 
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r 00 

For, since f (x) dx is absolutely convergent, we can find, corre- 

a 

sponding to a fixed positive number a, a number £ > a, such that 

ri+A 

| / (x) | dx< a, 


.'f 


for all positive values of h ; we have then 


rt+h 


f (x) <f> (x) dx 


rt+h 


K \f(x)\dxz Kg , 


• £ 


where K is the upper boundary of | <j> (x) |, and is by hypothesis finite. 

r* 

It is thus seen that | / (x) <f> (x) is convergent. Also since 

. a 

rzh r £-f -h 

| f (x) <t> (x) I dx s K\ I / (x) I dx :S Xct, 

•'( f 

we see that the convergence is absolute. 

359. An important set of tests of the absolute convergence of an 

integral / (x) dx is the following : 

a 

If f (x) be integrable in every interval (a, x), then f (x) dx converges to 

■ a 

a definite finite value , provided f (#) converge to zero , as x is increased in¬ 
definitely, in such a manner that one of the expressions 
f (x) .x 1+ *, / (x) x (log x) 1+fc , 

/ (x) x log x (log log x ) l + k ,. f (x) x log x.log log x . (log log ... log :r) 1+fc , 

converges to zero , as x is indefinitely increased, k denoting some fixed number 
greater than zero . 

f oo 

/ (x) dx is not convergent , in case f (x) be of invariable 

. a 

sign , from and after some fixed value of x y and provided also any one of the 
above expressions remains numerically greater than some fixed number (> 0), 
as x is increased indefinitely, when k has the value zero. 

rx+h 

We see that, in the first case, J \f(x)\dx is numerically less than 


one of the expressions 


rX+h fa r 

.X * l+ *’ .1 


X + h 


dx 


c 


*+ h dx 

x x log x (log log x) l+k ’ * * # * 


Jx x (log a) 1 " 1 ** ’ 

where C is a constant, dependent on X, which converges to zero as X is 
indefinitely increased. These expressions have the values 

q\ 

k 


" 1 1 

c r i 

1 

[ 2 ;* (X + h) k \ 

’ k [(log 2> " 

' (log X + h)\ 


c r 1 _ 1 

k [(log log X)* (log log X 4 - h) k 


j •• •» 
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hence, k being positive, it is clear that X may be so chosen that 

f X+A 

\f[x)\dx 

X 

is less than an arbitrarily fixed number, and thus 

f \f(*)\dx 
J a 

is convergent. In the second case, k being now zero, we see that 

rx+h 

f i x ) dx 
Jx 

is numerically greater than one of the expressions 

q t x + h dx q dx g rx+h dx 

Jjf X ’ .'x xlogx’ X X log x log log X * ‘ * * 

or than one of 

n , X 4 h „ log (X + h) n . log (X + h) 

61 °g ~x , 61 °g - \ogX ’ Cloglog logX 

and these expressions increase indefinitely, as h is increased. It follows that 

I f (x) dx is in this case divergent. 

J a 

CHANGE OF THE VARIABLE IN A SINGLE INTEGRAL 

360. Let f (x) be a function that is bounded in the interval (a, b). We 
now assume that x is a continuous monotone function, p (£), of another 
variable defined for an interval (a, ft), of f, and such that a = p (a), 
b = p (ft). The following theorem will be established: 

If Dp (£), one of the derivatives of the non-diminishing monotone function 
p (f), be integrable (E) in the interval (a, ft), then 

\f(z)dx= \*f{P (€)}Bp (£) d£, and f / (x) dx = f / {P (()} Dp (f) d£. 

J a j a 2.0. 2 a 

Let a system of nets {Z) n } be fitted on to the interval (a, ft). Since the 


incrementary ratio 


* (fi> - 4 > &) 


has, in any interval, the same upper 


£i £2 

and lower boundaries as Dp (£) in the same interval (see § 280), to a mesh 
8 (n) of D n there corresponds a mesh 8 (n) , of a net D n , fitted on to (a, 6), 
such that 8 (n) & &8 (n) , where k is the upper boundary of Dp (£) in (a, ft). 
If u {n) denote the upper boundary of f {p (f)} Dp (f) in the interval $ <n> , 
we see that u (n) lies in the interval bounded by U (8 (w) ) u (n) , and U (8 (w) ) Z (n) , 
where u in \ l (n) are the upper, and the lower, boundary of Dp (f) in the 
mesh 8 (n) . Also 8 (w) lies in the interval bounded by u (n) § (fl) and l in) $ <w) ; 
we may therefore write 

8J B> - §J B > - *m <B) )}, where 0 S 0 m <"> S 1. 
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We have then £8 (n) Z7 (8 (n) ) — SS (n) u (n) 

= 28 (n) [{w (n) - 0 (w) (i u <"> - Z<*>)} U (8 (n >) - u (n) ] 

= £8< w > {u {n) U (8< n >) - u< n >) - E (n) (u< n) - Z<”>) U (8<*>); 

the summations being taken for the m n meshes of the net D n , or of the 
net D n . We shall assume that f(x)z 0 in (a, b ); by the theorem of § 346 
this involves no loss of generality, for the constant G can be so chosen that 
/ (x) 4- C s 0. We then have 

0 s £8 (w) {u M U (8<*>) - w<">} £ £8 (n) £7 (8< n) ) (u (n) - Z<">) <; Z72g< n > (w (n > - Z (n) ), 
where U is the upper boundary of f(x) in (a, 6); we have also 
0 £ $(n) ( U M _ /(n>) jj (§(«>) g ££<«>£/ (§<«>) - Z<*>) <; UZ,8 (n >(uM - Z<*>). 

Since Z)<// (£) is integrable (2?) in (a, fi), it follows that S8 (n) (u {n) — Z (n) ) 
is arbitrarily small, when n is sufficiently increased; it thus appears that 
|28<">Z7(8<">) - 

is arbitrarily small. Therefore as n ~ oo, the two sums 

tn » ra* m - m« 

S 8 m <n) U (8 m (n) ), 2 

//< - 1 r/< *-■ 1 

converge to the same limit. Thus the theorem is proved for the case of 
the upper integrals; for the lower integrals it can be proved in a similar 
manner. 

We deduce at once the following theorem for the transformation of 
a single integral: 

If Dip (£), one of the derivatives of the monotone function ip (£), be inte¬ 
grable (E), in the interval (a, /9), of and if either of the E-integrals 

i b f(z)dx } I'fwmDtiOdt 

J a J a 

exists , the other also exists , and they have the same 'value; f (x) being a 
bounded function. 

Since ip (£) has a finite differential coefficient <// (£), almost everywhere 
in the interval (a, /?), and ip' (£) is integrable (E) in the interval (see § 345), 
provided one of the derivatives Dip (£) is integrable ( E ), we have 

[ f (x) dx = f J{4> (f)} Ip' (f ) d$, 

-'a J a 

provided either of the integrals is known to exist as an 22-integral. 

In case ip (£), although not monotone, is such that it is monotone in 
each of a finite number of intervals («> yi), (y l9 y 2 ), ... (y n > £), where 
y 1 , y 2 , ... do not necessarily all lie between a and £, these intervals of f 
may be considered separately, and the above result can be consequently 
extended to such a case. 
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The following theorem* for the transformation of a single j?-integral 
is a particular case of the above theorem: 

If the function f (x) is integrable (R) in (a, 6), and if a monotone function 
iff (£) is defined by x = ip (£) = c + J <p {£) d£, where ip (£) varies from a to p 
as x varies from a to b, and (p (£) has an R-integral in (a, ft), then 


[ b f(x) dz-Fftf (£)} + (€) dS. 

J a la 


361. It may happen that, when ip (£) is a monotone function of the 
interval (a, b), of x, corresponds to the infinite interval (a, oo), of £. We 
now assume that ip (oo) is the limit of ip (£), as £ ~ oo, i.e. that ip (f) is 
continuous at oo. If the conditions of the first theorem above are satisfied 
for every interval (a, b - e), of x , with the corresponding interval (a, fi') f 
of we have 


[ / (x) dx = f {ip (f)} Dip (f) d£. 

J a J a 


Since this holds for every value of e, we have, on proceeding to the limit 
e = 0 ’ f/ (*) dx = ["/{* (£)} D* tf) d€, 

J a J a 

in accordance with the definition in § 355, of the integral on the right- 
hand side. Similar considerations apply to the case in which the value 
a of a; corresponds to £ = — oo. 

[b 

If it be desired to transform the integral I f(x)dx , by means of the 

J a 

relation £ = <p (x), where <p (a;) is a single-valued function of x , then, unless 
<p (x) be monotone in the interval (a, 6), the inverse function ip (f) will 
not be everywhere single-valued. 

If it be assumed that (p (x) is monotone in (a, 6), and that a — <p (a), 
p — <p (b), a derivative Dtp ( x ), of <p (x), is reciprocal to a derivative Dip (£) 

of \p (£). If it be assumed that is integrable in (a, P) } and that the 


same holds for 


/(*) 


D<p (x) 

considered as a function of f, or else that / ( x) is 


Df> ( x ) 

integrable in (a, 6), then we may use the transformation 


j a f(x) dx = [ e 






df. 


rb 

If ^ (a:) be not monotone, f{x)dx cannot in general be transformed 

• a 

into a single integral in y. If, for example, <p (x) increases from x = a to 
x = k y and then diminishes from x = lc to x = 6, we must take 

[ f (x) dx — f* ( ’ {iM l df + [* j^U df, 

•'« •« (X)<^ (*))(-*(!) .'*(*) (*))*-♦<*) 

* Lebesgue, Annales de Toulouse, (3), vol. i (1909), p. 44. 
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and the integrals on the right-hand side cannot in general be combined 
into one integral through the interval (a, /?), because in the two integrals 
the integrand has different values for the same value of y. 


Thus, for example, if y — sin x, 



f r/<£>u+[•[/<*> | 

J 0 |_COS X J J 1 |_cos x_ 


dy 


j' 1 /(si n- 1 y) 
Jo 



f (| + sin- 1 yj 

vr~y* 


dy, 


the value of cos x in the second integral on the right-hand side of the 
first equation being negative, and the value of sin^ 1 y being in the 
interval (0, r). 


REPEATED INTEGRALS 

362. The actual evaluation of a double integral over the fundamental 
rectangle, of which the sides are x = x 0 , x = x x , y = y 0 , y = y x , is usually 
made to depend upon the evaluation of successive single integrals taken, 
first with respect to one of the variables, and then with respect to the 
other. The expression 

[Zi rvt i*a ri/» 

\ dx If (x, y) dy , or / (x, y) dydx, 

in which / (x, y) is supposed to be integrated first with respect to y , for 
a constant value of x, and then with respect to x, is called a repeated 
integral. Similarly, the expression 

rvi [x, 

dy f (x, y) dx, 

J Vo Jx 9 

in which the integrations are performed in the reverse order, is called a 
repeated integral. The question of the existence of these repeated integrals, 
and, in any given case, their relation with one another, and with the 
double integral, will be here investigated. It will be observed that the 
double integral has been defined as a single limit; whereas the repeated 
integrals, when they exist, are each obtained as the results of repeated 
limits. We have then to investigate whether, or under what conditions, 
a double integral is capable of representation as a repeated limit, of one 
of the forms indicated. It cannot be assumed a priori that the existence 
of the double integral necessarily implies the existence, for each value 
of x , of the single integral 

T'/ (*. V) dy 
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as a definite number. Neither is the existence of this single integral, as a 
definite number, necessary for the existence, as a definite number, of the 

repeated integral r Tl r Vi 

dx \ f (x, y) dy. 

• *o J Vo 

In fact, if we assume that the upper and lower integrals 

Jvi [V i 

f(x>y)dy , f{x,y)dy 

J Vo Jjh 

have different values for some of the values of x, it may happen that the 
two repeated limits 

r* i 7vx fv i 

dx\ f(x,y) dy , dx I / (a?, y) dy 

J *o * l/o J *o Jj/o 

have identical values. 


The repeated integral will consequently be regarded, in this case, as 
existing; and thus it may be defined as 

r-Ti 7vx 

dx f (x, y) dy , 

J X 0 JjJ o 

where the upper or lower integral with respect to y is to be taken in¬ 
differently, provided the repeated limit exists as a definite number. 

rv i 7*i 

In a similar manner I dy f (x, y) dx , 

J y 9 

when it has a definite value independent of whether the upper or the 
lower integral with respect to x be used, will be regarded as the repeated 
integral, first with respect to x and then with respect to y. 


363. It was first established by P. Du Bois-Reymond* that, when the 
bounded function / (x, y) has a double ^-integral in the fundamental 
rectangle, then the two repeated integrals exist, and are each equal to 
the double integral. We shall first give a proof| of this theorem \vhich 
exhibits its relation with the theory of sets of points. 

The following preliminary theorem will first be established: 

If j f (x, y) d (x, y) y taken over the fundamental rectangle , have a definite 

value, then the values of x for which the single integral j f (x, y) dy, taken 

with x constant , has a definite value t define a set of points on a side of the 
rectangle, of linear measure equal to the length of that side . 

* CreUe's Journal , vol. xciv (1883), p. 277. 

f The investigation is founded on that of Sohoenflies, Bericht , vol. i, p. 193. 
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It follows from this theorem, that the set is everywhere dense in the 
interval, and of cardinal number c. Moreover, the points at which 

7vx [V i 

f(x,y)dy , f(x,y)dy 

J Vo {Vo 

differ from one another form a set of measure zero. 

On the assumption of the existence of the double integral, the set K 
of all the points at which the saltus of / ( x , y) is k\ where k is an arbi¬ 
trarily chosen positive number, is a closed set, of plane content zero. If 
a straight line be drawn parallel to the y-axis through the point x of the 
side y — y 0 , of the rectangle, then the component of K on this straight 
line will be denoted by K x , and its linear content by I (K x ). It has been 
shewn in § 143 that, a denoting a prescribed positive number, the linear 
content of that set of points x, on the side y — y {) , of the rectangle, for 
which I (K x ) a, is zero ; and thus that / ( K x ), considered as a function 
of x y is an integrable null-function, for each value of k. The function 
X (x), — lim I (K x ), is also an integrable null-function; for the set of points 

at which y (x) does not vanish is made up of those sets of points at which 
1 (K x il) ), I (K x i2) ), ... / (K x in) ), ... do not vanish; where 

K^yK^y ... K<">, ... 

correspond to a diminishing sequence of values of k converging to the 
limit zero: and since each of these sets has zero measure, it follows that 
the set of points at which y (r) does not vanish has zero measure. At any 
point x X y at which y (r) vanishes, / (K Xl ) vanishes, for every value of k. 

It should be observed that, at a point of K X y it is not necessarily the 
case that / (x, i/), considered as a function of y, with x constant, has its 
saltus ^ k ; in fact, this saltus may be less than k , or may be zero. However, 
all the points at which the saltus of / (x, y), taken with x constant, is 
$ ky are certainly included in the set K x . 

For any fixed value of x, the upper and lower integrals 
r / (*, y) dy , P / (x, y) dy 

• Vo ±Vo 

both exist, and the two have equal values at any point of the everywhere 
dense set of points x t of linear measure x t — x 0 , at which y ( x ) vanishes. 
The preliminary theorem above stated has thus been established. 
rv i 

Let F ( x ) denote / ( x , y) dy , where F (#) consequently has a single 

- Vo 

determinate value at each point x t at which 

[ V 'f(x,y)dy, f V ‘ f (x, y) dy 

j Vo ijjo 



[CH. VI 


512 The Riemann Integral 

are equal. At any point of that set, of measure zero, at which 
l V 'f(x,y)dy, \ V 'f(x,y)dy 

J Vo bio 

have different values, F (x) is regarded as indeterminate; and the upper 
and lower integrals are the upper and lower limits of indeterminancy. It 
will now be shewn that the function F (rr), so defined, is an integrable 
function. 

Let x ' be a point on the side y = y 0 , of the rectangle (x 0 , y 0 ; x l , y x ), 
such that the component K x >, of K , on the line x — x\ has content < a. 
A finite number of intervals c*, c 2 , ... € m can be determined on the straight 
line x = x\ neither abutting on, nor overlapping, one another, such that 
their sum e* + e 2 4- ... -f e m > b — cr, where b is the length of the side of 
the rectangle parallel to the ?/-axis, and such also as to contain, in their 
interiors and at their ends, no points at which the saltus of / (#, y) is £ k. 
For each point of one of these intervals e there exists a rectangle, with 
the point at the centre, such that the fluctuation of / (x, y) in that rect¬ 
angle is < k. The breadths of these rectangles for all points of e must 
have a finite minimum, for otherwise there would exist a point of € which 
would belong to K. It follows that, for the point x', an interval (x' — a , 
x' + ft) can be determined, such that the straight lines x — x — a, 
x — z* + p intersect all the rectangles corresponding to all the points of 
the intervals e 1 , e 2i ... € m . If x ± , x 2 be any two points in the interval 
(x' — a, x' -f ft), we have 

| F (x x ) - F (x 2 ) \ < bk + a (U — L). 

Now a finite number of separate intervals , S 2 , ... 8 r can be deter¬ 
mined on the side y = y 0 , of the rectangle (length = a), such that 

-f- 8 2 * H~ 8 r > a — rj y 

where rj is a prescribed positive number, and such that each point of each 
of the intervals 8 is a point x' y for which an interval (x' — a, x' -f ft) can 
be determined as above. By applying the Heine-Borel theorem we see that 

^,8 2t ...S r 

will all be covered by a finite number of these intervals (x' — a, *' + /*). 
It thus appears that the side y = y 0 , of the rectangle, can be divided into 
a finite number of parts 

T 1 f t 2 > ' * • T j> > 

and > ^2 > • * • \ y 

suoh that Tj + r 2 + ... + t v > a — rj 9 

and Ai + A* -f ... -f- A Q <c rj, 

so that the fluctuation of F {x) in any one of the parts r is 

< bk + a(U- L). 
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Let k m c/26, cr =- e/2 (U — L)\ 

then we see that F ( x) is such that the side y - y 0 , of the fundamental 
rectangle, can be divided into a finite number of parts, such that the 
sum of those parts, in which the fluctuation of F (x) is ?j e, is less than the 
arbitrarily chosen number rj. It follows that F ( x) is integrable along the 
side of the rectangle. 

It has now been shewn, on the assumption of the existence of the 
double integral, that the repeated integral 

[X I rr I Ti/i 

F (x) dx, or dx | / (x, y) dy , 

J So • J/o 

taken over the fundamental rectangle, has a definite value. Moreover, 
this value is equal to that of the double integral. For, let the fundamental 
rectangle be divided up by means of straight lines parallel to the y-axis, 
through the end-points of each interval of the two finite sets {r} and {A}. 
Any one of the rectangles so constructed, with r as base and with height b , 
can be divided into parts, by means of straight lines parallel to the x-axis, 
such that, in each one of a number of these parts the sum of whose heights 
is > b — cr, the fluctuation of / (x, y) is < k. The fundamental rectangle 
has now been divided into a finite number of parts, such that the sum of 
the products of'each part, multiplied by the upper boundary of / (x, y) 
in that part, exceeds the sum of the products of each part, multiplied by 
the lower boundary of the function in that part, by less than 
abk j (aor + biq) (U — L), 
which is arbitrarily small. 

Also f ' dx \ f ( x , y) dy, and I f (x, y) d (x, y), 

J Xo {jjo J 

both lie between the two sums of products, and therefore differ from one 
another by less than abk + (aa 4 - 677 ) (U — L). The equality of the double 
integral and the repeated integral is thus put in evidence by the mode of 
sub-division of the rectangle which has been adopted. 

Similar reasoning applies to the repeated integral in which the inte¬ 
gration is taken first with respect to x, and then with respect to y . 

It has thus been established that, if the double integral through the 
fundamental rectangle exist , then the two repeated- integrals also exist , and 
are each equal to the double integral. 

All the points at which y (x) vanishes are points of continuity of the 
function F ( x); but there may also be other points at which F (x) is con¬ 
tinuous ; because the existence of a saltus of / (x, y) at a point (x, y) is 
consistent with / (x, y) being continuous with respect to x, and also with 
respect to y , at the point. 




The function F (x) may be replaced by ip (#), the most nearly con¬ 
tinuous function related to it (see § 242). We thus have 

f f ( x , V ) d (x, y) = 4> (x) dx. 

If it be assumed that the set of points G, for which f (x, y) is defined, 
is measurable in accordance with .Jordan’s definition of a measurable set 
(see § 142), then the double integral of the bounded function may be re¬ 
placed by that of the function / ( x , y), defined for all points of a rectangle 
which contains G , by the convention that / (x, y) shall vanish at all those 
points of the rectangle which do not belong to G. If the set G be such 
that each straight line parallel to the y-axis contains points of G which 
fill up a finite, or an indefinitely great number of continuous intervals, or 
more generally, if the set of such points, for each value of x y be linearly 
measurable (</), then the integral 

f f ( x , y) dy, 

taken along the whole segment of the line between the sides of the rect¬ 
angle, may be replaced by the same integral taken through the component 
of G on the same segment. In particular, if the points of G on the straight 
line through the point x consist of all the points in the linear interval 

(/lW,/2(4 

pi 

we may replace ( f (x,y) dy 

• Vo 

rftU) 

by f(*,y)dy; 

•'/» (*) 

and therefore in this case, 


ro rfa(x) 

f (*, V) d (x, y) — dx f (x, y) dy. 

O 's e Jf x {.r) 


364. A simple proof* of the fundamental theorem of § 363 will be 
given, which depends upon the fact that, for any bounded function / (x f y), 
if the operation of taking the upper integral first with respect to y, and 
then with respect to x, be performed, the result cannot exceed the upper 
double integral; and that, similarly, the result of successively taking the 


* This method of proof was first employed by Harnack; see his edition of Serret’s Differential 
and Integral Calculus , p. 282; also Math. Annalen , vol. xxvi (1886), p. 566. Other proofs of this 
kind have been given by Arzelit, Mem. dell' 1st. di Bologna (5), vol. ii (1891), p. 133; by Jordan, 
Liouville'8 Journal (4), vol. vni (1892), p. 84, or Cours d'Analyse, vol. i, p. 42; also by Pringsheim, 
Sitzungsberichte d. Miinch. Akad. vol. xxvm (1898), p. 59, and vol. xxix (1899), p. 39; the first 
of these contains a history of the literature of the theorem. See also Pierpont’s paper “ On multiple 
integrals,” Trans. Amer. Math. Soc. vol. vi (1905), where a proof of this character for multiple 
integrals is given. 
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lower integral with respect to x and to y cannot be less than the lower 
double integral: thus 

|/ (x, y) d (x, y) < j dx j f (x, yjdy&jdxjf (x, y) dy 

- (/ (x, y) d (x, y), 

the integrals being all taken over the fundamental rectangle. 

If / (x, y) be integrable, so that 

I f (x, y) d (x, y) — [/ (x, y) d (x, y), 

it follows that J dx J f (x, y) dy ■■= j dx j f (x, y) dy 

= J dx jf (x, y) dy = j dx J/ (x, y) dy, 
and thus that the repeated integral 

J dx j f (x, y) dy 

has a definite value equal to the double integral. 

To prove this, let the rectangle be divided into a number of parts 8, 
by means of straight lines parallel to the sides. Since the double integral 
is assumed to exist, this may be done in such a manner that, e denoting 
an arbitrarily chosen positive number, the conditions 

J f (x> y) d (x, y) - e s E {S L (8)} s E (817 (8)} s- J f (x, y) d (x, y) + e 

are satisfied, where the summation S is taken for all the rectangles 8, 
and U (8), L (8) denote the upper and lower boundaries of / ( x , y) in a 
rectangle 8. Now, if we take the upper and lower integrals of / ( x , y) along 
a straight line parallel to the y- axis, we have 

~jf ( x < y) dy a {8' U (8)}, 

[f(x, y)dy*'L 1 {8' L (8)}, 

where the summation refers to all those rectangles 8 which are inter¬ 
sected by the straight line along which the upper and lower integrals are 
taken; and in the case when that straight line is along one or more 
boundaries of the rectangles 8, 2 refers to all the rectangles on one side 


33-2 
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of that line: also 8' denotes that interval along the line of integration 
which is in the rectangle 8. It follows that 

jdxjf(x, y)dy sS {8t7 (8)} s j / (x, y)d(x, y) + « 

and j dx jf (x, y) dy 2 E {8 L (8)} s f f (x,y)d (x, y) - e; 

and since these inequalities hold for every value of e, we have 
J dx j f (x, y) dy s j f (x, y) d (x, y), 

jdxjf(x,y)dy is j/ (x, y) d (x, y); 
and thus the theorem is established. 

365. The converse questions now arise whether, from the existence of 
one of the repeated integrals, or from the existence and equality of both 
repeated integrals, that of the double integral can be inferred. The answer 
to both questions must be in the negative. Continuity of a function / (x , y) 
with respect to x and y separately does not necessarily imply continuity 
with respect to ( x , y) ; moreover, the saltus of the function at a point with 
respect to x , when y has a constant value, or with respect to y, when x 
has a constant value, is not necessarily equal to the saltus of the function 
with respect to ( x , y). It may happen that the component of K on a 
straight line parallel to one of the axes may consist of points, some or all 
of which are points of continuity of the function when considered as a 
function of one variable on that straight line. Thus K may have a plane 
content greater than zero; and yet the linear content of the points on all 
straight lines parallel to the axes, at which the linear saltus of the function 
is ^ k, may be zero. Hence either, or both, of the repeated integrals may 
exist, whilst for values of Jc , the sets K are not of content zero*; and 
therefore whilst f (x, y) does not admit of a double jR-integral. 

It was shewn| by W. H. Young that the existence of the .R-integrals 
f ( x > y) dx, for values of y such that 0 £ y £ 1, and the existence of 
° i 

the jR-integral / (z, y) dy , for all values of x such that 0 £ x & 1, the 
Jo 

function f(x,y) being assumed to be bounded in square (0,0; 1,1), 

* An incorrect theorem, relating to this point, has been given by Schoenflies, Bee his Bericht, 
vol. i, p. 197. In this theorem, the condition that K should be closed is stated to be the condition 
for the existence of the double integral. If, however, K were not closed, it could not represent 
the set of points at which any function had a saltus ^ Jc. The examples given by Schoenflies do 
not in reality accord with his theorem. 

t Monatshefte fiir Math. u. PhysiJc , vol. xxi (1910), p. 127. 
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necessitate the existence of the two repeated integrals. It was further 
shewn* * * § by Lichtenstein that, subject to the same conditions, the two 
repeated integrals exist and have the same value. A further investigation 
is thatf of D. C. Gillespie % The case of multiple integrals has been treated^ 
by Ettlinger. 

EXAMPLES 


1§. For the rectangle bounded by x — 0, x = 1, y — 0, y = 1, let f (x, y) = 1, for all 
rational values of x , and / ( x , y) = 2 y, for all irrational values of x. We have then 

v) d y = h 

whatever value x may have; and hence the repeated integral 


( dx [ f{x, 
Jo Jo 


has the value 1: but the double integral does not exist, since I (K) > 0, for any value of k 
in the interval (0, 1). 

2||. Let x be represented by a finite or infinite decimal, excluding those decimals in 
which every figure from and after some fixed place is 9. Let p x denote the number of 
decimal places in the representation of x in the manner described. Let y be represented 
in a similar manner, with a corresponding definition of p y . Let the function / (x, y) be 

defined in the rectangle bounded by x = 0, x = 1, y = 0, y = 1, by /(x, y) = - -\ - 

when p x and p v are both finite; otherwise let / (x, y) — 0. 


We have 


(' 1 
J o Py + 1 


. dy = 0; for there is only a finite number of values of y, in (0, 1), for 


which p y is less than an arbitrarily chosen fixed integer, or-is greater than an arbi- 

Vy A 

trarily chosen fixed proper fraction. The function / (x, y) vanishes, except when one at 
least of x and y is representable by a finite decimal; and thus the double integral 

j f (x, y) d (x, y) vanishes. 

71 1 n 

Now I /(x,y)dy= f {x, y) dy => 0; 

Jo Vx 1 2 .0 

and thus / / (x, y) dy has no definite value for any value x of the everywhere dense 

Jo 

enumerable set of points for which p x is finite. 


Nevertheless 


/o * /o ^ x ' y) dy ” 0 y ) d (*> y)- 


3||. With the same notation as in the last example, let /(x, y) == 0, when p x , p y are 

both finite, or both infinite; let f (x, y) == —, when p x is finite, and p y infinite; and 

^ Vx 

f (x, y) — j-j-— , when p y is finite, and p x is infinite. In this case / (x, y) differs from 0 at 


* Gdttinger Nachrichten, 1910, p. 468; also Sitzber. Berl. math. Oes. vol. x (1911), p. 55. 

t Annals of Math. (2), vol. xx (1919), p. 224. 

J Amer. Journal of Math. vol. xlvu (1926), p. 215. 

§ Thomae, Schlomilch's Zeitschrift , vol. xxni (1878), p. 67. 
j| Pringsheim, Sitzungsber. d. Leipziger Akad. vol. xxvm (1898), p. 71. 
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an unenumerable set of points; and yet the set of points at which / (x, y) > * has the plane 
content zero, since all such points are on a finite number of lines parallel to the coordinate 
axis, although they are everywhere dense on those lines. The double integral, and conse¬ 
quently the repeated integrals, exist in this case. 

4*. An example has been given in Ex. 1, § 143, of a set of points K which is everywhere 
dense and unclosed, whereas the sets K x , K y are all finite. Let /(x, y) = c', at every point 
of K y and = c, at every other point. In this case, the double integral does not exist; but 

f f(x,y)dx - c, [ f (x, y) dy - c, 

J o jo 

whatever values y and x may have in the first, and in the second, integral respectively. 
Consequently f dx ( f (x, y) dy and ( dy ( f (x, y) dx 

J o J o J o J o 

both exist, and have the same value c. 


5*. Let a set {(x', y')} be defined as follows:—Let x' have any value for which p x > is 
finite; and with such a fixed value of x', let every y' be taken for which p y > Sp x >. On 
every line parallel to the y-axis there is only a finite number of points of the set; but the 
set is everywhere dense on every line which is parallel to the x-axis, and which has for its 
ordinate one of the y'. Let / (x, y) — c', for the set {(x', y')}, and let / (x, y) ■= c, for all 
remaining points. 

We have, in this case. 


( f (*, V) dy = c, and f dx f f (x, y) dy == c. 

Jo yu Jo 

But / / (x, y') dx has c and c' for its upper and lower values; and the set of values 

0 r n 

of y' being everywhere dense, j dy J f (x, y) dx does not exist. 


6f. Let / (x, y) be defined at all points of the rectangle bounded by x = 0, x = 1, y =-~ 0, 
y — 1, by the condition that / (x, y) = 0, except at those points (x', y') at which 


2 m 4 - 1 
2n~ ’ 


2 p 4 I 

’ 


where f(x',y') = ~, 


m, n, p and q being positive integers. 

In this case the double integral exists, and therefore the repeated integrals both existf. 


IMPROPER DOUBLE INTEGRALS 

366 . As in the case of single integrals, the definition of a double integral 
may be extended to the case in which the function has a set of points of 
infinite discontinuity. This set is necessarily closed, and it will be assumed 
throughout that its plane content is zero. It will also be assumed that 
the domain for which such a function is defined is bounded, and that the 

♦ Pringsheim, Sitzungsber. d. Leipziger Akad. vol. xxvm (1898), p. 71. 

f Du Bois-Reymond, Crdlc's Journaly vol. xciv, p. 278; also Stolz, Orundzilge , vol.ni (1899) 
p. 73. 

J This is denied by Stolz, Orundziijge, vol. in, p. 88, on the ground that/(x, y) for x = 2 ~ tJ \ 

1 ** 
is not integrable with respect to y, but has ^ and 0 for its upper and lower values. We have, 

however, shewn that this is no justification for denying the existence of the repeated integral. 
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frontier has the content zero, the domain being therefore measurable in 
accordance with Jordan’s definition; and consequently the function may 
be replaced by another function defined for all the points in a fundamental 
rectangle, the new function being taken to vanish at all points not in the 
original domain, and to have the same value as the original function at 
all points of that domain. These assumptions being made, a definition of 
the improper double integral which is substantially the one given by 
Jordan*, and adopted by Stolzf, may be stated as follows: 

Let Dj, I) 2 , ... D n , denote a sequence of domains contained in the 
fundamental rectangle, each one of which is contained in the next and consists 
of a closed set with its frontier of content zero. Further , let us suppose that 
none of these domains contain , in their interiors or on their frontiers, any 
point at which f(x, y) has an infinite discontinuity, and that the sequence is 
such that the measure of I) n as n ~ oo, converges to that of the fundamental 
rectangle; then if the upper integrals 

f f (%, V) d (x, y), I f(x,y)d (x,y), ... I f (x, y) d (x, y), ..., 

Jl), JDn 

taken over the domains D 1 , D 2 , , converge to a definite limit, independent 

of the particular sequence {D n } chosen, this limit is defined to be the improper 
upper integral — 

I f (x- y) d (x, y) 

of f (x, y) in the given domain. A similar statement applies to the case of the 
improper lower integral. When the improper upper and lower integrals both 
exist, and have the same value , then the improper integral 

[f (x, y) d (x, y) 

over the given domain is said to exist, and to have this common value. 

It will be observed that the domains D n are all measurable in accord¬ 
ance with Jordan’s definition of a measurable set, and therefore also in 
accordance with the definition of Borel and Lebesgue. 

In case the function / (x, y) be integrable (B) in all the domains 
D 1 , Z> 2 , ..., however this sequence may be chosen, subject to the con¬ 
ditions stated above, then, if the sequence 

f(x,y)d(x,y), I f (x, y) d {x, y), ... 

'D , .'jD, 

converge to a definite limit, independent of the particular sequence {D n }, 
that limit defines the improper double integral 

jf(x, y)d(x, y). 

* Court d'Analyte, vol. n, 2nd ed. (1894), p. 76. 


t Orundzilge, vol. m (1899), p. 124. 
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If a function / (x, y) have an improper integral in the fundamental 
rectangle, then / ( x , y) has a proper /^-integral in any closed domain 
of which the frontier has measure zero, and which is contained in the 
fundamental rectangle, but itself contains no points in its interior, or on 
its frontier, at which / (x y y) is infinitely discontinuous. For, if D be such 
a domain, we may choose a sequence {Z) n }, such as is employed in the 
definition given above, and such that D is interior to D n , for every value 
of n . 

Since ( / (x, y) d (x, y) - ( / (x, y) d (x, y) 

jd Ip 

cannot exceed / / (x, y) d (x y y) — \ f (x, y) d (x, y) 

J D n JDn 

whatever value n may have, and since this latter expression converges 
to zero, as n ~ oo, it follows that 

f / (*. V) d (pc, y) - [ / (x, y) d (x, y) = 0, 

• ip 

or that | / (x y y) d (x y y) exists. 

J D 

It has thus been shewn that if f (x, y) have an improper double integral 
in the fundamental rectangle , it must possess a proper R-integral in any 
closed domain interior to that rectangle , such that the domain has its frontier 
of zero measure , and contains no points of infinite discontinuity of the 
f unction , either in its interior or on its frontier. 

In particular / (x, y) has a proper i?-integral over each of the domains 


367. The necessary and sufficient condition for the, existence of the im¬ 
proper upper double integral 

\f(x, y)d(x, y) 

is that , corresponding to any arbitrarily chosen positive number c, another 
positive number 8 can be determined , such that , if A be any closed domain 
whatever , of which the frontier has measure zero , and which is contained in 
the fundamental rectangle , but itself contains no point of infinite discontinuity 
of f (x y y) y either in its interior or on its frontier, then , provided the measure 
of A is < 8, the condition 


taken over A, is satisfied. 


1 jf(x,y)d (x, y) 


< <, 


A similar theorem applies to the improper lower double integral. 
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To shew that the condition stated in the theorem is sufficient, let 
D, D' be two domains of the kind specified in § 366, such that m (D), m (D') 
both differ from the area of the fundamental rectangle by less than 8; 
they are both interior to the fundamental rectangle, and contain none of 
the points of infinite discontinuity of the function. Let d be the set of 
points of D which do not belong to Z)' , and d' the set of points of D f 
which do not belong to D ; then 

m (d) < A — m (D f ) < 8, 
and rn ( d ') < A *— m (D) < 8, 

where A is the area of the fundamental rectangle. Also, since the domains 
I) -f d\ D' 4- d are identical, we have 

ho — In* = Id — Id'y 

where I denotes the upper double integral 

[/(*. y) d (x, y ) 

taken over the domain indicated by a suffix. It follows that 
I Id — In f | ^ | Id | + | Id' | < 2*; 
hence it is easily seen that any two sequences 

{In*}, {In'A 

both converge to one and the same definite limit. 

To shew that the condition stated in the theorem is necessary, let us 
suppose that it is not satisfied. We thus assume that a domain d, of 
arbitrarily small measure, can be found, such that | I d | > c. 

Let D be interior to the rectangle, and such that 

A — m ( D) < 8. 

Taking D to contain d, we then have 

m (D — d) > A — 28, 

provided m (d) <8. 

The two domains D, D — d both converge to A, if 8 be decreased inde- 
finitely; and = /„; 

thus | In — I n-d | > *, 

however small 8 may be; hence the limit does not exist in this case. 

The necessary and sufficient condition that the improper upper and lower 
integrals of f ( x , y) in the fundamental rectangle may both exist is that the 
improper upper integral of | / ( x , y) | may exist . 
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To shew that the condition stated is sufficient, we observe that, on the 
assumption of the existence of 

I \f(x,y) I d (x, y), 

it follows from the theorem established above that the upper integral of 

I / (*> y) I 

through a domain D of the type defined in § 366, tends to the limit zero, 
as m (D) does so. Also 

If / (*,y) d (x,y) , If f(x,y)d(x,y) 

I jd i ip 

are both less than, or equal to 

T | / (x, y) | d (x, y), 

J D 

as is easily seen. It follows that both the integrals 

f f (x,y)d(x,y), f f(x,y)d(x,y) 
jd ip 

converge to zero, as m (D) does so, and uniformly for all such domains D; 
and these are the sufficient conditions for the existence of 


f(x,y)d(x,y) 9 f (a?, y) d (x, y). 


To prove that the condition stated is a necessary one, let us assume 
that, for every domain D, satisfying the specified conditions, and such 
that m (D) < S, we have 

I f( x ,y)d(x,y) < €. 

I JD 

Now let / (x, y) = /+ (x, y) - f~ (x 9 y), 

where f+ (x, y) - / (x, y) 

at all points where / ( x , y) is positive, and everywhere else let 

f+ (x, y) = 0; 

also f~ (x, y) = ~ f (x, y) f 


at every point where / ( x , y) is negative, and everywhere else f~ (x, y) is 
zero. The domain D may be divided into a finite number of rectangles 8, 
some of which may lie partly outside D ; the functions being taken to be 
zero in all such outlying portions. Denoting by U& the upper boundary 
of a function in the rectangle 8, we have 

Ut {/(x, y)} = U s {/+ (x, y)}, 

in all elements 8 in which / ( x , y) has positive values; and in all other 


elements 


Ui{f + (x, y)} = 0. 
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The elements may be taken such that, if rj be an arbitrarily chosen number, 
the inequalities 

SSC/j {f(x, y)} - [ f(x,y)d(x, y )<r), 

28f/ { {/+ (x, y)} - 7 f+ (x, y) d (x, y) < v 

J D 

are both satisfied. These conditions are also satisfied for any domain 
contained in D. 

We have now 

f f + (x, V) d (x, y) s SSt/j {/+ ( x, y)} s X 8 l / 4 {/(x, y)} 

£ f (x,y)d ( x , i/) + v 
'a 

where Dj consists of the domain which is composed of those elements 8 , 
of D, in which / (z, y) has positive values. Since 77 is arbitrarily small, 
we have - 

| f+ (x,y)d(x,y) 1 * e. 

J D 

Again, since 

f f(x, y) d (x, y) = - [ - {/ (x, y)} d (x, y), 

we see that — {/ (x, y)} d (x } y) 

Jo 

has the limit zero, when m (D) has the limit zero; and from this it follows, 
as before, that ■- 

I t (*, y) d (x, y) s e. 

J D 

Since | / (*, y) | = / + (*, y) + f~ (x, y), 

we have \f (x,y)\d (x, y)s 2 f ; 

J D 

and therefore | / (x, y) | d ( x , y) 

has the limit zero, when m (D) converges to zero. It now appears, by 
employing the first theorem of this section, that 

f I f(x,y)\d (x, y), 

taken throughout the fundamental rectangle, has a definite value. 
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It should be observed that since 

[ I / (*> V) I d (x, y) s; [ | / (x, y) | d (x, y), 

the lower integral I | / (x, y) | d (x, y) 

J_d 

has the limit zero, whenever zero is the limit of 

j D \ f( x >y) I <*(*> y)> 

and that therefore j" \ f (x, y) \ d {x, y) 

always exists when f | / (x, y) | d ( x , y) 

does so, the integration being over the fundamental rectangle. 

368. It has been seen in § 366 that, in case / (.x , y) have an improper 
integral in the fundamental rectangle, it has a proper integral in any 
closed domain D contained in that rectangle, which has a frontier of zero 
measure, and contains no points of infinite discontinuity of the function, 
either interior to it, or on its frontier. It follows by the property (2) of 
§ 340, that | f (x, y) | is also integrable in the domain D\ and we have 
already seen that the existence of the improper upper integral of | / (x, y) | 
is a necessary consequence of the existence of the improper integral of 
f (x, y). It thus appears that the improper upper and lower integrals 
of | / (x, y) | must be identical, and therefore that, if f (x, y) be a f unction 
which, has an 9 improper double integral , in accordance with Jordan's de¬ 
finition, then | / (x, y) | has also an improper integral, so that every such 
improper integral is absolutely convergent. 

We have seen that Cauchy's definition of an improper single integral 
is applicable not only to the cases in which the convergence is absolute, 
but also to cases in which the convergence is not absolute. The same 
remark applies to Hamack's extension of Cauchy’s definition, which will 
be considered in Chapter vn. Jordan’s definition of an improper double 
integral is however much more stringent than Harnack’s definition of an 
improper single integral. In the latter case the integral is defined as the 
limit of the proper integral taken through a finite number of intervals, 
not chosen arbitrarily in any manner consistent with the condition that 
the sum of these intervals is to converge to the length of the interval of 
integration, but chosen so as to satisfy the special condition that they 
are complementary to a finite set of intervals which contain within them 
all the points of infinite discontinuity of the function, each interval of 
the finite set containing at least one such point. 
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If the proper integrals, of which the improper integral, in Hamack’s 
definition, is the limit, were not subjected to the above-mentioned re¬ 
striction, reasoning precisely similar to that applied above would shew 
that every improper single integral must be absolutely convergent. In 
order that a definition of the improper double integral should admit of 
the existence of double integrals which do not converge absolutely, it 
would be necessary to subject the domains D lf D 2 , ... D nt ... (the proper 
integrals through which define, as their limit, the double integral), to 
some restriction which would allow of the existence of a limit in cases in 
which such a limit does not otherwise exist, independently of the par¬ 
ticular set {D n } chosen. Such a restriction as to the nature of the domains 
D n would correspond to the restriction to a special class of sets of intervals, 
of the intervals through which the proper integrals in Cauchy’s or Harnack’s 
definition of an improper single integral are taken. The true extension of 
Hamack’s definition to the case of double integrals would be the following: 

Let the points of infinite discontinuity of f ( x , y) (the set of such points 
being of zero content) be enclosed in a finite set of rectangles with sides parallel 
to those of the fundamental rectangle , each rectangle of the finite set containing 
at least one point of infinite discontinuity , and no such point being on the 
frontier of the set of rectangles; and let D n denote the remaining part of the 
fundamental rectangle when the finite set of rectangles is removed. Then, if 
f (x, y) have a proper integral in every such domain D n , and if this proper 
integral converge to a definite limit when any sequence whatever of such 
domains D n is taken , such that the measure of D n converges to that of the 
fundamental rectangle , this limit shall define the improper double integral 
off (X, y). 

This extension of Cauchy’s definition would admit of the existence of 
non-absolutely convergent improper double integrals, as in the case of 
improper single integrals. With this definition, the theorems of § 367 
would no longer be valid. 

When it is asserted that non-absolutely convergent double integrals 
do not exist, the assertion must be taken to mean that such integrals do 
not exist in accordance with the definition of Jordan, and not that it is 
impossible to give definitions, such as the above extension of Harnack, in 
accordance with which double integrals would exist that do not converge 
absolutely. 

The properties of improper double integrals which are not necessarily 
absolutely convergent are more restricted than those which exist in 
accordance with Jordan’s definition, and it is consequently a matter of 
opinion whether, though the former certainly exist as limits, the name 
integral may be appropriately applied to such limits. 
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EXAMPLE 

If we take, as the domain of integration, the rectangle bounded by x — 0, x — a, y — 0, 
y — b , then the double integral of ^ sin ^ is not convergent, and therefore in accordance 

fa 1 1 

with Jordan’s definition does not exist; although the single integral / sin dx is non- 

Jo * * 

absolutely convergent, and exists in accordance with Cauchy’s definition, 

fa 1 1 /a 1 I 

The existence of / - sin dx depends upon the fact that / sin - dx converges to a 

J 0 * * J e X X 

definite limit, as e converges to zero, and this is sufficient to ensure the existence of the 

single integral. Although j i sin i d (x, y), taken over the domain bounded by x — e, x — a, 

y — 0, y = b, converges to a definite limit, as e converges to zero, this is not sufficient to 
ensure the existence of the Jordan double integral. Taking Jordan’s definition, let the 
domain D n consist of the rectangular spaces bounded by the lines y = 0, y — 6, and the 
lines parallel to the y-axis at the extremities of the intervals on the x-axis 


/ 1 a \ 

( > 

1 \ 

( 1 1 \ 

\(2n + l)n’ )’ 

\(2/i + n 

i) 7 T* (2 n f- 2) 7T/ 

\(2n 4- 5) v ’ (2 n 4- 4) n) 


The double integral taken through these spaces is 


( _1_ 1 \ 

\(4w -f 1) 7r* 4W7 t) 


ra fbi 1 p-2n f2p* fb x 1 

/ / - sin - dxdy 4-2 / / - sm - dxdy , 

I 1 Jo* * p~n + lJ 1 Jo* * 

(2n+l)*- (2p f 1) it 

/ o fb i i f n p •' 2n j 

I sin - dxdy + b I sin z 2 -- s — dz; 

1 /o x * Jo p-n+l * + 


(2n+l>. 

which is greater than 


h ( a 1.1 

b I - sin - 

J 1 * * 

<2n+l)ir 

b r i 

J 1 * 

(2 n+1)* 

6 / 


(4?l + 1) 7T J 


I sin zdz, 

J o 


1.1. 2rc& 

~8in- OX -f rr-—-rr — ; 

x x (4a 4- 1) 7r 


and this converges to b J -Bia^dx + 

whereas sin - d (x, y), taken over the domain bounded by x =* e, x = a, y =* 0, y b, 
j x x 

converges to , Z -0 1 . 1 , 

b - am - dx. 

Jo* * 

Therefore the mode of choice of the intervals D n affects the limit to which 

/ - sin - d (x, y) 

D n * * ' 

converges, as 2> n converges to the complete domain. Thus it is clear that the double 
integral, in accordance with Jordan’s definition, does not exist. 
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THE DOUBLE INTEGRAL OVER AN INFINITE DOMAIN 

369. The definition of a double integral has been extended by Jordan* 
to the case in which the field is unbounded. Let the function / (x, y) be 
defined at every point of an unbounded set of points G , which is the outer 
limiting set of a sequence of closed domains, each of which is measurable 
(J ); and let it be assumed that the upper and lower integrals of / exist 
over each bounded set of points A, that is measurable (J), contained in 6 ¥ , 
in accordance with the definition in § 338, or in accordance with that of 
§ 366. This does not make it necessary that the function / should be 
bounded in G , even if it be bounded in every such set A. We shall denote 
the upper and lower integrals of/, in A, by f A (f ), (/). 

Consider a sequence {A,,}, of sets A, such that A„ contains as a part, 
all those points of G, wdiose distance from the origin is < p n ; where { p n } 
is a monotone increasing sequence of positive numbers which increase 
indefinitely with n. 

If i\ n (f ) converges to a finite number n ~ oo, independent of the par¬ 
ticular sequence {A n }, subject to the above condition , then the upper integral 
1 Q (/), of the function over the set G, is said to exist , and is defined by that 
number. 

A similar definition may be applied to £ G (/), as the limit of f An (/). 

When l G (f), Iq ( / ) both exist , and are equal , their value is said to define 
Iq (/)> the integral of f over the unbounded domain G. 

The necessary and sufficient condition for the existence of 1 0 ( / ) is that , 
if e be an arbitrarily chosen positive number , the inequality | I A (f) | < € 
holds for every bounded set A, measurable ( J ), of points contained in G , and 
such that all the points of A are at a distance from the origin > p (e), where 
p (c) is some positive number dependent on e. 

To prove that the condition is sufficient, let A (1) , A (2) be any two bounded 
sets of points, measurable (J), each of them containing every point of G 
whose distance from the origin is < p (e). Let 8 (1) be the set of points 
of A (1> which do not belong to A <2) , and let 8 (2> be the set of points of A (2) 
which do not belong to A (1) ; thus the sets A (1) + 8 (2) , A (2) -f 8 (1) are 
identical. Therefore 

l a(!) (/) — (/) = Im if) — I tv (/); 

and since 8, 8' each contains no points of G at a distance < p (e) from the 
origin, we have | I A «) ( /) — /*<*> ( /) | < 2e. 

If {A n } be a sequence such as has been defined above, then if n be 
sufficiently large, p n >p(e); if then A (1) -=A n , A (2) = A n+m , we have 
I T±n(f)-~ T*n+m ( /) | < 2c, for a sufficiently large value of n, and for 

* See Cours <TAnalyse, vol. ii, 2nd ed. p. 81. 
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all positive values of m. Therefore { I ^ (/)} is a convergent sequence. 
Again, if {A n '} be any other such sequence as {A n }, and n , n ' are sufficiently 
large, we may take A (1) = A n , A <2) = A' n >; and thus 

|7a b (/)-7av(/)I<2c. 

As this holds for every value of e, when n , ri are sufficiently large, corre¬ 
sponding to each value of e, the two sequences {/ An (/)}, {/ 4V (/)} con¬ 
verge to the same limit; and thus 7 0 (f) exists as a finite number. 

To shew that the condition is necessary, let us assume that it is not 
satisfied. Then e can be so determined that, for every value of p , however 
large, there are sets A which contain no points of G at a distance < p 
from the origin, and for which | 7 A (/) | £ e. If {A n } be any sequence of 
the kind employed above, then, for any fixed value of n , the set A can be 
so determined that every point of it is at a distance from the origin 
greater than the upper boundary of the distances of all points of A n ; we 
have then | J An+A (/) — 7 ^ (/) | £ c. If this be done for each value of n, 
the sequence {7 ^+a (/)} cannot converge to the same limit as {/ An (/)}; 
the set A depends on n. It should be observed that the choice of A, for 
each value of n , is a case where the multiplicative axiom is employed. 

The theorem for the lower integral may be proved in the same manner, 
or may be deduced by changing the sign of /. 

370. The following theorem will be established: 

The necessary and sufficient condition for the existence of the upper and 
lower integrals of f over the set G is that the upper integral of | / | over G 
should exist as a finite number. 

It will first be shewn that the condition is sufficient. 

The upper boundary of | / | in any cell is the greater of the numbers 
| U |, | L |; where U and L are the upper and lower boundaries of f in 
the cell, and the lower boundary is the lesser of these numbers. Either 
U or L may be infinite. If D be any set of points, measurable (J), and 
contained in A, we have accordingly 

7d(|/D* |7,>(/)|, h(\f\)z\L>(f)\- 

If m (D) converge to the inner extent of A, we see that these relations 
still hold, when A is written for D. 

If then / A (| / |) < €, for every set of points A, all the points of which 
are at a distance £ p (e) from the origin, we see that | ? A (J) | < c, and 
| I a (/) | < €• Hence the conditions for the existence of /©(/), loif) 
are satisfied. 

To prove the necessity of the condition, let / = /+ — where /+ = /, 
at every point at which / is £ 0, and elsewhere let /+ = 0. By hypothesis, 
if A be any set of points such that all the points of G contained in it are 
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at a distance s a from the origin, we have | 7 A (/) | < Je, | J A (/) ] < 
provided a is properly chosen. 

Let A 1? A 2 be the two parts of A in which / 0 , / < 0 , respectively; 

and let A 1} A 2 contain sets I ) x , 7> 2 , measurable (J). 7 A (/*) is the limit of 
Tf,i (/+), or of 7 j ) 1+Z)2 (/*), as m (/>, 1 J) 2 ) converges to the lower extent 
of A. Hence D x , 1) 2 can be so de ermined that 7 A (/+) — I jy x + Dz (/ + ) < 17 , 
and /*(/')— (f ) < rj ; where 77 is an arbitrarily chosen positive 

number. 

Since the conditions | //>,(/) \ < | 7^ (/) | < £e, hold by hypo¬ 

thesis, we have 7^+^ (/+) < ]e, 7^+^ (/) < and therefore 

7*(/;)<*€ i-i,, 7 a (/ xjc+j. 

Now 7 A (/' +/“) - lim 7^+^ (/+ 1 / ) - 7 a (/+) + 7 a (/-) ft + 277 ; 

and since 77 is arbitrary, 7 A (| / 1 ) ft < e; therefore' |/ | has an upper 
integral over G. Thus the sufficiency of the condition has been established. 

371. It will now be shewn that: 

In order that l 0 {f) rnay exist, it is necessary and sufficient that I 0 (| / |) 
should exist as a definite number. 

To shew that the condition is sufficient, we observe that, D being any 
set of points measurable (J), | I D (/) | and | J D (/) | both lie between 
T 3 (\f\) and I D (\f\). Hence we infer that, in the interval, T G (\f\), 
Lo (1/ |), both 7 0 (/), l 0 (/) are contained. If then I 0 (\f |) and J 0 (|/ |) 
are equal, which is the case when the integral 1 0 (|/|) exists, it follows 
that /(?(/) exists. To shew that the condition is necessary; we observe 
that if I D (f) exists, in any set D that is measurable (J), so also does 
I D (|/ |) (see § 367), hence the same holds good for any set A, to which 
D converges; and proceeding to the limit, as A converges to G , we see 
that, if I 0 (f) exists, so also does I G (\f\), for l 0 (\f\), I 0 (\f\) both 
exist (§370), and each is the limit of / A (|/|). It thus appears that, in 
accordance with Jordan’s definition of the integral of f(x,y) over an un¬ 
bounded domain G , given in § 369, the integral is necessarily absolutely 
convergent, in the sense that | / (x , y) | has a finite integral in the domain G. 
It has been seen, in § 358, that this is not the case for a single integral of 
a function / (#) over an indefinitely great interval of x } as this integral 
may exist, and yet the integral of | / (x) | over the same interval may not 
exist as a finite number. For example, the integral of sin xjx over the un¬ 
bounded interval (0, 00 ) is not absolutely convergent. The reason for this 
difference between the two cases is that Jordan’s definition for the double 
integral is much more stringent than the definition in § 354 for the single 
integral. The single integral is defined as the limit of a sequence of integrals 
taken each over a single finite interval. If the integrals, of which the 
integral over the unbounded interval is the limit, were not restricted in 
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this manner, but might be integrals over other sets of points measurable 
(J), it would be possible to give a definition in accordance with which 
only absolutely convergent single integrals over an unbounded interval 
would exist. It would be possible to restrict the domains A in Jordan’s 
definition in such a manner as to admit of the existence of non-absolutely 
convergent double integrals over an unbounded domain G . 

When it is asserted that non-absolutely convergent double integrals 
over unbounded domains do not exist, the assertion must be taken to 
mean that such integrals do not exist in accordance with Jordan’s defini¬ 
tion, and not that it is impossible to give a definition in accordance with 
which double integrals exist that do not converge absolutely. 


EXAMPLES 

1*. The integral j sin {ax t by) z r ~ * y H 1 d ( x , y ), 

where 0 < r < 1, 0 < * < 1, taken over the positive quadrant, has no existence as an abso¬ 
lutely convergent improper integral. We find that the integral taken over the rectangle 
bounded by x = 0, x — h, y -- 0, y — k tends to the limit a~ r b ~ s r (r) r (.v) sin | (r 4 s ) n, 
as h and k are increased indefinitely. If the integral be taken over the domain x > 0, y > 0, 
ax + by < h y then when h is indefinitely increased, the integral has no limit if 1 < r 4 s <2; 
but it tends to the same limit as before, when r 4 s < 1. The integral may be regarded as 
conditionally convergent, if we adopt a definition in accordance with which it is sufficient 
that the integral taken through the rectangle x — 0, x — h y y — 0, y = k should have a 
definite double limit, as h and k are indefinitely increased. 


2f. The integrals 


j cos ( ax 2 4- 2hxy 4- by 2 ) d (x, y) 9 j sin (ox 2 4- 2 hxy 4 by 2 ) d (x, y), 

where a, ab — h 2 are positive, taken over the positive quadrant, do not exist as absolutely 
convergent integrals. It may be seen that, if the integrals are taken over the quadrant of 
a circle bounded by r — R y the value of the integral has no definite limit, as R is increased 
indefinitely. If the integral be taken over the rectangle bounded by x = 0, x — h\ y — 0, 
y = k\ then, when h' and k' are increased indefinitely, the integrals have 


0 , 


and - 


1 


2 Vab - h 2 


h 

Vab 


for limits respectively, the inverse cosine having its least positive value. These may be 
regarded as the values of the integrals, subject to a suitable restriction on the domains of 
which the positive quadrant is the limit. 


If a - 0, 6 = 0, h = i, J sin xy d (x, y), 

over the positive quadrant, has no existence, even considered as the limit of an integral 
over the rectangle. But 

f cos xy d (x, y) 


* Hardy, Messenger of Math. vol. xxxii (1903), p. 96. 
t Ibid. p. 169. 
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exists, and is equal to \rt y when the integral is defined as the limit of the integral over the 
finite rectangle. It may be remarked that the single integrals 

fh' rk / 

/ cos xydx, / sin xijdy 

Jo Jo 

are both non-convergent. 

THE TRANSFORMATION OF DOFBLE INTEGRALS 

372 . Let (x, y) be a point of a bounded perfect and connex domain //, 
and let x and y be expressed by means of two functions f x , / 2 in terms of 
two new variables f, 77, which may be represented by points (£, rj) in 
another plane. Let us suppose that the functions 

x =~ fi (£> y)> ii=U(Ly), 
and the reciprocal functions 

( = <t>i ( x > y), v = 4>-i (*, y), 

are such that the following conditions are satisfied: 

(1) . To each point (x, y) there corresponds one point (f, 77); and, con¬ 
versely, to each point (f, 77) there corresponds one point (x, y) ; and to the 
bounded domain H there corresponds a bounded domain H. 

(2) . The functions f y (£, 77), f 2 (£, y) are continuous functions of (£, 77) 
throughout the domain //. 

( 3 ) . The functions f x (£, y), f 2 (£, 17) have, at every point (f, y ), of H, 
definite partial differential coefficients with respect to £ and y , and each 
one of these is everywhere continuous with respect to (f, 77), and nowhere 
vanishes. 

( 4 ) . The Jacobian of f x (£,y), f 2 (f, y) with respect to £ and y does not 
vanish in the domain H. In virtue of ( 3 ) the Jacobian is everywhere 
continuous, and of fixed sign. 

From (2) and ( 3 ) it follows that, if (f 4- A£, y + A77), (f, y) be two points 
of H , and (x 4- Ax, y 4 - A y), (x, y) the corresponding points of H, then 



where d x , S 2) 0 3 , 0 4 converge to zero, as A£, A 77 do so, and (see § 310) 
uniformly for all points (f, y) in any closed domain contained within H. 
On solving these equations, we find 
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with a similar expression for Ar], where J denotes the Jacobian 

d (b V) 

and a x is a function of 0 1} S 2 , 0 3 , 0 A which converges with them to zero. 
Since, by (4), J never vanishes, it follows from these equations that 
A£, Arj converge to zero with Ax, Ay, and thus that the functions fa (x, y ), 
fa ( x , y) are both continuous functions. 

The partial differential coefficients 

d(f>i df 2 j j dfa dfa dfa 

0# drj/ dy ' dx 9 dy 

are also continuous in H ; and therefore 

A ^ = /t + X ') A * + (fy '■ * 2 ) A ^ 

A, = + X3 ) Aa; + + * 4 ) Ay, 

where y 1? ^ 2 , y 3 , converge to zero with Ax, Ay, and uniformly so for all 
points (x y y) in a closed domain contained within H. 

Corresponding to any closed set h , of zero content, contained within H, 
there is a closed set h , of zero content, contained within H. It is clear, 
from the continuity of the functions which define the transformation, that 
a limiting point of a sequence of points in H corresponds to the limiting 
point of the corresponding sequence in H; and thus h is closed, since 
h is so. 

Writing A£ = LAx + MAy, Arj = L’Ax + M'Ay, 

it follows, since 

where L , L\ ... converge uniformly to 

dfa dfa 
dx 9 dx 1 “ ‘ ’ 

that, if | Ax |, | Ay | be both restricted to be less than a fixed positive 
number e, the ratio (^f) 2 + 2 

(Ax)* + (Ay) 2 

has a finite upper limit A 2 , for the whole domain H. Now let the points of h 
be enclosed in a finite number of circles, the radii of which are all < c; 
it then follows that the points of h can be enclosed in a finite number of 
circles of which the radii are all less than eA. The sum of the areas of 
these circles on the (£, rj) plane, which contain within them all the points 
of hy has to the sum of the areas of the circles on the (x , y) plane, which 
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enclose all the points of /&, a ratio less than A 2 . Since the sum of the latter 
circles can be taken to be arbitrarily small, it follows that the points of h 
can all be enclosed in a finite number of circles the sum of whose areas is 
arbitrarily small. Therefore h has the content zero. 


373. Let / (x , y) be a bounded function, defined for all points of a 
closed connex domain G , contained in H , the frontier of Q having content 
zero; and let / (x f y) be integrable ( R) in G. If x t y be expressed in terms 
of £, 77 by the relations 

* = fi (€> v)> y=ft(£>v)> 

which satisfy the conditions of § 372 , then, corresponding to f (x, y) in G , 
we have a -function F (£,r}) in the domain 6r, contained in H f which 
corresponds to G. The frontier of G , corresponding to the frontier of G , 
has also the content zero. A point of discontinuity of f (x, y), in the (x, y) 
plane, corresponds to a point of discontinuity of F (£,rj) in the (£, tj) 
plane, the measures of discontinuity at the corresponding points being 
the same. Since those points of ( x , y) at which the saltus of / ( x , y) is 
2; k form a closed set of zero content, it follows that the points of (f, tj) 
at which the saltus of F (£, tj) is £ k form also a closed set of zero content; 
and therefore F (£, 77) is integrable in (*. 


In order to transform j f (x, y) d ( x , y ), taken throughout 6r, into an 

integral taken throughout G , it is convenient to make use of an inter¬ 
mediate transformation* x = i/j (u ly u 2 ), y = u 2 , followed by the trans¬ 
formation Mj = f, u 2 =/ 2 (£, 77); the function (u lt u 2 ) being such that 

0 («!, W 2 ) =/i (f, rj). 

It is easy to see that each of these transformations satisfies the conditions 
of § 372 . 

Since is the Jacobian of (x, y) with respect to (u l9 u 2 ), we have, by 

C7»i 

a known theorem, dx d (u u u 2 ) _ 

0M, ' 0 (f, Tj) 

hence, since J never vanishes, 


also never vanish. 


3 * Q „J 3 («!.«*) 

3 % nfv) 


* This method is employed in the general case of multiple integrals by Pierpont; see his paper 
“On multiple integrals/’ Trane. Amer. Math. Soc. vol. vi (1905), p. 416. It is, however, there 
assumed that / (x, y) is integrable with respect to x for each value of y : but this is unnecessary. 
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Since / (x, y) is integrable in G, we may, in accordance with the result 
of § 363, replace the double integral 

j / (*> y) d (*> y) 

by the repeated integral j dy Jf (x, y) dx, 

or by J dy jf (x, y) dx. 

Applying the transformation x = \fs (u^ , u 2 ) to the upper and lower 
integrals — . 

jf(x,y)dx, jf(x,y)dx; 

these may, in virtue of the theorem of § 360, be transformed into the 
single upper and lower integrals 


f | <t> («i > «*) sir > 


0 a: 

’ du± ’ 


where <f> (u x , w 2 ) represents the function of , ?^ 2 which corresponds to 
/ (a;, y). We thus have 

[/ (*, y) d (x, y) = j du 2 f <f> (w, , u t ) ~ du 

= f ( <t> K, « 2 ) <*Wl 

= [ <M M 1 > d («I . “i). 

the double integral being taken through the domain in the plane , u 2 ), 
which corresponds to G in ( x , y). 

Applying the same method of transformation to 

I $ ( u i > ^2) d (^1 > ^2)? 

where u 2 = / 2 (£, 77 ), 

we have [ / (*, «/) d(x,y)= f F (f, v ) d (f, 17), 

where 8m » 

^ a(£,*?) ’ 

hence finally we obtain the formula 

J/fo 2/) d(x,y) = | F (f, ?;) Jd (£, rj); 

which is the formula of transformation of the integral of / {x, y) throughout 
G into an integral throughout G . 
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It has been assumed that J has a fixed sign throughout the domain of 
integration. If now this sign be negative, the product A£Ar), in JA£Arj, 
which corresponds to Ax Ay in the plane of (x, t/), must be accounted 
negative, when Ax Ay is positive. It is however more convenient to con¬ 
sider A£At] as essentially positive, otherwise the measure of a set of points 
in the (£, 77 ) plane would have to be reckoned as negative. Adopting this 
convention, we write | J | d (£ , 77 ) instead of Jd (£, 77 ); and therefore the 
formula of transformation will be written in the form 

J/(*> y) d (*. y) = | f (£> v) I J \ d (f> v)- 

374. Let us now assume that, at certain points of 0 , which form a 
set L of zero measure, either ( 1 ), / (x, y) has an infinite discontinuity, or 
( 2 ), one or more of the partial differential coefficients 

% % % 

Sr,’ Sf’ Sr, 

is discontinuous, or (3), the Jacobian J vanishes. In case J be positive 
over a part of G, and negative over another part, it is convenient to divide 
the double integral into two portions, taken over these two parts of G 
respectively, and to transform these two portions separately. It will 
accordingly be assumed that J never actually changes its sign in the 
domain G, although it may vanish at the points of the part L, of G. We 
may denote by L the set of points on the (f, 77 ) plane which corresponds 
to L : it will be assumed that L has zero content. It will be shewn that, 
if one of the two integrals 

f f (*> y) d (x, y ), 

taken over G, and J F (£, r,) | J | d (£, •>,), 

taken over G , exists as an absolutely convergent improper integral, or as 
a proper integral, then the other one exists, and the two have the same 
value. 

Let us assume that / /(x, y) d (x, y) exists: it will then be sufficient, 
Jo 

in order to establish the existence of the other integral, and its equality 
with the first, to shew that, for any domain G 1 , contained in G , and itself 
containing no point of L, either in its interior or on its frontier (which 
frontier is to be taken to be of zero measure), the condition 

If f(x,y)d (X, y)- f F (i, 77) | J | d (f, t,) | < rj 

I Jo • Oi I 

is satisfied, provided that m\G) — m{G l ) be less than some fixed finite 
number dependent on rj. 
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A domain g y interior to 6r, and containing, in its interior and on its 
frontier, no points of L, can be found such that 

f f (X, y) d (X, y) - [ f (x, y) d (x, y) < 6. 

Jo J a 

If h be any domain contained in <7, such that m (g) — m (h) is sufficiently 
small, we have 

j j f (x, y ) d (x, y) - jj (x, y) d(x,y) < e\ 

and therefore / f (x, y) d ( x , y) — If (x, y) d (x, y) < 2c. 

I Jo J h 

Now let k be a domain interior to G, containing in its interior, and on 
its frontier, no points of L, and containing h, then 

f / (x, y) d(x,y)-[f (x, y) d (x, y) < 2c. 

Jo J k 

For, let p denote the domain obtained by taking the two domains 
g and k together, then 

| / (*. V) d (x, y) — f (x, y) d (x, y) < e, 

JO jp 

\ f (x, y) d (x, y) — f (x, y) d (x, y) < e, 

J p J k 

and by combining these inequalities the result follows. 

We have now 

fg. f (£, v) \J I d (i, V ) = \ v F V ) I J I d {(, V )-\_F(i,r,)\J\d (f, V ), 

where U is the domain formed by taking all points which belong to one 
or both of the domains G 1 and h\ and V consists of those points which 
belong to h but not to G x . 

Now U corresponds to a domain in the ( x , y) plane which contains h , 
and which domain may be taken to be identical with k ; therefore 

L F (£> Vi) I J I d (£, tj) differs from ( / (x, y) d (x, y) 

JU JO 

by a number numerically less than 2 c. Again 

Jp F (€> v) I J I d (£, v )\<n{m(G)-m (GJ}, 

where p is the upper boundary of | F (f, rj) J | in the domain Y y obtained 
by removing from h those points which belong to G x . 

We thus have 

| \ Q f ( x > V) d (x, y)- (£, 7 ))\J \d (f, J < 2c 4- p {m (G) - m (G x )}. 
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Now let e be so fixed that it is < \rj, then h is fixed, so that fx cannot 
exceed a fixed finite number fx x . If then G x be so chosen that 

m (G) — m (G x ) < , 

the inequality 

I ( /(*, y) d (z, y) - L F ((, rj)\J\d (f, tj) < rj 
will be satisfied. Therefore it follows that 

(£> v) I I I d (£, 7)) 

exists, and is equal to / ( x , y) d (x, y). 

Jo 

375 . This method of transformation may be extended to the case in 
which one of the domains G, G is infinite, or to the case in which both 
are infinite. It can be shewn that, if either of the integrals 

f / (*> V) d (x, y), L F (f, rj) | J | d (£, t j) 

Jo Jo 

exists, the definition of § 366 being applied when G or G is infinite, then 
the other integral also exists, and the two integrals are equal. The proof 
can be given by slightly modifying the procedure of § 372 . 

There may be a set of points of zero measure, in the domain G , such 
that the corresponding values of f, 17 are infinite, or such that one of them 
is infinite. This set now takes the place of the set L. Whether G be finite 
or infinite, the domain h , contained in G , may be so fixed as to exclude 
all points which correspond to infinite values of f or The domain k 
including h , and containing no points which correspond to infinite values 
of f and 77, may then be fixed as before, and will satisfy the condition 

[ f (*, y) d (x, y) — f f ( x, y) d (x, y) < 2e, 

Jo Jk 

it being assumed that the integral of / ( x, y) over G exists. 

The domain h contains all points of G of which the distance from the 
origin is less than some number R depending on the domain G — h, which 
contains in its interior all points (x } y) that correspond to infinite values 
of £ or 77, or of both. The same statement holds for k , which contains h. 
When the finite domain G x is such that the condition 

J T F ({, ■>?) | J | d (£, tj ) < ly 

is satisfied (and, in order that this may be the case, G x must certainly 
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contain all points of G whose distance from the origin is less than some 
fixed number R y ^ R ), we have as before 

If / (*> V) d (x, y) - f F (i, v )\J\d (ft rj) I < v ; 

[JO J G x I 

and as rj is an arbitrarily fixed number, we thus see that 

[ F(f, v )\J\d((; v ) 

Jo 

exists, and is equal to / f (x t y) d ( x , y). 

Jo 

THE RIEMANN-STIELTJES INTEGRAL 

376. The notion of the integral of a bounded function / (#), defined in 
the linear interval (a, ft), with respect to another bounded function </> (a?), 
defined in the same interval, is a generalization of the integral of a function 
/ (a:), with respect to the variable x, that was first introduced* into Analysis 
by Stieltjes in connection with the theory of continued fractions. This 
integral, in a generalized form, has recently become of considerable im¬ 
portance; and consequently, an account of it, so far as it can be regarded 
as a generalization of the -integral, will be given here. 

If (a, x 1} x 2 , ... x m -i , ft) be a net, fitted on to (a, ft), and in which the 
breadth of the greatest mesh is d , let us consider the sum 
S d ^f (ft) {<f> (*,)-* (a)} + / (ft) {cf> (x 2 ) - <f> (x,)} 4 ... 

+ /(ftn) W (ft) - <f> 

where / (x), cf> (x) are the bounded functions, defined in the interval (a, ft), 
and ft, ft, ... ftn are points, assigned in any manner, which are in the closed 
meshes ( 0 , x x ), (a:, ,x 2 ) ... , b) respectively. 

We may denote x 0 by a, and x m by ft. 

If the functions / ( x ), </> (x) be such that S d converges to a definite 
number, as the number rn of the meshes is increased indefinitely, subject 
to the condition that d converges to zero, and if this limit is independent 
of the mode of successive sub-division of the interval by the nets, and of 
the mode in which the sets of points ft, ft, ... are assigned in the nets, 
/ (x) is said to have a Stieltjes Integral with respect to <f> (#). Such integral 
is defined to be the limit of S d , as d ~ 0, and it is denoted by 

( / (x) d<f> (x). 

J a 

It will here be spoken of as a Stieltjes integral. 

In the case in which / (x) is continuous in (a, ft), and <f> (x) is monotone 
and bounded in the same interval, the existence of the integral was estab- 


* See Annales de la Faculty dee Sciences de Toulouse , vol. vm (1894), p. J 71. 
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lished by Stieltjes. This will be proved below as a particular case of a 
more general theorem. 

It will be shewn that: 

fb fb 

(1) . If f (x) d</> (#) exists as a Stieltjes integral , then <j> (x) df (x) also 

J a J a 

exists; and that 

[ / (x) d<f> (x) + f <f> (x) df (x) — f (b) f> (b) - f (a) <f> (a). 

-a J a 

(2) . If <f> (x) — <f>i (x) — <f >2 ( x ), and the bounded function f (x) has Stieltjes 
integrals with respect to the bounded functions <f> x (x), <f > 2 (x), then 

[ f (x) d<f> (x) 

J a 

also exists as a Stieltjes integral , and has the value 

[ f (x) d(f >! (x) - [ f (x) dj > 2 (x). 

J a J a 

r - m T Xr 

To prove (1), we see that S <£ (f r ) A I r ^f {x), where A f r _ y f (x) denotes 

r »1 

/ (^r) “ / (^r-i), may be written in the form 

r~m+l * 

- S f(x,^)At_ t <l>(£)+f(b) 4 ,(b)-f(a)i(a), 

r «1 

where £ 0 denotes a , and £ m+1 denotes b. The points a, £ lt ..., £ m ,b define 
a net with m -f 1 meshes, the breadth of each of which is £ 2 d. If d be 
rb 

sufficiently small, since f (x) d<f> (x) exists, we see that 

J a 

r-r/i+1 > 

2 /(*,_,) *£_*(*) 

r 1 

r~m ^ 

differs from this integral by less than e ; therefore E <f> (f r ) A Xr _j/ (x) 

r «1 

differs from f b 

- f(x) d<f> (x) + / (b) <f> (b) - / (a) <f> (a), 

J a 

by less than e, provided d is sufficiently small, however the points £ are 

fb 

chosen. Since e is arbitrary, it follows that <f> (x) df (x) exists, and is 

J a 

equal to 

- f(x)cUf>(x)+f(b)f>(b)-f(a)<f>(a). 

J a 

In order to prove ( 2 ), we employ the identity 

r-m _ r-m r~m 

£ / (tr) AS-. 4>X (*) - 2 / (fr) AS-. <f >2 (*) = £ / tf r) A^ <f> (x), 

r-1 r-1 r-1 

and the result follows by taking the limits of the expressions on the left- 
hand side. 
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As in the case of the i?-integral (see § 330), we may, in the definition 
given above of the Stieltjes integral, replace the numbers / (f r ), where 
r — 1, 2, 3 ... w, by any numbers M%_ x f(x), where M%_ x f (x) is restricted to 
be in the interval bounded by U%_ x f (x), L%_ x f (x)\ these numbers de¬ 
noting the upper and lower boundaries of / (x) in the interval ( x r ^, x r ); 
we thus obtain the following definition of the Riemann-Stieltjes or RS - 
integral : 

Tr 

If S D denotes the sum E M/ r _ x f (x) A£_ x (x) taken for a net 

r- 1 

(a,x lf x , 2 , ... x m _ ly b), 

fitted on to ( a, b ), the breadth of the greatest mesh being d, where M%_ x f (x) 
is in the interval bounded by the upper and louder boundaries of f (x) in the 
closed mesh (Xj .^, x r ); and if S D converges to a definite number , as the number 
m is increased indefinitely , subject only to the condition that d converges to 
zero , the limit of S D being assumed to be independent of the mode in which 
the successive nets are defined , and of the mode in which the numbers 
M%_ x f (x) are chosen , then the limit of S D is said to define the RS-integral 

f f(x)d<f>(x). 

J a 

It is clear that, in case the ^-integral of / (x) with respect to (f> (x) 
exists, the Stieltjes integral also exists. It will be shewn that the converse 
also holds good. In the case <j> (x) = x , we have the ordinary R -integral; 
and in this case the equivalence of the two definitions has been established 
in § 330. 

In accordance with the above definition of the i&S-integral, when the 
integral exists, either of the sums 

r-m _ _ r-m _ _ 

s £ L%_j(x) a£_,<M*) 

1 r-1 

has the RS-integral for its limit. Assuming the existence of the Stieltjes 
integral, since it is always possible to determine the points fi 
so that 0 ^ U%_ x f (x) — f Hr) < for r = 1, 2, ... m ; where c is a pre¬ 
scribed positive number, we see that 

r “W _ _ r-m 

S UZ-J(x) A£_, <f>(x)~ £ / tf r ) AS_, <f> (X) < e £ | 4>(x)\. 

r-1 r-1 

For the particular net Z>, € may be so chosen that c | A&_ x <f> ( x ) | does 
not exceed an arbitrarily chosen number tj ( D ). Thus the number on the 
left-hand side is < rj ( D ). In case the function <f> (x) is of bounded varia¬ 
tion in (a, 6), the number £ | A^_, <f> (x) | is less than a number K , in¬ 
dependent of the net; in this case we may take e to depend only upon d 9 
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the maximum mesh of D , and so that c converges to zero, as d ~ 0. We 
then see that 

r - m 

2 UZj(x) *£_..*(*) 

r = l 

r w 

has the same limit as 1] / (£ r ) </> (x), 

r = 1 

which is the Stieltjes integral. In case </> (x) is not of bounded variation, 
we consider a particular sequence of nets such that d converges in the 
sequence to zero, whether the sequence forms a system of nets or not. 
The values of e in the sequence may be so chosen that rj (D) converges to 
zero; thus the equality of the limits holds for any particular sequence; and 
we have the same result as before. In a similar manner it can be shewn that 

r = m _ 

r = 1 

has the Stieltjes integral as its limit. It then follows that 

r -77i 

s c,/wa:jw 

r -1 

has the Stieltjes integral for its limit; and therefore the ES -integral exists. 

The most important case is that in which <f> (x) is of bounded variation 
in (a, 6), or in particular is bounded and monotone. Since the BS- 
integral exists when the Stieltjes integral exists, in virtue of the theorem 
(2) above, the relation 

( / (x) d(f> (x) = [ / (x) dfa (x) — f f (z) d<f> t (x) 

J a .'a J a 

holds good for iZS-integrals, where </> (x) = <f> ± (x) — </> 2 (x) ; the functions 
4 *i ( x )> 4 >2 ( x ) being monotone, and it being assumed that the integrals on 
the right-hand side exist. 

It can easily be shewn that the i?$-integral cannot exist if, at any 
point x of (a, 6), both the functions /(x), </> (x) are discontinuous. For, 
let a net be so chosen that x is interior to the mesh (x r _j, x r ); for this mesh 
we may take M%_ x f (x) to have either of the values Ull_ x f (x), (x); 

and the corresponding sums will differ by 

(UZ_J (X) ~ (X)) A£_^(x), 

provided the values of (x) for s ^ r are taken to be the same in the 

two cases. This difference is greater than some fixed number, if the interval 
(x r _!, x r ) y including x in its interior, is properly chosen, when /(x), <f> (x) 
are both discontinuous at x, the trivial case in which 4 > (%) has merely a 
removable discontinuity at x being excluded. It follows that a sequence 
of nets and two sets of values of M%_ x f (x), which differ only in one mesh 
of each net, can be so chosen that the corresponding sums cannot converge 
to the same limit. 
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376 1 . The following theorem* will be established for the RS -integral: 

If f (or) be bounded in (a, b ), and cf> (#) be of bounded variation in (a, b ), 
the necessary and sufficient condition that f ( x) should have an RS-integral 
with respect to <f> (x) is that the variation of <f> (x) over the set of points of 
discontinuity of f (x) should be zero. 

It will be observed that the condition cannot be satisfied if / {x), <f> (x) 
have any common point of discontinuity. 

The upper and lower variations of <f> (x), in case it be monotone, over 
a set of points in ( a , b ), have been defined in § 252; when the upper varia¬ 
tion is zero, the variation over the set of points exists, and is equal to zero. 

We may, without loss of generality, assume <f> (x) to be monotone and 
non-diminishing; for in the general case </> (x) is the difference of two such 
functions P ( x ), N (x), and the variation of <f> (x) over a set of points is 
the sum of the variations of P (x) and N (#) over the set (see §244). 
We assume accordingly that ch (x) is monotone non-diminishing in (a, b). 

Let us consider a system of nets {!)„}, with closed meshes (see § 51) 
fitted on to (a, b); and let the sums &jy n , Sj) n corresponding to the >?th net 
of the system be defined by 

__ r *= m n r r r m n _ 

s Dn = z ££_./(*).a;:jw, s 

r * 1 r - 1 

It is easily seen that, when a net (x r l , x r ) is divided into two parts, 
U s x r r _J(x). A*'.., <f> (x) cannot be increased, and L J x r r _ i f(x).A T r ' r _ l cf>(x) cannot 
be diminished. For, if x' be the point of division, Uf r ,/(.r). A^_, </> (z) 
becomes f/^j (x) . </> (x) + 17?/ (x). A? <f> (x), 

where A£_» <f> (x) = A^_, <f> (x) -} A? <f> (x), 

and U*;_J (x) s U x ; r _J(x) f l%f (x) * U%_J (x), 

from which the result follows at once. For //£_,/ (x) <f> (a*) the corre¬ 

sponding result is proved in a similar manner. 

It follows that Sd h cannot increase, and Sj) n cannot diminish as n is 
increased; thus {Sd k } is a monotone non-increasing, and {S/j n } a monotone 
non-diminishing, sequence, for the system of nets. Hence So n has a lower 
boundary S, and $/> n an upper boundary S, for the system of nets. 

It has been shewn in § 334 that, if l, be a prescribed positive number, 
the interval (<z, b) can be covered by a finite set of intervals, each of 
length £ £, such that, in each interval, the fluctuation of / (x) is < a> «) + 
where £ is a definite point interior to the interval, and w (£) is the saltus 
at f, and e is an arbitrarily chosen number. The end-points of this finite 

* See W. H. Young, Proc. Lond. Math. Soc. (2), vol. xm (1913), p. 133. 
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set of intervals divide (a, b) into a finite number of parts ; let rj be the 
length of the least of these parts. Any interval whatever 8, in (a, b), of 
length < 77 , is interior to one of the intervals of the finite set. Hence, 
in 8, the fluctuation of / (a?) is < e 4 to (£), where £ is some point in an 
interval of length -• £, which contains 8 in its interior. 

Let G e be the closed set of points at which a> (#) ^ e; with centre x , 
any point of G t , an interval of length 2 p can be taken; all such intervals, 
or the parts of them in (a, 6 ), coalesce into a non-overlapping finite set 
//« fP , of which G e is the inner limiting set, as p ~~ 0 . Assuming that the 
variation of </> ( x) over G e is zero, p can be so chosen that the variation 
of <f> (x) over H, tP is < £', an arbitrarily chosen positive number. In any 
interval of the set II ftP there is no point of G , of which the distance from 
either end-point is < p. We can suppose £, ai\d consequently rj, to be so 
small that, if 8 be not interior to an interval of there is no point 

of (7 e in the interval 8. 

It will now be proved that S S, if the condition that the variation 
of </> (x) over the set of discontinuities of / (a;) is zero be satisfied. We have 

Sj> H - Sd„ = x { Ut_J (X) - L%_J (X)} AZ, 4> {x). 

r -1 

The number n being taken so large that every mesh of D n is < 17 , we find 

that Sv n - Sj >n < 2 e (ft) - <!> (a)} + C'(U-L), 

where the first term on the right-hand side arises from those meshes that 
are not interior to an interval of H ttP \ and in each such mesh the fluc¬ 
tuation of / (a;) is < 2 e. The second term arises from the remaining meshes. 
Since e, £' can be made to converge to zero, as n ~ 00, we see that S ~ S. 

This has been shewn to hold good for any one system of nets. It 
will now be shewn that the number S — S =-- S is the same for all systems 
of nets. If possible, let it be assumed that for a second system of 
nets, the number S' — lirn Sn n ' — lim Sz> n ' has a value different from S, 

7l~00 7l~O0 

and let it be supposed that S' > S. If e be an arbitrarily chosen positive 
number, n may be taken so large that 

Sn n < S 4- €, Sj) n > S — €, Sn n ' < 8' 4 $d h ' > S' — e. 

Let the two nets of order n in the two systems be superimposed, and let 
S n ", S n " be the sums for the new net so obtained. 

We have then 

8 n n " ^ So n < 8 4- e, S Dn " £■ Sj) n > S — e ; 

and similarly 8n n " <&' + *> 8 Dn " > S' — €; 

from these inequalities we deduce that 

S» n " ~ S Dn " <S - S'+ 2e. 
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If € be chosen to be less than the positive number | (S' — S), we see 
that $j> n " — $D n " has a negative value, and this is impossible. It has 
thus been shewn that S — ’S '; therefore the number S is the same for all 
systems of nets. 

It will next be shewn that the convergence of Sjj n to S, or of S# n to S, 
is uniform with respect to all possible systems of nets, in the sense that a 
number d can be so chosen that, for any net of which all the meshes are 
^ d y the sum of the products of the fluctuation of / (x) in a mesh into 
the difference of values of </> (x) at the ends of the mesh is less than a 
prescribed positive number €. 

Denoting U%_ x f (x) — I?J r _ x f (#) by F x r r ^f ( x ), let a fixed net T) x be so 

r -= rn 

chosen that 21 F^_ x f (x).& x [_ x <]> (x) < le . 
r» 1 

We consider a second net D 2 , for which all the meshes do not exceed 
in length a number d to be fixed hereafter, and which may be faken in 
the first instance so small that no mesh of 1) 2 contains more than one of 
the points x r in its interior or at an end-point. Those meshes of 1) 2 that 
contain, as interior point, or as an end-point, one of the points x r are 
at most 2 m in number; the remaining meshes of 1) 2 being interior to meshes 
of D x . A mesh 8 of D 2 which contains x r , either as an interior point or 
as an end-point, is such that the term corresponding to it in the sum for 
I) 2 is the product of the fluctuation of f (x) in 8 d) into the difference of 
the functional values of <£ (x) at the ends of 8. Since f (x), <f> (x) are not 
both discontinuous at x ri one or other of the factors in this term diminishes 
indefinitely as d does so, the other factor being less than a fixed number. 
The number d can accordingly be taken so small that the magnitude of 

the term in the sum for D 2 does not exceed ~ . The value of d can be 

4m 

chosen so small that this condition is satisfied for each mesh w hich contains 
one of the points x r \ then the sum of the corresponding terms in the sum 
for D 2 is ?; Je. The remaining part of the sum for I) 2 is less than Je. There¬ 
fore the sum for D 2 is less than €. Therefore d has been so chosen that, 
for any net D 2 whose meshes are all of length ^ rZ, the sum 

r --to' , 

2 r,v-,/(*)- A*v_, <f> (x) is < e, 

r «1 

which is what we had to prove. The sufficiency of the condition stated in** 
the theorem has now been proved. 

To shew that the condition that the variation of <f> (x) over the set of 
points of discontinuity of / ( x) is necessary, in order that S = we 
observe that Sj? n — $z> n is ^ . 2! {<f> (x r ) — <f> (av-i)}, where the summation 

is taken for those meshes of D n which contain points of 0 t within them, 
or at an end-point. For, if a point of G t be a common end-point of two 
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meshes, in one at least of those meshes the fluctuation of / (x) is i- 
Unless the variation of (f> ( x) over G t is zero, the sum 

E {</> (X r ) - (f> (X^)} 

remains greater than some positive number, however large n may be, 
and in that case So n — $z> n cannot converge to zero. Thus it is necessary 
that the variation of </> (#) over G t should be zero for every value of e, 
and this cannot be the case unless the variation of </> (x) over G, the outer 
limiting set of G t , for a sequence of diminishing values of e, converging 
to zero, is also zero. 

It should be observed that the condition that </> (x) has variation zero 
over the set of points of discontinuity of / (#) may be satisfied even when 
the measure of the set is > 0. For example, if <f> (x) — x, at the points of 
a closed set E, of positive measure, and if (f> (x) is constant over each 
contiguous interval of E, whilst all the discontinuities of / (x) are in these 
f b 

contiguous intervals, / (a?) df> (x) exists as an i&SMntegral, although 
J a 

fb 

f (x) dx may not exist as an 72-integral. 

J a 


376 2 . It is seen as a particular case of the above general theorem that: 

In case f (#) is continuous , the RS-integral of f (x) with respect to any 
bounded monotone function , or function of bounded variation , always exists. 
In case </> (x) is absolutely continuous , and f (x) has an R-integral in (a, b ), 

fb 

the RS-integral f (x) d</> (#) always exists. 

J a 

We need only consider the case in which the absolutely continuous 
function <f> (#) is monotone, for, in the general case, </> (#) is the difference 
of two such functions. The set of points of discontinuity of f (x) has 
measure zero, and the variation of an absolutely continuous function over 
a set of measure zero is zero (see § 352). Thus the condition in the general 
theorem is satisfied. 

In case <f> (#) is continuous , and, of bounded variation , and f (x) has only 
an enumerable set of points of discontinuity , and in particular in case it is 
of bounded variation , the RS-integral of f (x) with regard to <f> (x) exists . 

We need only consider the case in which <f> (x) is monotone and con¬ 
tinuous. To an enumerable set of points on the ^-segment, there corre¬ 
sponds an enumerable set of points on the ^-segment, where £ = (f> (x). 
The measure of this latter set is zero, and therefore the measure of (x) 
over the set of discontinuities of f (x) is zero; the sufficient condition is 
thus satisfied. 
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If f (x), <f> (x) are both of bounded variation in (a, b) and have no points 
of discontinuity in common , either of them has an RS-integral with respect 
to the other. 

We consider the case in which <f> (x) is monotone, as the general case 
is deductible from this. Since / (x) is of bounded variation, its set of points 
of discontinuity is enumerable. Since <f> (#) is continuous at all the points 
of this set, its measure over the set is zero (see § 252); and this is the 

rb 

sufficient condition for the existence of | J (x) dcf> (x). 

a 

THE GENERALIZED RIEMANN-STIELTJES INTEGRAL 

377. Definitions of an /^-integral have been given by Hardy*, W. H. 
Youngf, and PollardJ. These definitions vary in scope, and some of them 
are more general than the definition given in § 376. 

If f (x), <f> (x) be any two functions, bounded in (a, b), and such that 
(f> (x) is monotone non-diminishing ; let D denote a net with closed meshes, 
fitted on to (a, b). We consider the two sums, for the net D , 

_ r m tw 

s D - 2 cC./w-A,;,,<£(*), &>- z C,/(x).a;;jw. 

r 1 r 1 

It is clear that z L {<f> (b) — <f> (a)}, S D U {<f> (b) - </> (a)}; where U 
and L are the upper and lower boundaries of / ( x) in (a, b). It follows that, 
when all possible nets D are considered, Sn has a finite lower boundary, 
and S D has a finite upper boundary; these may be denoted by 

(G) f / ( x ) d<t> (*>, or I, (G) [ f (x) d<f> (x), or /, 

J a J a 

respectively. They may be spoken of as the upper and lower generalized 
BS-integrals of / (*) with respect to <f> (x), in (a, b). If e be an arbitrarily 
chosen positive number, it is possible to determine a net D, for which 

Sn< (G) Tf(x)d<j>(x) + e , 

J a 

and also a net D\ for which 

S D ‘ > (G) jf (x) d<f> (x)-€. 

It can be shewn that 

( G ) [ / ( x ) d<f> (x) £ (Cr) [ f (x) d<j> (x), or 7 a /. 

Ja .'a ” 

* Messenger of Math . vol. xlvui (1918), p. 90. 

t Proc. Lond. Math. Soc. (2), vol. xm (1913), p. 109; ibid. vol. xv (1916), p. 35. 

% Quarterly Journal of Math. vol. xlix (1923), p. 73, where a discussion of several definitions 
is given, and the properties of the corresponding integrals are discussed in detail. 
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Assume that, if possible / < /, then for the net Z), Sd < l -{ t \ and for 

the net D f , S D * > Z — c. If we superimpose the nets Z), D' into a single 

net (Z), Z)'), we have 

^ S D < / f f, S(D,D') ^ $/>' > I — €• 

Choosing 6 to be less than \ (I — I), we see, from these inequalities, that 

S(n.D') > S(u,D')> which is impossible in accordance with the definitions of 
S(n,D')> £(/>./>'). Therefore I cannot be less than 7 . 

In case the upper and lower generalized RS-integrals I, I are equal , their 
common value I is said to define the generalized RS-integral 

(G) [ / (x) d<f> (x), 
a 

which then exists. 

It will be shewn that: 

When the generalized RS-integral exists , there exists a system of nets {D n } f 
such that Sd u , Sn n both converge to I, as n ~ co. 

If {€ n } be a sequence of diminishing numbers converging to zero, a set 
of nets Z) 1 (1) , Z) 2 m , ... Z) w (1) , ..., for which d n (1) , the maximum length of a 
net, converges to zero, as n ~ 00, exists such that Sd^d < I 4- € n , for 
n — 1 , 2, 3, ... ; and there also exists a set of nets Z) 1 (2) , Z> 2 (2> , ..., Z) w (2) , ..., 
for which S Dn w > I — e n , and for which d n {2) converges to zero, as n ~ 00. 
Consider the sequence of nets 

(z^^zv^uzv"^^^ 

.... This sequence forms a system D of nets; and for the nth net l) n of this 
system we have Sr )n < Z + e n , Ss n > I — € n . It then follows that 

lim Sn n = li m §D n — 7 . 

n~ 00 n~oo 

Consequently we have the following statement: 

The necessary and sufficient condition that the generalized RS-integral of 
a bounded function f (x) with respect to a monotone function <f> (x) exists is 
that , a system D of nets exists such that 

T T Ft_ l f(x).A x ^ 1 4 >(x), 

r~ 1 

taken for a net D n of the system , converges to zero , as n ~ 00. F x x r r _ x f (x) 
denotes the fluctuation of f (x) in the interval (x r _ x , x r ). 

It is not necessarily the case that, when Z exists, the numbers Sn n , Sj) n , 
for an arbitrarily chosen system of nets, converge to Z, although, as has 
been shewn, it is possible to choose the system of nets so that this is the 
case. For any particular system of nets lim Sp n must be £ Z, and lim S& n 

n~ 00 n~ao 

must be ^ I. Thus S& n and S& n cannot have the same limit, for n ~ 00, 
unless that limit has the value Z. 
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It may happen that the generalized I 2 $-integral exists in a case in 
which / (x), <f> (x) have one or more points of discontinuity in common, 
and when consequently the i?$-integral does not exist. It will however 
be shewn that, when the generalized /?$-integral exists, such a common 
point of discontinuity of f (x) and (f> (x ) must be an end-point of meshes 
of the special system D of nets. For, if possible, let a point f, which is, 
for every value of n , an interior point of a mesh d n of the system D , be 
a point at which / ( x ) and <f> (x) are both discontinuous. The difference 
Sj) n — $v n contains a term F dn f (x ). A^ <f> ( x ), where F^f (x) is the fluctua¬ 
tion of / (x) in d n , and A(a:) is the difference of the values of <f> (x) at 
the end-points of d n . Since d n contains the point f, both F(x) and 
A<*n <f> ( x ) are > for a d values of n, not less than fixed positive numbers; 
therefore Sn n — Sj) n cannot converge to zero, as n ~ o o. Thus no such 
point as £ can exist. 

It will further be shewn that, when I exists, and when / (x) } <j> (x) are 
both discontinuous at an end-point of meshes of the system 2 ), / (x) must 
be continuous at the point on one side, and <f> (x) must be continuous at 
the point on tlie other side*. 

We have, for a mesh of the net D n 

C/(x).At, 4> (x) s <f> (X) 

+ uz:; + j(x). a%:; + . $ (x) + uZ-jw. az~. 4, (*), 

where e is any positive number < \(x r - x rl ). Let c converge to zero; 
we then have 

U%-if (x)■ A*'_, <f> (x) a [<f> (x T _ 1 + 0) - <f> (x r _,)J 

+ UZr-i+o f (x) (x r — 0) — (j> (x r _j + 0)] 

+ MZf (x) [ 4> (x r ) - 4> (x r - 0 )], 

where M x f (x), Ml f (x) denote the maxima of / (x) at x, on the right 
and . left, respectively, and UZl?+of(x) is the upper boundary of / (x) in 
the open interval (x r _,, x r ). In a similar manner it can be proved that 

Lxr-i f (*) • A*>_, 4 (x) s Tn Xr _ 1 f (x) [ <f> (x r _j + 0 ) — <f> (x r _,)J 
+ Lxr-f+o f (x) [<f> (x r — 0) — <f> (x r-1 + 0)] 

+ rnlf(x) [</> (x r ) - 4>(x r - 0 )], 

where m*,/(x), mlf (x) denote the minima of f(x) at x, on the right and 
left, respectively, and LZl?+of(x) is the lower boundary of f(x) in the 
open interval (x r _ 1; x r ). 


* Another proof of this was given by Pollard, he. tit. p. 137. 
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We now have So So' a 7; So * So' /, where 

s D ' =T uz:? +0 f(x). a:;, 0 +0 </> (*) 

r-1 

r-ra-1 , 

+ L iC/(*) [* (x r + 0) - <£ (x r )] 

r- 0 

r ”Wi 

+ S If*/(*)[*(*r)-*(*r -<>)], 

r-1 

and $/>' denotes a similar expression, with m written for M ; and X for f7. 
It can easily be shewn, by re-introducing e, and proceeding to the limit, 
that $ 1 / is non-increasing, and Si/ non-diminishing as n is increased in D n . 
In case Si) n , S]j n converge to /, as D n are the successive nets of the system 
D, it is clear that Sz> n ', Sp n ' also converge to /. Now Sz> n ' — $d„' contains 
the terms 

( x ) - to*/ (a:)] [</> (* r + 0) - <f> (x r )] 

+ {Mlf (x) - m~f (x)] [<f> (x r ) - <f> (x r - 0)], 
and this will, for a fixed be a term of Sn n ' — S Dn ', for all values of n, 
from and after some fixed integer. Unless this term vanishes it is im¬ 
possible that SD n ' — S Dn ' should converge to zero, as n ~ oo. If both 
f (x) and <f> (x) are discontinuous at x ry the term can only vanish when, 

either M£f(x) = to* / (x), and <f> (x r ) = <j> (x r - 0), 

or else M Xr f (x) = m/ r f(x), and </> (x r -f 0) = (f> (x r ). 

Thus <f> (x) must be continuous at x r , on one side, and / (x) must be con¬ 
tinuous on the other side. 

The following theorem has now been established: 
rb 

If (G) f (x) <f> (x) dx exists , where <f> (x) is monotone, all the points at 
J a 

which f (x) and <f> (x) are both discontinuous must belong to a finite , or 
enumerable , set of points H; and at each point of H,f (x) must be continuous 
on one side and <f> (x) must be continuous on the other side. All the points 
of H must be end-points of meshes of a system of nets for which 

r TKlj(x). 

r-1 

converges to zero , as n ~ oo. 

It has also been shewn above that: 

rb 

In order that ( G) f (x) <f> (x) dx may exist , it is necessary that a system 

J a 

of nets D exists , such that 


converges to zero as n ~ oo, in the successive nets D n , of the system D. 
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This is an extension of the result at the end of § 331, for the R -integral. 
In case {" f (x) </> (x) dx exists as an _R,S-integral, D may be taken to be 

J a 

any system of nets. Thus we have the theorem that: 

For an RS-integral 

r -1 

converges to zero , for any system of nets. 

The generalized ^-integral of a bounded function / (x) with respect 
to a function of bounded variation </> (x), or P (x) — N (x) (see § 244) may 
be defined by 

(O) f / (x) d<f> (x) = (G) f f (x) dP (x) - (G) \'f (x) dN (x), 

J a .a -'a 

whenever the integrals on the right-hand side exist. 

377 1 . If, for the net I), or (a, x lf ..., 6 ), we substitute the net 
(a, a -f 6 , x x — e, x x + c, x 2 — t, x 2 -f e, ..., b — e, 6 ), where e is smaller than 
the least of the numbers \ ( x r — x r _ x ) y we obtain, when e converges to 
zero, the sums 

_ r ~ra r _ n _ n r=m 

Sj>" - s C + o/W.A::, (, + o < tW+ E + 

r“1 r“0 

r “ m r - ft ^ -ft r - TO 

Sz>" == £ L x ' r Z t +of (x) . A x :_ l+0 <f> (x) + S m* r /(x) [<£ (x r + 0) - <f> (x r - 0)], 

r-1 r“0 

where M Xf f (x), m Xt f (x) denote the maxima and minima of f (x) at x ry 
and (f> (x 0 — 0 ) denotes </> (a), and <f> (x m + 0 ) denotes <f> (b). These two 
sums may be employed to define an integral of / (x) with regard to the 
monotone function (x) in a manner precisely similar to that employed 
in the case of the generalized ^-integral. It will be shewn that such a 
definition leads only to the ordinary RS -integrals, and consequently affords 
an alternative definition of these integrals. It can easily be shewn, by 
re-introducing e, and passing to the limit € = 0 , that Sd" is not increased, 
and S D " is not diminished, when further points of sub-division of (a, b) 
are introduced. 

If SDn' ~ converges to zero, for a particular system of nets, it 
can be shewn, precisely as in § 377, that any point at which / (x), <j> (x) are 
both discontinuous must be an end-point of meshes of nets of the system. 
Also, since Sd” — &d' contains a term 

[M Xr f(x) - m Xf f(x)] [cf> (x r 4 - 0) - <j> (x r - 0)] 
which belongs to S Dn " - S Dn ” for all values of n, from and after some fixed 
one, this term must be zero. Therefore either <f> (x) or f (x) must be con¬ 
tinuous at x r ; and thus <f> (x) and / (#) have no points of discontinuity in 
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common. It is accordingly necessary and sufficient for the existence of 
the integral, that there be no points in common of discontinuity of / (#) 
and <f> (x), and that a system of nets exists such that 

r *= 1 

should converge to zero for the system of nets. 

For a particular net D of the system, we have 

r i m K:! +0 f(x).A?,:? + o4> (x) < h; 

r -1 

let another net D', all of whose meshes are < d, be superimposed on this 
net; the inequality will then hold for the new net (I), /)'). The number 
of meshes of D' which are not interior to meshes of I) is at most 2 m. 
At an end-point of a mesh of D, 

converges to zero with €, since / (#), <f> (x) are not both discontinuous 
at x r . It is accordingly possible so to choose d that the sum of the terms 
in the expression Sd> — §i>' > for the net /)', which correspond to meshes 
of D' that are not interior to meshes of Z), and of which the number is at 
most 2m, is less than The sum of the terms of Sn> — S &for the re¬ 
maining meshes of />' is less than J 77 ; therefore the sum for all the meshes 
of I)' is less than 77, provided d is sufficiently small. Thus, if the integral 
exists, it is such that 

r = m r 

2 F? r _J(x). A x ; _,<£(*) 

r «1 

for a net such that every mesh is of length < d, is < 77. This is the con¬ 
dition for the existence of the -integral of f (x) with respect to <f> (x). 
Therefore 

An RS-integral is such that a net exists * for which Sp" — Sjj" is 
arbitrarily small , when D is properly chosen. 

It has also been shewn that: 

The necessary and sufficient conditions for the existence of 

f / (x) d<f> (x), 

• a 

as an RS-integral are (1), that f (x), f> (x) have no points of discontinuity in 
common , and (2), that , if e be arbitrarily chosen , a net I) exists such that 

r 2"V;:.;’ 0 /<*). a^° +0 <t> < «■ 

r-1 

These conditions are equivalent to the necessary and sufficient con¬ 
dition given in the theorem of § 376 1 . 

* That this is the case was stated to the author by Prof. D. C. Gillespie of Cornell University. 
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378 . It was pointed out in § 252 that the upper and lower variations 
of a monotone non-diminishing function <f> (x) over any set of points in 
(a, 6 ) are the exterior and interior measures of the corresponding set of 
points f (= <f> (x)) in the linear integral of provided the closed interval 
{<f> (x y — 0), <f> (Xy +■ 0)} be taken to correspond, on the ^-segment, to a 
point x lf on the ^-segment, at which <j> (x) is discontinuous. If y (f) be 
defined on the ^-segment as having the value of / (x) at the point x which 
corresponds to the function x (£) is discontinuous at the point f corre¬ 
sponding to a point x at which / (x) is discontinuous and cf> (x) is continuous; 
but if (f> (x) is discontinuous, x (f) is constant in the closed interval corre¬ 
sponding to x , and is discontinuous at one or both of the end-points 
<f> (x — 0 ), <f> (x + 0 ) of that interval, in case/ (x) is also discontinuous at x. 

It will be shewn that: 

[b 

In order that (G) f (x) dxf> (x) may exist , where <£ (x) is monotone , it is 
J a 

ffi 

necessary that x (£) should exist as an R-integral; where a — <f> (a), 

ft = <f> (b). A fortiori the condition is necessary in order that the RS-integral 
rb 

f (x) d(f> (x) should exist. When the generalized RS-integral exists , its value 

la 

[ft 

is that of the R-integral x (f) • 


It has been shewn that the necessary and sufficient condition for the 
existence of the generalized jR$-integral is that, when e is arbitrarily 
prescribed, a net should exist such that 

r~m. 

S FZ_J{x).^^(x)<,. 

r * 1 

There corresponds to this net (a, x l9 x 2 , ..., b), a net (a, & , f 2 , , 0) fitted 

on to the ^-segment, where £ r = <f> (x r ), for r - 0 , 1 , 2 , , m. We have then 

r ~ ra t 
r ~1 

and that there should exist such a net, for each value of €, is the sufficient 

f/3 

condition that the I?-integral x (f) should exist. 

. a 

[b 

For each value of e, / (z) d<f> (x) lies between 

1 a 

r-m r-m 

2 U Xr _J (x). A av _ 1 (f> (x) and S Lxr-\f (#). <f> (x) 

r-1 r-I 

or between 


2 r - 

r-1 


r —wi f. 

ir-i) and S ^ (x) (f r — £ r . 

r-l 
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taking a sequence of values of c, converging to zero, and the nets corre¬ 
sponding to these values of e, it is clear that, since the last two of these 

sums converge, as c ~ 0, to |* x (f) the two first of these sums converge 

to the same value, and this is consequently the value of 

(G) [ / (x) dcf> (x). 

J a 

The converse of the above theorem does not hold good. It is possible 
[ft 

that x (£) may exist as an jR-integral in cases in which / (x) and f> (#) 

J a 

have points of discontinuity in common and are such that the condition 
is not satisfied that, at all such points, / (x) is continuous on one side of 
the point and <f> (x) on the other side. Thus the i£-integral may exist when 

rb 

(G) f (x) d<f) ( x ) does not exist. 

J a 

For simplicity let us consider the case when f (x), <f> (x) are both dis¬ 
continuous at a single point x in (a, b), and are elsewhere both continuous. 
The function x (£) has the constant value f (x) in the interval 

[<f> (x — 0), (f> (x + 0)]; 

its only discontinuities are at the end-points of this interval, and thus 

r/3 

x (£) d£ exists, as an /^-integral. A net may be fitted on to the interval 
J a 

(</> (a), </> (b)) of f, such that 

r - m t 

2 *t-,X (*>&-*,-,)<«■ 

r-1 

For those values of r which are such that f T _ x , f r are both in the interval 

[ <f> (x — 0), <f> (x + 0)] we have x (€) = 0. Also we may suppose that for 
some value of r, has the value <f> (x — 0), and for some other value of r, 
f r ~ <f) (x -f 0); for this may be secured by dividing those meshes of the 
net which contain the two points <f (x — 0), <f> (x + 0) into two meshes, 
and when this is done, the inequality above still holds good. Corresponding 
to the two meshes ($ 8 , <f> (x — 0)), (<f> (x 4- 0), f t ), of the net, we have the 
terms which are equivalent to 

(X) [<f> (X - 0) - <f> (*,)], F x ‘ f (X) [<f> (x t ) -</>(x + 0)], 

where = <f> (x s ), £ t = <f> (x t ). These two terms converge to zero as x 8i x t 
converge to x ; but the terms corresponding to the meshes (x 8y x), (x y x t ) 
in the expression for S D — S D in the corresponding net on the x-segment are 

</ <*) [<£ (x) - <f> <*,)], Kf (x) [4 (x t ) - 4 ($)], 
which do not in general converge to zero as x s , x t converge to x. 
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In case <f> (x) is of bounded variation and ( G ) [ /( x) d<f> ( x ) exists, it is 
expressible as (0) f (x) dP (x) - ( G) f* f(x)dN (x), where P (x), N (x) 

Ja J a 

rb 

are defined as in § 244. The integral (0) f (x) defy (rr) is then expressible 

J a ^ 

as the difference of the two 72-integrals corresponding to / ( x) dP ( x ), 

J a 
rb 

f (x) dN (x), respectively. 

J a 

rb 

It is possible to define the integral f (x) d<f> ( x ), where (f> (x) is mono- 

J a 

/*<#>(&) i , 

tone, as given by the 72-integral x ( x ) whenever the latter exists. 

J 6 (a) 


J <t> (a) 


This definition is more general than, but includes, the definition given in 
§ 377, of the generalized ^-integral. It may be adopted as the most 
general definition of an 72$-integral, and it will be extended in § 445 to 
the case of Lebesgue-Stieltjes integrals. 

378 1 . Let <f> (x) be the 72-integral f (j> x (x) dx , where <f> x (x) £ 0 , in 

J a 

(a, 6 ), and is bounded ; the function </> (x) is then monotone non-dimin- 

ishing, and is also absolutely continuous (see § 343); thus / (x) d<jy (x) 

J a 

exists as a Stieltjes integral, provided x (£) exists, and the two have 

J a 

the same value. In accordance with a theorem proved in § 360, if either 

ffi fb 

of the integrals y (£) d£, f (x) D<j> (x) c?x exists as an 72-integral, the 

- o .‘a 

other also exists and the two have the same value, since the condition is 

f b 

satisfied (see § 345) that I D<fy (x) exists as an 72-integral ; D<f>(x) denotes 

J a 

any one of the four derivatives of </> (x) and is equal almost everywhere 
to <fy x (x). If either of the integrals 

\ h f(x)D<t> (x)dx 9 \ b f(x)<f> 1 (x)dx 

•a J a 

exists, then both exist, and they have one and the same value, since 

f f ( x ) [D<f> ( x ) — <f>i (#)] dx has its integrand bounded, and almost every- 
J a 

where zero (see § 341). 
rb 

If I / (x) defy (;x ) exists, either as an 72$-integral or as a generalized 

j a 

re 

72 $-integral, then J x (£) exists as an 72 -integral. It then follows that 
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[ / ( x) D(j> (x) dx and ( f(x) <i>i ( x ) dx both exist, and have the same value 

J a J a 

as [ / (x) d<f> (x). 

• a 

rb rb 

If it be assumed that | / (x) (x) dx exists, then / (x) D<f> (x) dx 

'a J a 


exists, and consequently x (£) d£ exists, and has the same value, which 

a 

is then that of (0) j / (x) dcf> (x). 

J a 

The following theorem has now been established: 

If </>! (x) 0) is integrable (R) in (a, 6), and if f (x) is bounded in 

(a, 6), we have 

[ / (x) (x) dx = (G) I f (x) d<f> (x), where <f> (x) = [ <f >j (x) dx , 

J a J a J a 


'provided it is known that either of the integrals exists. 

This theorem was stated* by J. M. Whittaker, and he has given a 
direct proof of it. 


379. The fact that the generalized RS -integral of a bounded function 
/ (x) with regard to a bounded monotone function g (#) is represented by 
cp 

an /^-integral x (£) ma y employed to extend the properties of an 

- a 

i?-integral to the case of a generalized /^-integral, or in particular to the 
case of an /^-integral. The extensions may be made also to the case when 
<f> (x) is a function of bounded variation, since the generalized RS -integral 
is then representable as the difference of the i?-integrals. 

We have thus the theorem: 

If (G) f f(x)d<f> (a:) exists, then (G) [ / (x) d<f> (#), (G) [ f (x) d<f) (x) both 
J a J a J c 

exist, and 

(G) ( b f(x) d<f> (x) - (G) f C f(x) d<f> (x) + (G) ["/(x) d<f> (x), 

J a J a Jc 

where a < c < b, and </> (x) is either monotone, or of bounded variation; 
the converse also holds good. This theorem holds in particular for the 
Stieltjes integral, provided all three integrals are of this type. It should 

however be observed that f f (x) d<f> (x) is not necessarily an ^-integral, 

J a 

when I f(x)d<f) (x), / f (x) d<f> (x) are both i&S-integrals. For, taking the 
J a J c 

case in which <f> (x) is monotone, let / (x) be discontinuous at c on the right 
and continuous on the left, whilst <f> (x) is continuous at c on the right, 
* Proc. Lond. Math. Soc. (2), vol. xxv (1926), p. 213. 
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and discontinuous on the left. In the interval (a, 6 ), / (x) and <f> (#) have 
the common point of discontinuity c, whereas in the intervals (a, c), (c, b), 
the point c is not a common point of discontinuity of / (x) and <f> (a?); 
thus the integral over (a, b) cannot be an RS -integral, whilst the integrals 
over (a, c), (c, b) may both be integrals. 

Since (G) f f (x) d<f> (x) - f* x (f) ^» it follows that, at a point of con- 

tinuity of <f> (x) or f, (G) f f(x)d</> (x) is a continuous function of x . Also, 
J a 

at any point at which / ( x ) and <f> (x) are continuous, 

lim (G) f f (x) d</> (x)/{<f> (x + h) - <f> (x)} = f(x); 
h~ 0 J x / 

and this is a generalization of the theorem to which it reduces when 
<f> (x) — x, that J f (x) dx has a differential coefficient equal to / (x) at a 

• a 

point at which / (x) is continuous. 

The following theorem will be established: 

If I (x),f 2 (x) both have RS-integrals with respect to the monotone function 
<f> ( x ), then /j (x) + / 2 (a:) has also an RS-integral with respect to cf> (x), and 

f {/i (*) + U (*)} d<f> (*) =- f /i (*) d<f> (x) + [ f 2 (x) d<f> (x). 

Since the variation of </> (x) is zero over the sets of discontinuities of 
/x (x) and of / 2 (x), the variation is zero over the set of discontinuities of 
fi ( x ) +/a ( x )> an ^ therefore the ^-integral on the left-hand side exists. 
The equality then follows from the property 

r ixi ({) + x* m d€ = r Xi (0 d£ + f X 2 (() dt 

J a J a J a 

In general, the properties of the ^-integral, given in § 337, may 
be extended, in the manner indicated above, to the RS-integral, or its 
generalization. 

380. It will be shewn that: 

If <f> (x) is continuous and monotone in (a , b), and f (x) is bounded in 
(a, 6 ), then 

(G) f / (x) d<f, (x) - (G) f V (x) d<f> (x) = (G) f a> [/ (*)] dd> ( x), 

where a>[f (a:)] denotes the saltus of f (x) at the point x. 

Corresponding to a net fitted on to (a, b) in which the meshes are all 
of length < d , there is a net fitted on to (a, P) of which the lengths are 
all less than a number d\ The numbers d , d f converge together to zero, 
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on account of the uniform continuity of </> (x) in (a, b). To a system of 
nets fitted on to (a, 5), there corresponds a system of nets fitted on to 
(a, /?); and conversely. It follows easily that 

(#) I / (*) d <f> (*) = (x (£) an(1 ( / (*) d 4> (*) = r X (f) 

.'a Ja Ja -'a 

For a particular system of nets can be fitted on to (a, 6), such that, for 
the system the sums Sn n , converge, as w~oo, to the upper and 
lower generalized integrals of / (z) with respect to (:r). For the corre¬ 
sponding system of nets fitted on to (a, ft), the sums 

x ($). (f r -EAtj (f). tf r - 

converge to the upper and lower generalized integrals of / (#) with regard 
to f. But, on account of the property of the upper and lower i^-integrals 
given in §331, these sums converge to 

Jtt J a 

respectively. Therefore 

(G) f / (*) d<t> (*)=[' X (£) and (<?) [ / (*) # (*) = f X (£) 

Jo J a J a Jji 

It has been shewn in § 334, that 


\[ X (f) ^ - j' X (f) df = /' a, tx («] df. 

where co [x (£)] is the saltus of x {£) at and this is equal to the saltus of 
/ (x) at x. Moreover, applying the above result to the function w [/(*)], 
we have - b - 

J a °> [/ (*)] # (*) = J <» tx (£)] d £- 

The truth of the theorem now follows from the theorem of § 334. 

This theorem has been established* otherwise by Pollard. 

If Xi (f) be integrable (R) in the interval (a lf ft), and is zero outside 
that interval; and if x 2 (£) i 8 integrable (R) in the interval (a 2 , ft), and is 
zero outside that interval, we find by applying the theorem given in 
§ 351 to an interval ( A,B) which contains both the intervals (a l9 ft), 
(a 2 , ft), that 


[ Xi (£) \f *2 (i) d i\ d £ + ( X 2 (f) [ xi ({) d f[ d i 

J Oi J J a t |_J a, _J 

= [ xi (£) J X2 (f) d t 


Quarterly Journal , vol. xux (1923), p. 130. 
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Let <p (x) , ifi (x) be monotone bounded functions defined in the interval 
(a, b) ; and let f ( x), g (x) be bounded functions defined in the same interval. 
Let f (x) = {(), where f - </> (x), and let g (x) = x 2 (£), where i^ip(x)\ 

also let tt! — <f> (a), ft - <f> (b), a 2 - </- (a), ft - <(> ( b ); the above formula is 
then equivalent to 

\ f ( x ) f 9 ( x ) # (*) ( H ( x ) + f 9 ( x ) I / ( x ) ( i<t> ( x ) # (*) 

J a [_Ja J • a L* a 

= f f(x)d<P(x) [ g(x)dip(x) .(A), 

.a J a 

provided all the integrals in this expression exist as generalized RS- 
integrals. This formula (A) corresponds, for generalized i?$-integrals, to 
the formula of § 351, for /*!-integrals. It clearly holds when 0 (#), 0 (x) 
are functions of bounded variation, as the result is then obtained by 
applying (A) to the four pairs of monotone functions 

0 1 ( X )y 01 ( X ) j 02 (**')? 02 ( X ) ’ 01 ( X )y 02 (•*') i 02 ( X )y 01 ( X ) > 

where 0 (a:) - 0 X (x) - 0 2 (x), 0 (x) - 0 X (a;) - 0 2 (a:). 

rb rb 

Let us now suppose that / (#) d<f> (x), g (x) dip (x) exist as RS- 

J a -a 

integrals. In case <p (x), ip (x) have no points of discontinuity in common, 

| / (x) f g (x) dtp (x) d<f> (x) 

. a L-' a 

exists as an RS -integral. For, in order that this may be so, it is only 
necessary that the measure of 0 (#) over the set of discontinuities of the 

integrand should be zero. Since j g (x) dip (x) is discontinuous only at the 

J a 

points of discontinuity of 0 (x), which form an enumerable set, the measure 
of 0 (x) over this set is zero (see § 252); also the measure of 0 (x) over 
the set of points of discontinuity of / (x) is zero, since / (x) has an RS~ 
integral with respect to 0 (x). It follows that the measure of 0 (x) over 
the set of points of discontinuity of 

f{x) I g (x) dip (x) 

_J a 

is zero, and therefore 

f / (*) f 9 (x) <hp (*)] d<p (x) 

J a [J a J 

is an i?<S-integral. In the same way it is seen that the second integral 
in (A) is an AiS’-integral. 

If <p (x), ip (x) had a common point of discontinuity, <p (x) and 
f(x)[ g (x) dip (x) 

J a 
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would have a point of discontinuity in common, and thus the integrals 
in (A) would not exist as RS -integrals. 

It has now been shewn that: 

If (f> (x), ifj (#) are functions of bounded variation in (a, b), and have 
no point of discontinuity in common; and if f (u;) is bounded and possesses 
an RS-integral with respect to <p (x); and if g ( x) is bounded and possesses an 
Rti-integral with respect to ip (x); then the formula (A) holds good. 

This theorem was given* by Hardy who furnished a direct proof of it. 

EXAMPLES 

1. Let (x) = 0, for 0 ;> x < 4; <f> (x) = 1, for £ ^ x 1; and / (x) »1, for x ^ J; 
/(I) — 0. The function f (x) being discontinuous on both sides at the point where <f> (x) 

is also discontinuous, (G) j f (x)d<f>(x) does not exist; but j x (£) exists, and has the 

value zero; the interval (0, I) being <f> (J — 0), <f> (£ -f- 0), in which x (£) — 0. 

2. Let <f> (x) =■ x, for 0 x < 1; <f> (x) = 2x, for 1 ^ x 2; f (x) — 1, for 0 £ x ^ 1; 
/ (x) = 2, for 1 < x £ 2. The functions / (x), <f> (x) are both discontinuous at x = 1; but 
/ (x) is continuous on the left, and 4> (x) is continuous on the right of the point. The 
generalized ita’-integral exists. Moreover, in the intervals (0, 1), (1, 2), / (x) and <f> (x) have 
no point of discontinuity in common. In these intervals the /^-integrals exist, although 
the integral over the interval (0, 2) is not an /^-integral. 


THE R1EMANN-STIELTJES INTEGRAL FOR FUNCTIONS OF TWO VARIABLES 

381. Let a bounded function/(x (1) , a: (2) ), defined in a cell (a a \ a (2) ;6 (1) , 
6 (2) ), be bounded in that cell, and let <f> (x (1) , x {2) ) be a function defined in the 
same cell, and quasi-monotone, in accordance with the definition in § 255; 

thus, employing the notation there introduced, Ajji!' ^»)j <f> *<*’) £ 0, 

provided ;h (1) ^ x {1> , x (2) ^ x (2) . 

Let a net be fitted on to the cell in which the functions are defined, and 
let §!, 8 2 , ... b m denote the meshes of the net, and A* r <j> (x {1) , x {2) ) denote 

A( ( ^(>)]a %\<f> (# (1) , x {2) ), where (a r (1) , a r (2> ; £ r (1) , £ r <2) ) is the mesh 8 r ; more¬ 
over, let U (8 r ), L (8 r ) denote the upper, and the lower, boundary of 
/ (x (1) , x (2) ) in the cell 8 r . 

r “ m 

The sum 2 U (8 r ) A & r <f> ( x (l) , x (2) ) has a lower boundary, when all 

r -1 

possible nets are taken into account; and this lower boundary is said to 
define Tcfco, &<*>) 

/ (a& (1) , x {2) ) d<f> (x< 1 >,x (2 >), 

J a<*>) 

the upper integral of f (x (1) , x (2) ) with respect to <f> (# a> , x {2) ), in the cell 

b a \ 6 <2) ). 


Messenger of Math. vol. xlviu (1918), p. 90. 
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Similarly, the lower integral of f (x a> , z ,v ) with respect to <f> x l2> ) is 

r «= m 

defined as the upper boundary of 2 L (8 r ) A s r <f> (x (1> , x i2) ), for all possible 

r -1 

nets fitted on to the fundamental cell, and this lower integral is denoted by 


/*(&<!), 

f(x^,x^)d<f> (x“\x< 2) ). 

In case the upper and lower integrals of f (x {1) , x {2) ) with respect to 

r(bO),bO)) 

<f> (x {1) , x {2) ) are equal , their common value I / ( x {1) , rr (2) ) (# (l> , # (2) ) 

J (a<‘\ a< 4 >) 

w taken to define the Riemann-Stieltjes , or RS-integral, of f (x {1) , x {2) ) with 
respect to f> (x ll) , x {2) ). 

In case <f> (x {1) , x i2) ) is simply the product £ (1, .# (2) , we see that 
A & r 4> (# (1) , # (2) ) reduces to (/3 r (1) — a r a) ) (P r (2) — a r (2) ), the measure of the 
cell 8 r ; the RS -integral is then the /^-integral, as defined in § 338. 

If <f> (x {1) , x (2) ) be a function of bounded variation, in accordance with 
the definition of Hardy and Krause, given in § 254, it is expressible as the 
difference of two quasi-monotone functions P (# (1) , x i2) ), N (x {1) , # <2) )> ( see 
§§ 254, 255), which are also monotone. The i?$-integral of f (x {1) , x {2) ), 
with respect to cf> (x (1) , x {2) ), is defined by 


/, 


(bO), &<*>) 


a< 2 >) 


f{x^,x^\df> x< 2 >) 


fibi'K&v) _ /•(&<»>,&<*)) 

- f(x^\x^)dP(x (l \x {2) )- f(x lv , x (2) ) dN (x"\x (2) ). 


whenever the two latter integrals exist. 

It can be shewn that a continuous function / (x {1) , x (2) ) certainly has 
an i?$-integral with respect to a function </> (a; (1) , x {2) ), of bounded varia¬ 
tion. We need only consider the case in which <f> (x (1) , x i2) ) is quasi¬ 
monotone. In case the spans of all the meshes of a net, fitted on to the 
fundamental cell, are sufficiently small, 


r*» m r —wi, 

2 U (S r ) A Sr cf> (x {l \ x {2) ) exceeds 2 L (8 r ) A$ r <j> (# (1) , x {2) ) 

r-l r -1 

by less than e Af a d)\ f> (rr (1) , x {2) ), which is an arbitrarily small number. 
Consequently the upper and lower integrals of/ (z (1) , x {2) ) with respect to 
<f> (z {1) , x {2) ) have the same value, and the ^-integral of f(x {1) ,x (2) ) 
therefore exists. 


If f(x {1) , x {2) ) have discontinuities, the criterion for the existence of 
the integral with respect to the function <f> (# (1> , x {2) ) is contained in the 
following theorem*, which may be established by a modification of the 
method of § 377. 

* See W. H. Young, Proc. Lond. Math. Soc. (2), voi. xvi (1916), p. 281; also Proc. Roy. Soc . 
vol. xcra (1917), p. 28. 
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The necessary and sufficient condition that the bounded function f (x a \ x {2) ) 
should possess an RS-integral with respect to the function </> (x {l) t x i2) ) y of 
bounded variation , in accordance until the definition in § 254, is that the 
variation of <f> (a? (U , x {2) ) over the set of points of discontinuity of f (x (1) , x {2) ) 
should be zero. 

The variation of a quasi-monotone increasing function <f> x {2) ), 

over a set of points G, is here taken to denote the lower boundary of 
2 A(a; (1) , x {2) ), where the summation is taken for all cells of a set which 
contain in them all the points of G, the lower boundary being taken for 
all such sets. The variation of a function <f> (x {1 \ x (2) ), of bounded varia¬ 
tion, over the set G y is taken to be the sum of the variations over G of the 
two monotone increasing functions, P (x {l) , x {2) ), N (x (1) , x {2) ), the differ¬ 
ence of which is </> (£ (1) , x (2) ) (see § 254). 

Further properties of the /^-integral for functions of two variables 
will be given in § 448 1 . 



CHAPTER VII 

THE LEBESGUE INTEGRAL 

382 . The definition of a definite integral which has been introduced 
into Analysis by Lebesgue is of much wider scope than the definition of 
Riemann. The theory of Lebesgue integration has as its foundation the 
conception of the measure of a set of points, in the sense in which the 
term is employed by Lebesgue. An account of this theory of measure 
has been given in Chapter m. Jn Riemann integration, the domain over 
which the integral is taken is divided into a finite number of intervals, 
or of cells, and the integral is defined as the limit of the Riemann sum for 
this set of intervals, or cells. In Lebesgue integration, on the other hand, 
the domain over which the integral is taken is divided into a number of 
measurable sets of points, having a certain property relative to the 
function to be integrated, and the integral is defined as the limit of a 
certain sum taken for all these measurable sets of points, as the number 
of sets is indefinitely increased. The distinction between the Lebesgue 
integral and the Riemann integral rests essentially upon the difference 
between the two modes Of dividing the domain of integration into sets of 
points. 

MEASURABLE FUNCTIONS 

383 . The definition of Lebesgue is applicable to functions belonging 
to the family of measurable functions, in one, or more, dimensions. 
A measurable function / ( x) has already been defined, in § 295, as such 
that the set of points x, for which / (x) > A , is measurable, whatever real 
number A may be. This definition is applicable, whether x denote a point 
of a linear set, or denote a point (x n) , x {2) , ... x ip) ), in any number p , of 
dimensions. 

If f (x) be a measurable function , defined at each point of a given domain , 
the sets of points for which 

A <f(x) < B; A £f (x) < B\ A ^f(x) ± B;f(x) < A;f(x) £ A 

are all measurable , whatever real numbers A and B may denote , provided 
A<B . 

In the first place, the domain for which / (x) is defined, and for which 
it has a definite value at each point, is measurable. For let A have the 
values — N l9 — N 2i ... — N n , ... successively, of a sequence such that N n 
increases indefinitely as n ~ o o. The set E n , for which f (x) > — N n , is 
measurable, by hypothesis, for every value of n. The domain for which 
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/ (a?) is defined is the outer limiting set of the sequence {E n }, of measurable 
sets, and is therefore itself measurable. The set of points for which 
/ (x) £ A, being complementary, relatively to the domain of the function, 
to the measurable set for which f (x) > A , is measurable. If {A n } be a 
monotone increasing sequence of numbers converging to A, all the sets 
for which f (x) s A n are measurable, and their outer limiting set, for 
which f (x) < A , is consequently measurable. As the sets for which 
/(*) < A and f (x) £ A are measurable, it follows that the set for which 
/ (x) = A is measurable. 

The sets for which / ( x) < B, and / (x) < A, being measurable, their 
difference, the set for which A s / (x) < B, is also measurable. The other 
results in the theorem follow at once. 

A junction f (x) is measurable if the set of points x is measurable , for 
which a < f (x) < /?, for every pair a , /?, of real numbers which belong to a 
given set , everywhere dense in the indefinite interval (— oo, oo). The given 
set may be taken to be enumerable . 

Let A and B be any pair of real numbers such that A < B. The 
number A can be expressed as the upper limit of a sequence {a n } of 
increasing numbers, all of which belong to the given everywhere dense 
set; and the number B can be expressed as the lower limit of a similar 
sequence {f$ n } of diminishing numbers. The set e n for which a n < f (x) < fi n 
is measurable, for each value of n; the inner limiting set {e„}, of the se¬ 
quence, is the set for which A £ / (x) s B ; and this set is consequently 
measurable. Since this is the case whatever values A and B may have, 
it is seen, as before, that / (#) is measurable. 

A function f (x) is said to be measurable (B), if the set of points for which 
f (x) > A is measurable (B) } whatever value A may have. 

The proofs given above shew that the sets for which 
A <f(x) <B ; A &f (x) < B \ A &f(x) z B ; f (x) < A ; f (x) £ A 
are all measurable ( B ). 

384. If <f> l7 <f> 2 , ... <f> n be a finite set of functions that are measurable in a 
measurable domain G , linear , or of higher dimensions , and if F <f> t , ... <f> n ) 

be a function that is continuous relatively to (<f> Xi (/> 2 , ... f> n ), for all values of 
<f>i, <f> 2i ... <f>n> th en E (<f> x , <f> 2 , ... </> n ) * 8 measurable in the given domain. 

First, let us assume that all the functions <f > t , <f > 2 , ... <f> n are bounded in 
the given domain for which they are defined; suppose their values all to 
be in the interval (— N,N). Let a net (c 0 , c lt ... c m ) be fitted on to the 
linear interval (— N, N), where c 0 = — N, c m = N , and suppose the breadth 
c r — c r _! of each mesh to be less than the positive number 97 . Let the 
function ip 9 be defined, corresponding to each function <f> 8 (s = 1 , 2 , 3 , ... n) 
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by the conditions fi s = c f _j at every point at which c r sfi a <c r) for 
r — 1, 2, 3 , ... m, and </r 5 = c m where fi a = c m . We have then 

0 s <f> s — ^ < 17 , 

and the function fi s taking only the values in the finite set c 0 ,c J} ... c m , 
this function is measurable in the given domain. 

Since F (fi x , fi 2 , ... <f> n ) is continuous in the closed domain, 

N a ...) a 

we have | F (fa, fa, ... fa) - F (fa, fa, ... fa) | < e, 

provided 77 be taken sufficiently small; the number € being arbitrarily 
chosen. The function F fi 2) ... fi n ) has only a finite set of values, and 
is measurable. If U and L are its upper and lower boundaries, we have 
L — € < F (</>! , fi 2 , fi n ) < U -f € 

in the whole domain. Let A and B be any two numbers in the interval 
(L, U), then the set of points for which A < F (fi x , fi 2 , ... fi n ) < B is 
measurable. 

Now let e have successively the values in a sequence {ej which con¬ 
verges to zero, then there exists a corresponding sequence {77 J, of values 
of 77, which converges to zero. 

The set of points E t , for which A < F (fa, fi 2 , ... fi n ) < B , is measur¬ 
able, for each value of 7 ] ti in {77J. Each point of the set for which 
A < F (fi lf fi 2 , ... fi„) < B 

belongs to all the measurable sets E t , from and after some particular 
value of t , and therefore, by a theorem established in § 131 , the set is 
measurable. It has thus been shewn that F (fi x , fi 2 , ... <f > n ) is measurable 
in the domain for which the functions are defined. 

Next, let the functions fi lf fi 2 ,... fi n be unbounded. Let fi r iN) be defined 
by the conditions fi r {N) = fi ry when N ^ fi r & — N ; fi r (N) = N, when fi r > N; 
and fi r {N) = — N, when <f> r < — N. From what has been proved above, 
we see that the function F ( fi 1 iN) , ... fi n iN) ) is measurable. Let N have 
successively the values in a divergent sequence {NJ of increasing numbers. 
Each point of the set for which A < F {fi lt fi 2 , ... <f> n ) < B belongs to all 
the measurable sets for which A < F (< f> x iN fi 2 (N d, ... fi^ Nt) ) < from and 
after some particular value of t. It thus follows that the set is measurable. 
Therefore the theorem holds when fi l9 fi 2i ... <f> n are unbounded. 

In particular we have the theorem that: 

The sum , or the product , of any finite number of measurable functions , 
defined in a measurable domain of any number of dimensions , is a measurable 
function. 

If all the functions fi lf fi 2f ... fi n are measurable (B), the function 
F (fi lf fi 2) .. fin) is measurable (J5). For the sets employed in the above 
proof are all measurable (B). 
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THE LEBESGUE INTEGRAL OF A MEASURABLE FUNCTION 

385 . Let us consider a measurable set e, bounded or unbounded, but 
of finite measure, and let the function / (x) — 1 at all points of e. The 

measure m (e), of the set e, is said to define the integral I / (x) dx , of the 

J (e) 

function / ( x ), over the set e. 

If c be any positive, or negative, number, and / (x) — c at all points 
of e, j f (x) dx is defined to be the number cm (e). The values of / ( x) at 


points that do not belong to e are irrelevant. Next, let e lt e 2 , ... e n be a 
finite number of measurable sets, no two of which have a point in common. 

Let/ (x) ■= c r at the points of e r , for r = 1, 2, 3, ... n ;; where e lt c 2 , ... c n 
are assigned numbers. Then, if E denote the set e x -f e 2 -f ... -f e n , the 

integral / f (x) dx, of / (x), over the set E , is defined as the sum 

J (E) 

r n 

2 c. r m (e T ). 

r -1 

Next, let / (x) be a measurable function, defined for the points of a 
measurable set E, of finite measure, and bounded in that set. Let U, L 
denote respectively the upper, and the lower, boundary of / (x) in E. 

Let the interval ( L , U) be divided into parts 

(a 0 > a i)i ( ( h 5 ••• i a n -1 > a rt) j 

where a 0 = L, a n — U, and such that the greatest of these parts a r — a r _ lT 
for r = 1, 2, 3, ... n, is < rj. 

Let e r be that measurable part of E, for all points of which 

«r-l s/(x) < a T , 

where r = 1, 2, 3, ... n — 1; and let e n be that part of f (x) for the points 
of which a n . x < f (x) ^ U. 

Let (x) be the function which has the value a r .. l5 at all points of e rt 
for r 1, 2, 3, ... n \ and let i/j n (a;) be the function which has the value a r , 
at ail points of e r , for r = 1, 2, 3, ... n. 

We have then, in accordance with the above definition, 


r r ~ n 

(f> n (x) dx = S a r ~ x m (e r ), 

J(E) r 1 

j* r ■ n 

i/j n (x) dx = S a r m (e r ), 

J(A’) r * 1 

and therefore 0 s \jj n (x) dx — (x) dx < rim (E). 
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If the interval (L, U) be successively sub-divided by introducing further 
points of division such that the corresponding values of 77 form a sequence 
{rj m } of diminishing numbers for which rj m ~ 0 , as m ~ 00, it is seen that 

f <f>m ( x ) dx 

does not diminish, and that tfj 0m (x) dx does not increase, as m'~ 00. 

These two sets of numbers consequently converge to a common limit. 

This limit lim <f> nm (x) dx == lim ift nm (x) dx is defined to be the value 
rn —qc J (E) J (K) 

of the Lebesgue integral / (x) dx, of / (a;), taken over the measurable 

Jan 

set E. 

In order to justify this definition, it is necessary to prove that the 
number so defined is independent of the particular mode in which the 
interval (L, U) has been successively sub-divided. 

Let <£,, (x), if/ n (x) be the functions which correspond, in a second mode 
of sub-division, to <f> v ( x ), ip n (x) defined above. It is easily seen that there 
is no loss of generality in taking the sequence {y n } to be the same in the 
two cases. 


Suppose the two sub-divisions of (L, XJ), corresponding to rf m , to be 
superimposed, and let (f> nm (a?) be the function corresponding to <f> nm (x) and 
<j> lm (*)• We have then 


0 S [ <t>1m ( x ) dx - I tlm (*) dx<r lm m (E), 

0 S I (*) dx - I <t>lm ( x ) dx < Vm™ (E), 

J (E) (E) 


and therefore </>,,„ (x) dx - <f>, m (, x) 

(E) .’(E) 


dx 


<r) m m (E). 


As to ~ 00 , 77 m m (E) ~ 0 ; and therefore 

Iim [ 4>>, m (*) dx = lim I 4> nm (x) dx. 

m~co J (E) m~aa J (E) 

Thus the number by which the Lebesgue integral is defined is inde¬ 
pendent of the mode of sub-division of the set E. 


Accordingly, the definition may be stated as follows: 


The Lebesgue integral f (x) dx of the bounded measurable function 

HE) 

f (x), taken over the measurable set E which is of finite measure , is defined as 


r -n r»n 

the limit to which either 2 a r _ t m (e r ), or 2 a r m (e r ), tends , as the greatest 

r ~ 1 r -1 
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of the numbers a r — a r _ x converges to zero \ where ( L , U) is divided into n parts 
(a r l , a r ), for r = 1, 2, 3, ... n \ and a 0 — L, a n = C7; and, where e r is that set 
of points of E, for all of which 

a r _i ^ f (x) < a r \ r = 1, 2, 3, ... n — 1; 
and e n is that set for which a n _ x t' / (#) a n . U and L denote the upper and, 
lower boundaries of f (x) in E. 

In case the set E is a set in ^-dimensional space, / (x) is used to denote 
/ (x {1 \ x {2) , ... x ip) ). 

If the set E consists of the points of a finite interval (a, 6), or of a cell 

[0 

(a, b), the integral over E is denoted by f (x) dx, 

J a 

r(W'), ... b<»>) 

or f{pd l \x'*\ ... x^)d(x^,x^\ ... 

J (a (,) , « (2) , ... a ( P>) 

It will be observed that the function <f> 0 (x) is a function which takes 
only a finite number of values, for all values of x in the set E; and that 
it is such that 0 5 / (x) — </>,, (x) < r). 

The integral / (x) dx is thus defined as the limit of the non- 
hv) 

diminishing sequence of numbers 

I K( x ) dx > \ (X)dx, ... [ 4>„ m (x) dx, 

HE) HE) J(E) 

where r) l , rj 2 , ... , ... form a sequence of diminishing numbers that con¬ 

verges to zero, as m ~ oo. 

386. Next, let the measurable function / {x) be unbounded in the 
measurable set E, of finite measure, and such that f (x) s 0, in E. 

Let /jv (x) be defined as a bounded function in E , by the specifications, 
In (*) =- / (x), where f (x) N, 
f N (x) — N, where / (x) > JV. 

The number N is any assigned positive number. 

If there be assigned to N the values in an increasing sequence N 1 , N 2 ,... 
without upper limit, the integral fx {x) dx is a number which does not 

J(E) 

diminish as N has successively the values in the sequence. It follows that 
f N (x) dx either converges to a definite upper limit, or that it increases 

HE) 

indefinitely as N does so. 

When lim f N (x) dx exists as a definite number > the integral f (x) dx 

N~*>J(E) * J(E ) 

is defined by that number. 
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The value of the integral is easily seen to be independent of the par¬ 
ticular sequence of values of N. Thus 

f f (a:) dx - lim f f N (x) dx. 

J (K) 

If / (#) be ^ 0, in E , / (x) dx is similarly defined as 

'(E) 

lim /-a (x) dx , 

A r ~oo J(E) 

where f .. N (a:) — / ( x ), for points x at which / (x) > — N , and / a- (x) = — N, 
when f (x) < — N. 

Lastly, let the unbounded function / (x), defined in E, have both 
positive and negative values. 

Let / (x) = /+ (x) - f~ (x), where /+ (x) - f (x), when / (x) - 0, and 
/+ (a:) - 0, when / (x) < 0; with a similar definition for /- (a:). Both the 
functions /+ (x),/~ (x) are measurable. 

The integral f (x) dx is defined as the difference , 

' (E) 

f f 4 (*) rf* - [ /" (*) 

provided both of the latter integrals exist as finite numbers. 

When a measurable function f (x), of one or more variables , is such that 

f (x) dx exists as a finite number, f (x) is said to be summable in the set E. 

HE) 

A measurable function is always summable if it be bounded, but not 
necessarily so if it be unbounded. 

A function / (x) that is summable in E is also said to be integrable (L) 

in E, and the integral f (x) dx is termed the Lebesgue integral , or shortly 

•' (E) 

the L-integral of / (x) in E . 

387. It will be observed that, in accordance with the above definition, 
in order that a measurable function / (x), defined for the measurable set 
of points E, may be summable in E, it is necessary that each of the two 
functions / + (x), f~ (x) should be summable, that is, each one of them 
must be integrable ( L ) in E. Thus the two limits 

lim [ (x) dx, lim j f x ( x ) dx 

N~aa](E) N~xJ(E) 

must be both finite. 

The definition may accordingly be stated in the following form, which 
is a generalization of a definition* due to de la Vallee Poussin, originally 


* Liouville’s Journal (4), vol. vm (1892), p. 427. 
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applicable to the case in which /+. (x), f~ (x) are restricted to be integrable 
(E) in a cell or interval (a, b). 

If the unbounded measurable f unction f (x) be defined in a measurable set E, 
of finite measure , and N and N' denote two positive numbers, let fx t N' (x) he 
suck that /at, (#) =- / (x),at points such that — N' -f {x) < N ; a ndf Nt n- (*r) = N 
at points such that f (x) > N; and f\, (#) ~ — N r , when f(x) < — A 7 ' ; then 

f ( x ) dx is defined as the double limit of /. v N’ (x) dx, as N and N' 

HE) '(E) 

diverge, independently of one another, to oo, whenever this double limit exists 
as a definite number. 

Tn accordance with this definition, when / (x) is summable, so also is 
| / [x) |; for | f (x) | — ff, (x) + fZ (x), and therefore f \f(x)\ dx is the 

lim f ft / (x)dx and lim [ f~(x)dx, 

N~ao J (E) A’~oo J(E) ' 

each of which is finite. 

Since the existence of / (#) dx involves that of | / (.r) | dx, the 

■ (E) J (E) 

A integral f(x) dx is said to be an absolutely convergent integral, thus : 

HE) 

The L-integral of an unbounded summable function is an absolutely con¬ 
vergent integral. 

It is easily seen that / (x) dx > \ f (x) | dx; for the absolute 

HE) I HE) 

value of the integral on the left-hand side is 

* \ f + (x) dx + [ f~ (x) dx. 

HE) HE) 

If / (l) (x) i- f {2) (x) > 0, the function f {1) (x) being summable in E, then 
the function / <2) (x), assumed to be measurable in E, is also summable. 
For we see that /^ (x) dx s f ( f (x) dx, hence 

f fjPMdz* \ f™(z)dr, 

HE) HE) 

and it then follows that / (2) (z) dx exists, and is & / (1) ( x) dx. 

HE) HE) 

388. The above definition of the //-integral, when f.(x) £ 0, in E, and 
the function / (x) is unbounded, may be replaced by another definition 
which we proceed to obtain. 

Let a 0 , a x , a 2 , ... a n , ... be an increasing sequence of numbers, such 
that a n has no upper boundary, as n ~ oo, and where a 0 = 0. Also let 
a r - a r _! & rj, for every value of r. 
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Consider the limiting sums a - Js « M » (e r ), o' = 2 a r m (« r ); where, as 

before, e r is the measurable set of points at which a r , s f (x) < a r . The 

difference of the two sums <r, a' is 2 (a r - a,.,) m (e r ), which is not greater 

than r,m (E). It follows that a, o' are both finite, or both infinite. Now 
fet a„, Oj , a 2 , ... be the end-points of the meshes of a net, fitted on to the 
infinite linear interval (0, oo); and consider a system of such nets, for 
which v has the values in a sequence { 7 ? n }, of diminishing numbers, that 
converges to zero. If it be assumed that a, o' are finite, it is clear that 
a has the values in a non-diminishing sequence, and that o' has the values 
m a non-increasing sequence, as the successive nets of the system are 
taken. It follows that o and o' have definite limits, as n ~ oo, and these 
limits must be identical, since o' — a ~ 0, as n ~ oo. 

It can be shewn that this limit is independent of the particular system 
of nets employed. For, if < refer to one system {D}, of nets, and 
a 2 , * to a second system {/>'}, we may take nets I), D' of the two systems 
such that for D, o ' - cq s v m ( E ), and for D', a 2 ' - V m (E). Now 
consider the net (D, D ) obtamed by superimposing the two nets, and 
let o be the corresponding value of a', then S' is s a/, and a 2 '; therefore 
a o, i, 7)m(E), and <x' - <r 2 is (E), and moreover 5 ' - a . „ 
are both i 0 . It follows that | a, - a 2 | s v m (E); and thus the limits of 
°i and <t 2 , as 77 ~ 0 , are the same. 

The limit as n ~ oo, of lta r ^m (e r ), which is identical with the limit of 

Q ' m the nets °f a systern {£>„}, defines the value of f / (x) dx, far 

an unbounded non-negative function f (x), measurable in the measurable set E 
whenever this limit exists . ’ 

givJn r§ 386 SheWn that thiS definition is com P letel y equivalent to that 

Let us suppose that a. = N, a fixed positive number; then if a be 
convergent, we have a «,) + i„ a' - 'i\m (e r ) + Rf, where 

N can be chosen so large that R a and R,' are both less than e . 

Assuming that fj <*, * i, the limit of (,,, + , or the 

system of nets, the integer « so varying that a'~N, we see that 

Jn (x) dx 


is the limit oi T i\ r m (e r ) + R/f where R „ R 

T 1 »8 


(E) 


< €. 
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Since J?/, R " differ from one another by less than e, it follows that 
/ (x) dx, f N (x) dx differ from one another by not more than e; the 

•' (E) J ( E) 

number N having been chosen sufficiently large. It follows that 
lim I f N (x) dx - f / (x) dx. 

N~a o.'(A’) J(E) 

Conversely, let f (x) dx be defined to be lim [ f N (x) dx, then 

'(E) A’-a oJ(E) 

f (x) dx is defined as the repeated limit 

HE) 

( r 8 *> ) 

lim lim j E a r x m (e r ) -f IV E m (e r )k 

A T —qo CC (r “1 r-s+1 J 

w denoting the order of the net (a 0 , a lf a 2 , ...), an d s denoting the integer, 
dependent on n , for which — IV. We assume that this limit exists. 


The sum £ a r _ x m ( E r ) is non-diminishing as n increases, and also 

r -1 

r 

as s increases, hence the repeated limit lim lim E a r ^m (e r ) must exist, 

N~ oo n~ ao r 1 

r 

as a finite number. Denoting E a T _ x m (e r ) by c nN , we see that c wA r is 

r *=1 

such that c nN ' > c niV , if w' £ n, IV' ^ IV. If we write £ 1/n, rj — 1 /IV, 

we may regard c nAf as a function F (£, 77 ) of the two variables £, 17 ; and the 
function is monotone (see § 307). In accordance with the theorem proved 
in § 307, the repeated limits of F (£, 77 ), as f — 0 , 17 ~ 0 , have one and the 
same finite, or infinite, value. That F (£, 17 ) is defined only when £, rj are 
the reciprocals of positive integers makes no difference as regards the 
validity of the theorem. We now see that, if either of the repeated limits 
lim lim c nN , lim lim c nN exists, as a finite number, the other exists, and 

n~oo N~~a 0 A^-ce n~ao 

has the same value as the former. 


We thus see that lim lim E a r _ x m(e r ) has a finite value. It follows 

n~- 00 N~~ 00 r - 1 
00 

that E a^m (e r ) is convergent, and that 

r -1 * 

00 r * s 

lim E o r _ x rM ( e r ) = lim lim E a r _ 2 m ( e r ). 

n~® r»l A~oc n~oo r - 1 

oo oo 

Now N E m (e r ) < E a M m(e r ); hence, if IV be sufficiently large, 

r-$+l r~ 8 +1 

oo oo 

N E m (e r ) becomes arbitrarily small, and lim N E m ( e r ) = 0. We 

r-s + l A r ~ oo r-s + l 

thus have, since this limit is independent of n , 

r rv-A n _ oo 

/ (x) dx = lim lim E a r ^vn (e r ) — lim E a r _ x m (e r ), 

J (E) n~~ac N~<*> [_r1 J n~oo r~l 
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f 00 

/ (#) dx, as lim 2 a r „ x m (e T ), is 

(E) 71-qo r -1 

equivalent to the definition as lim Jn (x) dx. 

iV-'Oo . (E) 

The following theorem has been proved in the course of the above 
proof : 

Iff (x) be a summable function which is s 0, in the set E, then 
lim Nm (E (N) ) = 0 , 

A r ~oo 

where E (N) is that part of E in which f (x) £ N. 

When / (x) is not restricted to be s 0 in E, we see from the definition 

of / (x) dx as /+ (x) dx — I f~ (x) dx , that the integral may be de- 
Ue) ]{E) J(E) 

QO 00 

fined * as the limit of either of the expressions 2 a r _ x m (E r ), 2 a r m (E r ), for 

—CO -CO 

a system of nets (... — a_. 2 , — a 0 , a lr a 2 , ...) cm the indefinite 

interval (— oo, co). 

We have also the following theorem: 

If f ( x ) be summable in the measurable set E, and E (N) denote that part 
of E for which | / (x) | ^ N, then lim Nm (E {N) ) = 0 . 

A 7 —oo 


OTHER DEFINITIONS OF AN INTEGRAL 

389. A general theory of integration has been developed by W. H. 
Young, independently! of the work of Lebesgue, in two memoirs. In the 
second of these memoirs, the theory there developed is brought into 
relation with the work of Lebesgue. The definition of W. H. Young is a 
generalization of that of the upper, and the lower, l?-integral, in which 
the domain is divided into a number of parts, of a special kind. W. H. 
Young defines the generalized upper and lower integrals of a function by 
means of a division of the measurable domain of the independent variable 
into sets of measurable parts. This definition may be formulated as 
follows: 

Let the measurable set E, which is the domain of integration , be divided 
into a finite , or enumerably infinite , set of measurable components , and let 
the measure of each component be multiplied by the upper boundary of the 
function in that component , and the sums of all such products be formed. 
The generalized upper integral is defined to be the lower limit of that sum , 
for all possible modes of division of E into components , as above described. 

* Thisis the definition given by Lebesgue; see his memoir “Integrale, Longueur, Aire,” Annali 
di Mat. vol. vii (1902), p. 258. 

t See the paper “On upper and lower integration,” Proc. Lond. Math. Soc. (2), vol. ii (1905), 
p. 52, and also “On the general theory of integration,” Phil. Trans, vol. cciv (1905), p. 221. 
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The generalized lower integral is defined, in a similar manner , by employing 
the lower boundary of the function in each component. If the generalized 
upper and lower integrals both exist, and have the same finite value, that 
value is said to be the value of the integral of the function over E. 

This definition can be applied, not only to bounded functions, but 
also to unbounded functions. In the latter case, such* sets only are em¬ 
ployed as have finite upper and lower boundaries of the function. It will 
be shewn in Vol. II that this definition is equivalent to that of Lebesgue. 

The definition of an integral has been further generalized by Pierpont, 
so as to apply to the case in which the set E is non-measurable. 

A set H is said to be measurable relatively to E (which may be non- 
measurable) if there exists a measurable set G such that H == D (G, E). 
Thus H is that part of E which E has in common with a measurable set. 

Pierpont’s definition! is obtained from that of W. H. Young by con¬ 
sidering those systems of sub-division of E into components which are 
such that each component is measurable relatively to E. In forming the 
expression, of which the lower boundary is the upper integral, we employ 
the exterior measure of each component of E, and the same in forming 
the expression of which the upper boundary is the lower integral. With 
these changes in the statement of the definition of W. H. Young, we 
obtain the definition of Pierpont. 

Another mode of defining an integral has been developed by W. H. 
Young. As this depends upon the theory of monotone sequences of 
functions, it will be referred to in Vol. II, in that connection. 

Another definition of an integral has been given by Borelf. 

THE L-INTEGRAL AS THE MEASURE OF A SET OF POINTS 

390. It has been shewn, in § 336, that the jR-integral of a function 
expresses the measure of a set of points that is measurable (J). We 
proceed to shew that the L-integral has a similar relation to a bounded 
and measurable set of points. This relation may be expressed in the 
following theorem: 

If the summable function f ( x ), defined for the bounded and measurable 
set of points E, be such that f ( x) £ 0, in E, then the set of points defined by 
0 £ y &f(x), x in E , has for its measure , plane , or (p -f 1) dimensional , 

according as E is a linear , or a p-dimensional, set , the value of f ( x ) dx . 

HE) 

* See Hildebrandt, Bulletin Amer. Math. Soc. vol. xxiv (1917), pp. 120-123. 

t Theory of Functions of Real Variables, vol. n, pp. 343 et seq. 

t Journal de Math. (6), vol. viii (1912), pp. 199-205; also Lemons sur la thdorie des fonctions , 
2nd ed. pp. 217-256; see also Hahn, Monatshefte fur Math. u. Physik , vol. xxvi (1915), p. 3. For 
a criticism of Borel’s definition see Lebesgue, Annales sc. de Vdcole normals (3), vol. xxxv (1918), 
p. 191. See also Borel, ibid. (3), vol. xxxvi (1919), p. 71. 
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The theorem will first be established for the case in which / (x) = c, 
over the set E. 


In that case the two, or (p -f 1 ) dimensional, measure of the set of 
points for which x is in E , and 0 s y & c, is cm (E). For the sets E, C (E) 
can be enclosed in sets of intervals, or cells, such that the measure of the 
set common to the two sets of intervals is < rj. The set, x in E y 0 y u c, 
and the complementary set, x in C (E), 0 y e, can consequently be 
enclosed in sets of rectangles, or (p -f l)-dimensional cells, such that the 
measure of the set common to the two sets is < crj. As 77 converges to 
zero, the measure of the set of intervals, or cells, enclosing E, converges 
to m (E), and the measure of the set of rectangles, or cells, enclosing the 
set ( 2 ? in E, 0 < y ^ c), converges to cm ( E ), which is thus the measure of 

the set. Therefore, in this case f(x)dx is equal to the measure of the 

J(E) 

set, x in E, 0 £ y % f (a:). 


r in 

Next, if E = 2 e r , where e r denotes a measurable set, and f (x) = c r , 

r “ 1 

for r = 1, 2, 3, ... m, the measure of the set (x in E, 0 £ y ^f(x)) is the 

r « rn 

sum of the measures of the r sets (x in e rt 0 £ y £ c r ), which is 2 c r m (e r ), 

r-1 

and is therefore equal to / (x) dx . 

HE) 

If / (x) is any bounded measurable function (2 0), the measure of the 
set (x in E, 0 £ y £ / (#)) is between the measures of the two sets 

(x in E, 0 £ y s <f> n (,x )), (x in JS', 0 £ y £ fa (x)) y 

r ~ n r *»« 

employed in §385; that is between 2 a r ^m (e r ) and 2 a r w (e r ); conse- 

r = 1 r=] 

quently it differs from f(x) dx by less than 77 (U — L) f which converges 
J(E) 

to 0, as t) does so. Therefore the measure of the set is equal to f(x) dx. 

J(E) . 

If / (x) be a bounded measurable function which has, in E, values of 
both signs, we see that 


[ f (x) dx = f f(x)dx-[ {-/ (x)} dx, 

J(E) J {Ei) J (E t ) 

where E x , E z are the parts of E in which / ( x ) is positive and negative 
respectively. 

It follows that j f (x) dx is the excess of the measure of the set 
Ue) 

(x in E lf 0 s y £/(*)), 

(x in E^O&yz f (x)). 


over that of the set 
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rb 

The value of the //-integral / ( x) dx, where / (x) is a measurable 

J a 

function k 0, representing the measure of the set of points 
(a tz x z b, 0 z y s f (x)), 

may be regarded as representing what is, in geometrical language, called 
the area bounded by the curve y = / (x), the ordinates at a and b , and the 
nb ,l \& (8) ) 

ur-axis. Similarly | / (# (1) , x {2) ) d (x n \ x {2) ) may be regarded as repre- 

*•/ («<*>, a<*>) 

senting the volume bounded by the surface y — / (x {l) , x l2) ), the plane 
y = 0 and the planes x {i) ~ a (1) , x (1) = b {1) , a: (2) — a (2) , a: (2) — 6 (2) . Accord¬ 
ingly, such a measure of an area, or of a volume, may exist, in accordance 
with the definition of Lebesgue, in cases in which it does not exist in 
accordance with the definition of Jordan, because a set of points which is 
measurable is not necessarily measurable (J). 

fb 

The L-integral I / (x) dx, when / (x) has both signs, may be fegarded 
J a 

as expressing the excess of the area above the #-axis over the area below it. 
If / (;r) i' 0, in (a, b), and is summable, but unbounded in the interval, 
the //-integral is the limit, as N ~ oo, of the area defined by a s x £ b, 
0 y 'j* f N ( x ), the function f N (x) being defined as in § 38fi. 

THE H INTEGRAL AS AN //-INTEGRAL 

391. It will be shewn that an /^-integral is also an //-integral, although 
the converse does not in general hold. 

Let / (#) be integrable ( R ), in an interval, or cell, (a, b). If E denote 
the set of points at which f (x) > A, any point of E which is a point of 
continuity of f (x) must be an interior point of E ; for such a point has a 
neighbourhood in which, at every point, / (x)> A. A point of E, at which 
/ (x) is discontinuous, need not be an interior point of E, but all such 
points belong to a set of zero measure, since / (x) is integrable (R). There¬ 
fore E consists in general of an open set together with a set of measure 
zero; and, since both these parts of E are measurable, it follows that E is 
measurable. Since A is arbitrary, it has thus been shewn that / (#) is a 
measurable function. Since it is bounded in (a, b), it is therefore sum¬ 
mable; and thus has an //-integral. The measure (J) of a set is identical 
with its measure, when both exist; and therefore the /^-integral of / (x) 
has the same value as the //-integral. 

If f (x) is measurable, and bounded, but not integrable ( R ), it is easily 

Tb fb rb 

seen that / (x) dx £ / (x)dxz f (x) dx. For, in any mesh 8 of a net, 

Ja Ja 2 m a 

ihe //-integral of / (x) lies between 8U (8) and SL (8). 
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THE LEBESGUE INTEGRAL AS A FUNCTION OF A SET OF POINTS 

392. It has been pointed out in § 208 that, if a class of sets of points E 
be assigned, and if by any rule, or set of rules, a definite number is given, 
corresponding to each set E of the given class, the set of all such numbers 
may be regarded as defining a function of E which has these numbers for 
its values. 

Let the sets of points E be all measurable sets, and let us assume, for 
the present, that the class E consists of all measurable sets contained in 
a measurable domain K which may be bounded or unbounded, but of 
finite measure. Let a function / (.r) be defined for all points of that domain, 
and let it be taken to be Hummable in K, whether the function be bounded, 
or not . It is easily seen that / (x) is summable in any one of the sets E. 
For | / (x) | is summable in K , and the function (f> E (x) which is equal to 
| / (x) | in E , and to 0 , in C (E), the complement of E relatively to K , is 
such that its integral over K is less than that of | / (x) |, and is therefore 
finite. The integral of <f> E (*) over K is the same as over E\ therefore 
| / (x) | is summable in E, and consequently / (x) is also summable in E. 

The value of f(x)dx may be regarded as defining a function <f> (E) 

• (E) 

of the measurable set E. In the particular ease in which f (x) — 1 , at all 
points of K , the L-integral is rn (E), which, as has been shewn in § 130, 
is a completely additive function. 

The function </> (E) ~ f (x) dx converges to zero, as m (E) ~ 0 , uni- 

'(E) 

formly for all sets E in the fundamental domain K. 

Let N be an arbitrarily chosen positive number, and let K N be that 
part of the fundamental domain K in which | / (x) | > N. Let 

a 0 , a 1} a 2 , ... N, u s+1 , a s+ 2 , ..., 

where a 0 0 , a s = N, be a sequence of numbers increasing indefinitely, 

such that a n+1 — a n < e, for all values of n. The numbers e, N can be so 
00 00 

chosen that E a r m (e r ), E a r+l m (e r ) both converge to values less than 77 , 

r '~8 r -8 

where e r is the set of points for which a r z \f (a:) | < a r fl . The integral 
| / (x) | dx , which lies between the two sums, is therefore also less than 77 . 

hz N ) 

Now [ \f(x)\dx=\ \f( x )\ dx -f [ | f(x) | dx; where K NE 

Ue) ■ (E NE ) .'( e~k ne ) 

denotes the set of points D (E, K N ), common to E and X N ; that this is 
the case follows from the definition in § 385. 
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We see that I / (x) I dx s I f (x) I dx < rj ; hence 

•' (Knk) j (Kn) 

\f(x)\dx<rj f Nm (E — K ne ) < q + Nm (E) < 2rj ; 

1 

provided m (E) < q/N ; whatever set E be taken that satisfies this con¬ 
dition. Since 77 is arbitrary, it follows that lim | / (x) | dx = 0 ; and 

J(£?) 

that this convergence is uniform with respect to E. 

Since f (x) dx ^ \ f ( x )\ dx, we see that / (x) dx converges 

J(E) J (E) J(E) 

to 0 , as m ( E ) ~ 0 , uniformly for all sets E. 

A function of E which converges to zero uniformly, as m (E) — 0 , in 
the fundamental domain K is said to be an absolutely convergent function 
of E. 

Accordingly the function / ( x ) dx is an absolutely convergent 

J(E) 

function of E . 

If an unbounded set G, of finite measure, be the : outer limiting set of a 
sequence {G n } of bounded measurable sets, each of which is contained in the 
next , then 


f (x) dx = lim / (x) dx. 
J(G) n~*>J((j„) 

For, since m (G) = lim m (G n ), (see § 134), we have 


lim [ / (x) dx = 0, 

00 i(G~On) 


from which the result follows. 


393. Since, in accordance with § 390, </> (E) is, i i f (x) ^ 0 , the measure 
of a two-dimensional set, or a (p -f 1 )-dimensional set, according as E is 
a linear set, or a p-dimensional set, it follows from the theorem (§ 130) 
that the measure of a set is completely additive, that, in the case in which 
/ (x) s 0 , in K, the function (f> (E) is completely additive. In the general 
case in which / (x) is the difference / f (#) — f~ (x), of two summable func¬ 
tions, each of which is s 0 , by applying the result to each of these functions, 
we obtain the following theorem : 

The function <j> (E) defined as the value of I f (x) dx, where E is a 

HE) 

measurable set contained in the fundamental internal, or cell, in which f (,x ) 
is summable, is a completely additive function of E. 

In particular we have 

( / (x) dx = f f (x) dx + [ / (x) dx. 

HE X + E t ) HE 1 ) J(E t ) 
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A special case of this theorem is that, if 8 1? S 2 , ... 8 W , ... be a finite, or an 
enumerable, sequence of non-overlapping intervals, or cells, contained in 
the fundamental interval, or cell, and A denote their sum, or limitingsum, 

then r O0 r 

f (x) dx - 2 f (x) dx. 

J(A) 


EQUIVALENT L -INTEGRALS 

394 . If the values of a function / (x), that is summable over a set E, 
be altered at points of E belonging to a part E 0 , for which m (E 0 ) = 0 , 

the value of f (x)dx is unaltered. For, in the definition of the integral, 

J (E) 

the measures of all the sets employed will be unaltered. Moreover, the 
integral of / (x) over the set E will be the same as over the set E — E 0 . 

Two functions defined for a measurable set E, which have almost 
everywhere in E, equal values, are said to be equivalent functions . Thus 
all summable and equivalent functions have the same L-integral. 

A function that is unbounded in E may be equivalent to a bounded 
function; for all the points at which the unbounded function is numerically 
greater than a certain fixed number, may form a set of which the measure 
is zero. 

In some cases, especially in the theory of functions defined by series, 
it is convenient to admit infinite values of a function at particular points 
of the set E. If these points form a set E 0 such that m ( E 0 ) = 0 , we can 

regard f(x)dx as existing, and equal to f(x)dx , whenever this 

HE) J(E-E 0 ) 

latter integral exists. 

Iff (x) be £ 0 , in the set E , of measure > 0 , and such that f(x) dx — 0 , 

Ue) 

then f (x) is zero almost everywhere in E. 

Let E m be the set of points of E at which / (x) £ e, then from the 

definition of the L-integral, we should have / (x) dx ^ em ( E ,); it follows 

J(E) 

that the integral of f (x) cannot vanish unless m ( E \) = 0 ; and this must 
be the case for every positive value of e. The set of points at which 
f (x) > 0 is the outer limiting set of the sets E €l , E< % , ..., corresponding 
to a sequence of diminishing values of e, that converges to zero. It follows 
(§ 131 ) that the set of points at which / (x) > 0 , has measure zero. 

If the summable function f (x), defined for an interval , or cell , (a, 6), be 
such that its integral over any interval , or cell , whatever , contained in (a, b), 
is zero , the function f (x) must be zero almost everywhere in (a, b). 
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Let A denote a set of non-overlapping intervals, or cells, containing 
the points of the set E x , that part of (a, b) for which/ ( x ) £ 0 . We have then 

f / (x) dx + [ / (x) dx = ( / (x) dx =■ 0. 

J(A) 

The set A can be so chosen that m (A — J^) is arbitrarily small, and there¬ 
fore so that the integral of / {x) over A — E x is arbitrarily small. It follows 

that / (x) dx = 0, and therefore / (a;) is zero at almost all points of E x . 

J(Ex) 

Similarly, it can be shewn that / (x) is zero at almost all points of the set 
of points at which / (x) £ 0 . It follows that / (x) is zero at almost all 
points of (a, 6 ). 


PROPERTIES OF THE LEBESGUE INTEGRAL 

395. It has been shewn in § 384 that, if f x (x), / 2 (a;) be functions that 
are measurable in the measurable set E, then their sum f x (x) + f 2 (x) is 
also measurable in E . 

It will now be proved that 

[ {/1 (*) + U (*)> dx= f /j (x) dx + f f 2 (x) dx. 

J (E) J (E) J (E) 

First, let it be assumed that f x (x),f 2 ( x ) are bounded. If e be an arbitrarily 
chosen positive number, let e r (1) , e r (2) denote the two sets in which 
re £/ x (x) < (r + 1 ) e; re £/ 2 (x) < (r + 1 ) e. 
r =0 r-p 

Consider the two sums 2 rew (e r (i) ), 2 rew (e r <2) ), where a, ft are integers 

r -a r-a 

(positive or negative), such that all the values of f x (x), f 2 (x), in E , are 
£ ae, and <(/?-}- 1 ) e. If e ra denote the set which is common to e r {1) , e, (2) , 
we have 

9 a-*0 n-20 f \ w-2/3 

2 rew (e r (1) ) + 2 sem (e a (2) ) = 2 2nn j 2 m (e r8 )l = 2 n. 2 ew (I£ w ), 

r-a s n-2a (r + a-n j n = 2a 

where 2 £ n denotes that set of points in which ne &f x (x) + / 2 (x) < (n + 2 )‘e. 

s~/3 r-/3 

This follows from the fact that e r u) = 2 e r8 \ and e 3 {2) = 2 e r *. Making 

S *= a r " a 

n -20 

e converge to zero, we now see that 2 w. 2 ew (^ n ) converges to 

n =2a 

f {/l (*)+/. (*>}<**• 

J ( E) 

It thus follows that this integral is the sum of f x (x) dx and / 2 (x) dx. 

HE) J(E) 

Next, let one, or both, of the functions f x (x), f 2 (x) be unbounded in E. 
Let En be that set of points in which 

2 N>f t (x) +/, (x) > - 2 N; N>f 1 (x) > - N; N > f 2 (x) >-N; 


37-2 
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then m (En) ~ m (E), as N ~ oc. We have then 

f {fi (*) +/»(*)} dx = I fi ( x ) dx + ( h( x )dx; 
hEm J (En) j (En) 

and since by the theorem of § 392, the integrals of 

h ( x )+h (*)> A ( x )> u ( x )> 

taken over the set E — E N) all converge to zero, as N ~ oo, and 

rn (E — E n ) ~ 0, 

we see that 

f {/i (*) + / 2 (x)} dx = [ /j (x) dx + [ / 2 (x) dx. 

HE) J(E) J (E) 

396. If {/ (z)} 2 be summable in E , a set of finite measure, so also is 
/ (x), but the converse is not necessarily true. For {f(x)} 2 > |/<*) |, if 
| / (x) | > 1 ; and therefore | f (x) | is summable over that part of E in each 
point of which it is > 1 , since {/ (z)} 2 is summable over that part. It 
follows that | f (z) |, and consequently / (z), is summable over E. 

If [f x (a;)] 2 , [f 2 ( x)] 2 are both summable over E , so also is f x (z)f 2 (z); 
for |/, (x)/ 2 (x) | a *[{/, (x)} 2 + {/ 2 (x)} 2 ], and therefore | /, (x )/ 2 (x) | is 
summable over E. 

It can be the case that f x (x) f 2 (z) is summable over E , but not { f x (a ;)} 2 
or {/a (*)} 2 - 

Assuming that {f x (a;)} 2 , {/ 2 (a ;)} 2 are both summable over E, we have 

I [ /i (*)/a (x) dx I s f | /, (x)/ 2 (x) | dx 
\J(E) I J(E) 

A\ {/,(*)}•&! + if {/ 2 (x)} 2 dx ...( 1 ). 

* J(E) z .'(#) 

Also {A | / 2 (a;) | -f /x | / 2 (a;) |} 2 is summable over E , as it is the sum of 
three summable functions; A, fi denoting constants. 

Since 

A* [ {/x (x)} ! dx + 2A/z f |/j (x)/ 2 (x) I dx + [ {/a (x)} 2 dx 

is essentially positive, we see that 

| f /a (*)/a (*) «&}*([ l/i (*)/i (*) I 
li(£) ) l HE) ) 

sf {/i (x )} 2 dx j* {/ 2 (x)} a dx.( 2 ). 

The inequalities ( 1 ) and ( 2 ) are of considerable use in questions connected 
with the convergence of series. The inequality ( 2 ), being a generalization 
of a theorem due to Schwarz, is known as Schwarz’s inequality. 
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397. If fi (x) be surnmable , but not equivalent to a bounded function , it 
is always possible to determine a surnmable function / 2 (: x) so that f x (x) / 2 (x) 
is not surnmable *. 


If a x , a 2 , ... a w , ... be a set of positive increasing numbers without an 
upper limit, and e n denote the set of points at which a n 5 | f Y (x) | < a n+1 ; 
then m (e n ) > 0, for an infinite number of values of n. The series Sfl n m (e n ) 

is convergent, being s [ \f, (x) | dx; let |/ 2 ( x) | = - J, , - , for all 

J ( e) na n m \e n ) 

points of e n , for each value of n. We have then I I f 2 (x) | dx = S • ; 

J(E) na n 

and if the numbers a n be properly chosen (for example a n — n ), this series 
is convergent, and / 2 (a;) is surnmable. But the series 


S 


1 

na n m (e n ) 


a n m (e n ) 


being divergent, 


( \fi{x)f 2 (x)\dx 

J (E) 


has not a finite value, and therefore f 2 (x) is a function such as is required. 


398. If f n (x) denotes a sequence of measurable functions, defined in the 
interval , or cell, (a, b), and such that | f n (x) | is, for every value of n, and for 
every point x, in the interval, or cell , (a, b), less than a fixed positive number A , 
then if {f n (#)} converges for each value of x, to the value of the function f (x), 

rb rb 

that function is surnmable, and I / ( x ) dx — lim / /„ (x) dx . 

J a n~oo J a 

Since, for any fixed value of #, | / (x) — f n (x) | < e, provided n be 
sufficiently great, we see that | / (x) | < | f n (x) | + c < A 4- e ; and there¬ 
fore since e is arbitrary, | / (x) | ^ A. Therefore the function/ (a:) is bounded. 
Let e n denote that measurable set of points of (a, b) at which f n (x) > c ; 
then the set e , of points at which f (x) > c, is such that each point of e 
belongs to all the sets { e n }, from and after some value of n dependent on x , 
and therefore, employing a theorem given in § 131, the set e is measurable. 
It follows that/ (x) is a measurable function, as c can be arbitrarily chosen; 
and since / (x) is bounded, it is surnmable in (a, b). 

Let g n denote the set of points x , at which \ f (x) — f n (x) | > e; the 
set g n is measurable, and lim m ( g n ) = 0 . For if this is not the case, there 

n~co 

are an indefinitely great number of values of n for which m ( g n ) is greater 
than some fixed positive number a. In accordance with the theorem of 
§ 136, there exists a set of points, of measure ^ a, such that each point 
belongs to an infinite number of the sets g n . This is inconsistent with the 
condition that f n ( 2 ) converges to / (x), for each value of x. It has thus 
been shewn that m (g n ) —- 0 , as n ~ 00 . 

* See Lebesgue, Annales de Toulouse (3), vol. l (1909), p. 38. 
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Now 

[ {f(r)-f„(x)}dx=[ {f(x)-f n (x)}dx+( {f (x) - f n (x)} dx. 

J a J g„ J C(0 n ) 

The first integral on the right-hand side is numerically less than 
2Am (g n ), and the second integral is numerically less than el , where l is 
the measure of the fundamental cell. The integral on the left-hand side 
is arbitrarily small, for all sufficiently large values of n\ and thus the 
theorem is established. 

It may be observed that, in case there is an exceptional set of points, 
of measure zero, at which the sequence {f n (#)} does not converge, the 
above theorem still holds. For, in the above proof, we can disregard this 
set; the integrations being all taken over the complements of this set, 
the measures of the sets over which they are taken are unaffected. 

399. The following theorem is concerned with the L-integral of a 
function which is, at almost every point of its domain, the limit of a 
monotone sequence of measurable functions. 

Let {f n (#)} be a sequence of non-negative hounded functions, summable 
in the measurable set of points E, in any number of dimensions , and let it be 
assumed that, for each value of x, in E, the sequence is monotone and non- 

diminishing. Let it be further assumed, that lim f n (x) dx has a definite 

n~oo J (E) 

value. Then (1), the points of E at which the sequence { f n (#)} does not con¬ 
verge to a definite number form a set of points of measure zero; and (2), if 

f (x) denote the limit of the sequence { f n (x)}, the integral f (x) dx exists , 

Le) 

and has the value lim /„ (a;) dx; those points of E at which f (x) is unde - 

ao '(E) 

fined being disregarded. Conversely (3), if f (x) dx exists as a definite 

•(E) 

number, then lim f n (#) dx exists, and has the same value as the integral, 
n—oo J (E) 

The first two parts of this theorem were given* substantially by Vitali, 
and byf B. Levi; the proof here given was published} by Hobson. Let k 
denote a positive number, and let the functions f n lk) (x) be defined by 
fn k) (#) = fn ( x ), for values of x for which f n (x) ^ k, and by f n {k) (x) — k, 
for values of x for which f n (x) > k. Let f {k) (x) be a function such that 
f lk) (x) = / (x), when / (x) & k, and f {k) (x) = k, when / (x) > k. For each 
value of x,f n (k) (x) defines a monotone double sequence, that is a monotone 
function of n and k, if n have the values 1, 2, 3, , and k have the values 

* Rend, del Circ. Mat. di Palermo , vol. xxm (1907), p. 137. 
f Rend. deXC Jstit. Lombardo , (2), vol. xxxix (1906), p. 775. 
t Proc . Lond. Math. Soc. (2), vol. vin (1910), p. 28. 
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in any monotone increasing sequence of numbers without an upper limit. 
It is clear that f n (k) (x) dx, regarded as dependent on n and k, defines 

J (E) 

a monotone double sequence of non-diminishing numbers. From a pro¬ 
perty of such double sequences (see § 388), we infer that the two repeated 
limits , r 

lim lim I f n lk) (x) dx, lim lim f n (k) (#) dx 

A'~co n~oo J (E) n~ oo k~ oo J (E) 

are such as to have the same finite value, if either of them is finite. 

Since f n ik) (x) is bounded, for all values of k and x, we have (see § 398), 


and therefore 


lim J /„<*> (x) dx [ /„ (x) dx, 

k~-<x> J (E) J (E) 

lim lim I f n ik) (x) dx — lim I f n (x) dx, provided that i 
n~ao k~ oo J (E) n~<x> J (E) 


of these two limits is assumed to exist. 

Also, we have lim f n (k) (x) dx f ik) (x) dx, since f n {k) (x) is 

n~oo J (E) '(E) 

bounded for all values of n and x . Therefore 

lim lim [ f n ik) (x) dx — lim [ f {k) (x) dx, 
k~<» n~ao J (E) A ~oo J (E) 

provided one of these two limits exists. 

Since f n ik) (x) is summable, so also is f lk) (x), and therefore also 

lim f (k) (x), or f{x), is measurable. In case lim I f ik) (x) dx is finite, its 
oo A'~* J ( E ) 

value defines r 

/ / ( x ) dx - 
J (E) 

First, let it be assumed that lim f n (x) dx exists. If e k be the set 

n~* oo J (E) 

of points for which f (k) (.r) = k, we have 

km (e k ) < [ f ik) (x) dx < lim [ f n (x) dx. 

J (E) n~*coJ (E) 

The set of points at which / (x) is indefinitely great is contained in all 
the sets e k , hence its measure is 

s m ( e k ) < l lim ( f n ( x ) dx. 

* n~oo J(E) 

Therefore, since k is arbitrarily great, the measure of the set of points at 
which / (x) = oo is zero. 

Also f f (x) dx = lim f f (k) ( x) dx = lim f f n (x) dx. 

J (E) fc-w.bfi) n~<x>J(E) 
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Next, if I / (x) dx, or lim lim I f n {ki (x) dx, is assumed to be finite, 
J (E) k~cc n~oc J (E) 

it will be seen, from what has been proved above, that lim I f n (x) dx 

n~To J {JC) 

exists, and has the value of | / (#) dx. 

J (E) 


THE LIMITS OF A SEQUENCE OF MEASURABLE FUNCTIONS 

400. Let f 1 ( x ), f 2 (x), .../„ ( x) y ... be a sequence of measurable func¬ 
tions defined in an interval (a, 6), or in a cell of any number of dimensions. 
Let v n (x) denote that function which, for each value of x, has for its value 
the greatest of the numbers f 1 (x), f 2 (x), ... f n (x). The function v n (x) is 
measurable; for the set of points at which A < v n (x) is that set each point 
of which belongs to one or more of the measurable sets for which 
A </, (x), A </„ (x), ... A < f„ (x). 

The functions v x ( x ), v 2 ( x ), ... v n (x), ... form a sequence which, for any 
value of x , gives a monotone non-decreasing set of numbers; let w 1 (x) 
denote its limit. The value of w x (x) is finite, or infinite, for each value 
of x. Let the function w n (x) be formed in the same manner from the 
sequence v n (x), v n+i (x), ..., obtained by leaving out the first n — 1 func¬ 
tions in the sequence v 1 ( x ), v 2 ( x ), The sets w x (^), w 2 ( x ), ... are all 

measurable; for the set of points x at which u\ (x) > A is the set of points 
which belong to an infinite number of the measurable sets for which 
(*) > A, v 2 (x) > A, ... v n (x) > A , ...; 
hence w x (x) is a measurable set, and similarly w n (x) is measurable for 
each value of n. The functions w x (x), w 2 ( x ), ... define a monotone non¬ 
increasing sequence of numbers for each value of x. The limit of this 
sequence is / (x), where /(#) denotes lim f n (a:). As before, this function 

n~co 

/ (x), being the limit of a monotone sequence of measurable functions, is 
itself measurable. In a similar manner it can be shewn that/ (x) := lim f n (x) 

0O 

is measurable, it has thus been shewn that: 

If f x (x), f 2 (x), ... be a sequence of measurable functions of one or more 
variables , defined in an interval , or a cell , ( a , b ), the functions J (x), f (x) are 
measurable , where J (x),f (x) denote , for each value of x , the upper and lower 
limits (finite or infinite) of the sequence of numbers {f n (#)}. 

If, instead of the upper and lower limits of {f n (a;)}, their upper and 
lower boundaries are taken, we have the corresponding theorem: 

If {f n (#)} is a sequence of measurable functions of one or more variables , 
the functions U (x), L (x) are measurable; where U (x), L (x) denote, for each 
value of x , the upper and lower boundaries of the numbers {f n (#)}. 



399—401 ] Limits of a Sequence oj Measurable Functions 585 

The set of points for which U (x) > A, is that set which consists of 
the points that belong to one or more of the sets for which 
fi (*) > A, f 2 (x) > A, .... 

It follows that the set is measurable, and consequently that U (x) is a 
measurable function. Similarly it can be shewn that L (x) is a measurable 
function. 


THE DERIVATIVES OF A FUNCTION 

401. If <f) (#) be continuous in the linear interval ( a , b), the four derivatives 
D+<f> (x), D+fi (x), (x), D f> (#) (finite or infinite) are measurable func¬ 

tions of x. In particular, if f (x) have almost everywhere a differential 
coefficient fi' (x), then <j>' (x) is a measurable f unction of x. 

We shall consider the cases of (x) and D+<f> (x ); the cases of the 
other functions being then deducible. 

D + cf> (a:) is the upper limit lim I (x, x -f h), when h > 0, and I ( x, x f h) 

h~ o 

denotes the incrementary ratio, {</> (x -f h) — ( x)}/h . It will be shewn 
that a sequence of positive values of //, converging steadily to zero, can 
be determined, such that the upper and lower limits of I (x, x -f h),w hen 
li has successively the values in the sequence, are for every value of x 
the same as when h is not restricted to have such values. Let 

V> V> K, ■■■ 

be a sequence of diminishing positive numbers, converging to zero, and 
let € lt c 2 , ... €„, ... be another such sequence. Since / (x, x + h) is con¬ 
tinuous with respect to ( x , h), for all values of x in (a, 6), and for all values 
of h that are > 0, it follows from the uniformity of continuity of I (x,x + h ), 
when h is confined to the interval (h' n + lf h' n ), that this interval can be 
divided into a definite number r n of parts, such that 
| I (x, x + h) — / (x, x -1- h') | < £„, 

for every value of x, provided h and h f both lie in one and the same part 
of the interval (h' n+1 ,h' n ). Let this sub-division of (h' n+l ,h' n ) be made 
for each value of n, and let h x , h 2 , h 3 , ... denote the end-points of all the 
parts of all the intervals. The sequence h l9 h 2 , h 3 , ... converges to zero, 
and it is a sequence such as satisfies the required condition; for we have 
| I (x, x + h m ) — I (x, x -f h) | < provided h m ^ h <i h m+l ; the integer s 
being determinate, corresponding to each value of m. It follows that the 
upper and lower limits of the sequence 

/ (x, x + h x ), I (x, x + h 2 ), ... I (x,x + k m ), 
are identical with those of any other sequence 

I (x, x + V), I (x,x + h 2 r ), ... I (x,x + hj), ..., 
h x > hf ^ h 2 , h 2 ^ hf > h 3t ...; 


when 
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and generally h m ^h m r £ h m+1 ; and this is the case for every value of x 
in (a, b). Therefore the sequence {h n } has the required property. 

If we identify the functions f n (x), of § 400, with the functions 
/ ( x , x + h n ), where the sequence h n is formed as indicated above, we see 
that the two derivatives D + <f> (x), D + <f> (x) are measurable functions, since 
the functions {<f> (x h n ) — <f> (x)}/h n are measurable for each, value of n. 

It follows from the theorem that: 

The set of points in which a derivative D<f> (x) of a continuous function 
is finite is measurable . 

For the set is the outer limiting set of the sequence of measurable sets 
for which n > Df> (x) > — n\ for n — 1, 2, 3,_ 

402 . If E be a set of points in the interval (a, b), at which Df (x) y one of 
the derivatives of a function f (x), continuous in (a, b), has a fixed sign (and 
is not zero), the set E , when it exists , is unenumerable , and contains a perfect 
set . 

It will be sufficient to consider the set of points E, at which D 4 / (x) < 0; 
the other cases may be treated in a similar manner. 

If a be a point of E , another point ft (> a) can be determined such 
that / (/?) — / (a) has a negative value — A. 

Let rj be a fixed positive number such that rj (ft — a) - K < A, and 
let k be a positive number such that A 1 + k < X; we shall consider the 
continuous function <f> (x) = / (x) — / (a) -f* rj (x — a) -f k, in the interval 
(a, /?), of x. Since <f> (a) is positive, and </> (/?) is negative, <f> (x) must have 
the value 0 at one or more points within (a, ft ); let f be the point nearest 
to /3 at which this is the case. Since (f) — 0, and <f> (fi) is negative, <f> (x) 
is negative for all values of x such that f < x g p. Since <f> (x) - <f> ($) is 
negative, it follows that D + cf> (f) s; 0, whence we have D + f (£) £ — rj < 0, 
and therefore f belongs to E. 

The number A x being kept fixed, k may vary continuously within the 
interval (0, A —A a ); and, to each such value of k, there corresponds a 
single value of £, which belongs to E. It follows that the set E contains 
a single point corresponding to each value of k within the interval 
(0, A — Aj). Therefore E contains an unenumerable set of points, of the 
cardinal number of the continuum, and thus contains a perfect set. 
Accordingly the theorem* has been established. 

403 . If f (x) be a function defined, in the interval (a, b), and E be the set 
of points at which one of the derivatives Df (x) has a fixed sign (and is not 

* See de la Vallee Poussin’s Cours d'Analyse, 2nd ed., vol. I, p. 80. The proof there given 
corresponds to the case rj =0; but the conclusion there drawn that (£) <0, is incorrect. This 
is remedied above by the introduction of the positive number rj. 
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zero), then, if Df ( x) is finite (bounded or unbounded) at every point of E, 
the exterior measure of E is greater than zero. 

It will be sufficient to consider the case in which E is the set of points 
at which D+f (x) < 0; it being assumed that, at no point of E , is 

= - 00 • 

Let it be assumed, if possible, that m e (E) = 0. Let E n be that part of E , 
in the points of which n — 1 £ — D + f (x) < n , then m e (E n ) = 0. The set 
E n may be enclosed in a set A„, of non-overlapping intervals, of total 
measure e n \ and this may be done for all the sets E n (n = 1, 2, 3, ...). The 
numbers e n can be so chosen that H ne n converges to a value which is 

n 1 

less than an arbitrarily chosen number e. Let A„ (x) denote that part 
of A n which is in the interval (a, x), and let 

<f> (x) = m {A x (x)} 4- 2m {A 2 (x)} + ... + nm, {A n (x)} + ...; 
the function <f> (x) being consequently < e, for all values of x. 

Consider the function if) (x) ~ / (x) + <j> (x). Since <j> (x) is monotone 
and non-diminishing, we have D+ip (x) ^ D+f (x) } at all points of C (E). 
At every point x , of E , if x belongs to E n , the increment <f> (x + h) — cf> (x), 
for sufficiently small values of h , is at least nh ; hence 
D+ijj (x) ^ n -f D+f (x) > 0. 

It has thus been shewn that ifj (x) is monotone and non-diminishing in 
(a, b); thus i/j (p) — ip (a) ^ 0, for any two points a, /?, when > a. Since 
<f> (ft) — <f> (a) < c, it follows that f (ft) — f (a) > — e. But if a is a point 
of E y a point can be so chosen that f (fi) — f (a) is negative, and has a 
value — k ; and thus k < e. Since e can be so chosen as to be less than k , 
the hypothesis that m e (E) = 0 has been shewn to lead to contradiction. 
It has therefore been shewn that m e (E) > 0. In case / (x) is continuous, 
since E is then certainly measurable, its measure must be > 0. 

INDEFINITE INTEGRALS 

404. If / (x) be summable, whether bounded or not, in the linear 
interval (a, b), it is also summable in the interval (a, x), where a & x ?;■ b. 

The integral [ / (x) dx may be regarded either as a function of the upper 

. a 

limit, or as a function of the; set of points which constitute the interval 
(a, x). 

If F ( x ) be a function defined in (a, x), and such that 

F (x) - F (a) = [ / (x) dx, 

J a 

it is called the indefinite integral of / (x), and is determinate, when / (x) 
is defined, except for an additive constant. 
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An indefinite integral is continuous, and of bounded variation, in the 
interval for which it is defined. 

f r+h 

f (a;) dx> 

x 

F (x) - F (x - h) = f / (x) dx, 

Jx-h 

and since the integrals on the right-hand side converge, in accordance 
with the theorem of § 392, to zero, as h —«0, it follows that F (x) is con¬ 
tinuous at every point x, of (a, b). Again, if (a, b) be divided into any 
number n of parts (x r ^ l7 x r ), where x 0 = a, x n = b , we have 

£ | F (x T ) - F | = £ I [ ' f(x)dx 

r 1 r - 1 l J xr -i 

r-n r-rr 

S S | / (*) | dx 

r-l . J-r-, 

5 ' ( | / (»•) | rf-e- 

•' a 


Since this inequality holds for every possible set of sub-divisions of 
(a, b) into parts, it follows that F (x) is of bounded variation. This 
may also be proved by utilizing the fact that a summable function / (x) 
is the difference of two non-negative summable functions f y ( x ), f 2 (x). 
Then F (x) is the difference of the two monotone functions, F 1 (x), F z (a:), 
which are the indefinite integrals of f x (x), f 2 (x). Then by the theorem of 
§ 244, it follows that F (x) is of bounded variation. 


405. The indefinite integral F (x) has a finite differential coefficient almost 
everywhere in ( a , b). 

Since F (x) is of bounded variation, it is expressible as the difference 
of two monotone functions, and in accordance with the theorem of § 298, 
it consequently has a finite differential coefficient almost everywhere in 
(a, 6). 


This theorem is included in the following more general theorem*: 

If F (x) be the indefinite integral of a summable function f (x), defined in 
(a, 6), F (x) has , almost everywhere in (a, b), a differential coefficient equal 
tof(x). 


The theorem will first be proved in the case in which / (x) is bounded in 
(a, 6). Let e (x) be the part in (a, x) of a measurable set of points e, in (a, b). 


The metric density of e has been defined in § 137 as 


lim 
n -o 


e (x -f h) — e (x — h) 
2h ’ 


* See Lebesgue, Lemons sur Vintegration, p. 124. 
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since the continuous function e (x) is monotone, it has, for almost all values 
of x, a differential coefficient, and wherever this differential coefficient 
exists, its value is identical with the above limit, and therefore with the 

metric density of e (x). It follows that ~ e (x) exists, and has the value 1, 

(IX 

at almost all points of e, and that it exists, and has the value 0, at almost 
all points of C (e) (see § 140). 

If L , U denote the lower and upper boundaries of / (x) in (a, b ), let 
(L y U) be divided into parts (a 0 , a x ), (a x , a 2 ), ... (a n _ x , a n ) y where a 0 = L , 
a n — U , and where a r — a r _ x < e, for all values of r. Let e r be the set of 
points at which a r ^ / (x) < a T , except that e n is the set at which 

1 £ / (x) s a„; 

and let e r (x) be the part of e r in (a, x). Let <f> x (x) = a r _ lt at every point 
of e r , for all values of r, and let </> 2 (a:) = a r , at each point of e r , for all 
values of r. 

Also let F x (x) = | </>i (x) dx , J^ 2 (a:) = | <f> 2 (x) dx\ 

J a J a 

then F x (a:) = a 0 m (a:)} + a x m {e 2 (a;)} -f ... -f a n _ x m {e n ( x)} f 
F 2 (x) == a x m {6! (a;)} -f- ( e 2 (^)l + -f {^n W}- 

The function ^ (a;) has a differential coefficient equal to (x) almost 
everywhere in (a, b). For m {e r (x)} has a differential coefficient almost 
everywhere equal to 1, or 0, according as x belongs, or does not belong, 
to e r . Similarly F 2 (x) has a differential coefficient equal to <f> 2 (x) almost 
everywhere. 

Also 

F x (x + h) — F x (x) ^ F (x -f h) — F (x) ^ F 2 (x + h) — F 2 (x) 

£ - a s /( 

for positive and for negative values of h, since F x (x) cannot increase more 
than F (x), and F (x) cannot increase more than F 2 (x), as x increases. 

Therefore the four derivatives of F (x) all lie between <f> x (x), </> 2 (x) 
inclusively, at each point x which does not belong to that set, of measure 
zero, at which F x (x), F 2 (x) do not have differential coefficients equal to 
(f> x (x), (/> 2 (x) respectively. Now <f> x (x), <f> 2 (x) differ from each other, and 
from / (x), by less than €; therefore, almost everywhere, the four deri¬ 
vatives of F (x) differ from one another by less than e. Taking a sequence 
of values of €, which converges to zero, we see that F (x) has a differential 
coefficient equal to / (x), almost everywhere in (a, b). 

The theorem has now been established for the case of the indefinite 
integral of a bounded function. 
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In order to prove the theorem for the case in which / (x) is unbounded, 
the following Lemma* will be required: 

If <f> ( x ) be a continuous monotone non-diminishing function , <f)' (x) is 
summable in (a, b), and 

r b 

/ <f>' (x) dx s (f> (b) — (f> (a). 

J a 

The fact that <f>' (x) does not exist, or is not finite, at points of a set 
of measure zero, has no effect on the integral, which is taken over that 
set of points at which <f>' (x) exists, and is finite. 

Let x N ( x ) — 4>’ ( x )> f° r values of x such that </>' (x) s N, and Xn (x) = N, 
when <f>' (x) > N. In accordance with the result obtained in § 401, </>' ( x ) 

and x v (x) are measurable functions. The function j XN^dx, being an 

indefinite integral of a bounded function, has a differential coefficient, 
equal to Xn ( x )> almost everywhere in (a, b). At all points of (a, b) its 

fb 

derivatives are all g N. Let the points of the set G , at which / Xn ( x ) dx 

J a 

does not possess a differential coefficient equal to Xn (#), be enclosed in a 
set of intervals A, of measure e/N, and let A x be the part of A in the 
segment (a, x). 

We shall consider the function <j> (x) -f Nm (A x ) == </> (#). In G , we have 
Difs (x) £ N; and at points not in G, Dift (x) ^ D<f> (x) £ % N (x ), where D 
denotes any one of the four derivatives. The increase of the function 

0 (a;), with x, is not less than that of j Xn dx; therefore 

J a 

<t> ( b ) ~ $ («) + « s [ Xn ( x ) dx ■ 

J a 

tb 

It follows that Xn ( x ) dx is bounded, for all values of N, and it is 

• a 

also greater than Nm (H N ), where Hn is that set of points at which 
D(f> (#) > N. We infer that lim m(HN) — 0. If En is the set complementary 

N~cc 

to Hn, we have 


[ d)<f> ( x ) dx s I Xn ( x ) dx z <f> (b) - <f> (a) + e. 

J{En) Ja 

If N ~ oo, we see that f D<f> (x) dx is finite, and & <f> (b) — <f> (a). 


From this Lemma we deduce that : 

If <f> (x) be a continuous function , of bounded variation in (a, 6), then 
D<f> (#) is summable , when taken over the set of points of (a, 6) at which it is 

* See de la Vallee Poussin, Court £ Analyse, 2nd ed., vol. I, p. 266. 
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finite; D<f> (x) denoting any one of the four derivatives of <f> (x). The integral 
of D<f> (x) has the same value for all the four derivatives. The set of points at 
which Dcf> (x) is not finite has measure zero. 

For <f> (x) is the difference of two monotone non-diminishing functions 
^ (x), <f> 2 (x). Since </>/ (x) } <f>f (x) exist almost everywhere in (a, b), and 
are summable over the set of points at which they exist, we see that 
<!>' (x) exists almost everywhere, and is summable. The theorem follows 
from the fact that D(f> (x) — <f>' (x), almost everywhere in (a, b); thus 

I <f>' (x) dx = f D<f> (x) dx. 

J a J a 

We are now in a position to prove the general theorem, when / (x) is 
unbounded. 


Consider the function fy (x) which = / (x), when/(x) < N, and is equal 
to N, when f (x)> N ; it being assumed that/ (x) is a non-negative function. 

Since j f (x) dx and j fy (x) dx are monotone functions, they both have 
J a J a 

finite differential coefficients almost everywhere, and that of I f N (x) dx 

J a 

clearly does not exceed that of I / (x) dx. Since fy (x) is bounded, the 

J a 

differential coefficient of j fy (x) dx is In (x), almost everywhere; therefore 
J a 

fy (x) s F* (x), almost everywhere. Since N is arbitrarily great, it follows 
that / (x) F' (x), almost everywhere. 

Now [ / (x) dx = F (6) — F (a) £ [ F r (x) dx , as is seen by employing 

J a J a 

the Lemma proved above; but since / (x) ^ F' (x), we have 
[ / (x)dx £ f F f (x) dx. 

J a J a 

rb 

It follows that I [F' (x) — / (x)] dx 0; and since F' (x) — / (x) ^ 0, it 
J a 

then follows that F'(x)=/(x), almost everywhere. The theorem has 
therefore been established for a non-negative unbounded function. 

Since every summable function is the difference of two non-negative 
summable functions, the theorem is completely established. 

It is easily seen that, at every point of continuity of the function / (x), 

j f (x)dx has a differential coefficient equal to/(x). For at such a point, 

J a 

if Z, u be the lower and upper boundaries of the function in (x — h } x -f h ), 
F (x -f h) — F (x) and F (x) — F (x — h) 
are between Ih and uh inclusively, and since l and u both converge to / (x), 
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as h ~~ 0 , it follows that F (x) has as its differential coefficient f (x). The 
set of points at which F (x) has / (x) for its differential coefficient conse¬ 
quently includes all points of continuity of / (x), when such points exist. 

The important theorem here established, which is due to Lebesgue, 
throws light on one of the fundamental questions which arise as to the 
reversibility of the two processes of integration and differentiation; as it 
asserts that the process of integration of any summable function can be 
reversed by differentiation, at almost all points of the interval for which 
the function is defined, and certainly at every point of continuity of the 
function. 


406. The necessary and sufficient condition that a function , defined for 
an interval , may he an indefinite integral is that it should be absolutely con¬ 
tinuous in the interval. 


To shew that the condition stated in the theorem is necessary, consider 
F (x) — F (a) -f- f / (x) dx. 

J a 


If A be any set of non-overlapping intervals such that m (A) < rj , we 
also have m (AJ < rj } m (A 2 ) < rj , where A x consists of those intervals of A 
for which the variations of F (x) are positive, and A 2 those for which 
they are negative. 


The integral / (x) dx is the sum of those variations of F (x) that 

are positive, and / (x) dx the sum of those that are negative. 

h*t) 

< 6, provided m (E) < rj y where rj is determined 


Since 


f /(*) 

J(E) 


dx 


when e is given, we see that the sums of the variations in the intervals of 
A x and of the absolute variations in the intervals of A 2 are < e, for all sets 
of intervals A, such that m (A) < rj. The condition of absolute continuity 
(see § 218) of F (x) is therefore satisfied. To prove that the condition 
stated is sufficient, let us assume that it is satisfied by F (x); it has been 
proved in § 244 that F (x) is of bounded variation in (a, b). 


Since F (x) is of bounded variation it has a finite differential coefficient 
/ (x) almost everywhere. It follows from the Lemma in § 405 that, F (x) 
being the difference of two monotone functions, its differential coefficient 

/ (x) is summable in (a, 6). Also f / (x) dx f which has thus been shewn to 

J a 

exist, has a differential coefficient / (x), almost everywhere. Therefore 
F (x) — j / (x) dx ~<f> (x) has a differential coefficient equal to zero, except 
in a set G, of measure zero. Enclose G in non-overlapping intervals of a 
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set A, where m (A) < e. Each point a, at which <£' (#) exists, and has the 
value 0, is such that, if a; is in a sufficiently small neighbourhood of a, 

| (f> (#) — <j> (a) | < c | x — a |. A Lebesgue chain (see § 78) may be defined 
as follows, so as to reach from a to b. To a point a , of G , attach the part 
of that interval of A that is on the right of a \ to a point a, not belonging 
to 6r, attach an interval («, a x ) such that | <f> (x) — </> (a) | < e (x — «), if 
a < x < a x . This interval may be defined uniquely, by taking (a, eq) to 
be the maximum of all the intervals which satisfy the condition. There 
is one and only one chain from a to x ( b) composed of the intervals so 
defined for every point a such that a ^ a < x (see § 78). 

Now | cj) (x) — (f> (a) | cannot exceed the sum of the absolute values of 
the differences of the functional values at the ends of an interval of the 
chain from a to x. To this sum those of the intervals that are parts of 
the intervals of A contribute a part that is < rj , dependent on e, since (.r) 
is absolutely continuous. The other intervals of the chain contribute a 
part < € (b — a). Hence | <f> (.r) - c/> (a) | < r/ -\- e (b ~ a); and since e is 
arbitrarily small and rj, e converge together to zero, we have </> (x) <f> (a). 

Therefore F ( x) ~ F (a) + | / (x) dx\ and thus F (x) is an indefinite 

J a 

integral. 

The following theorem has been proved: 

If F (x) be an indefinite integral in (a, b), then F (#) — F (a) — I F' (x) dx ; 

J a 

the set of points at which F' (x) may not exist being ignored. 

It has been shewn in § 218 that the sum and the product of two 
absolutely continuous functions are absolutely continuous functions. 

It follows that : 

The sum and the product of two functions which are indefinite integrals 
in (a, b) are both indefinite integrals in the interval. 

407. The necessary and sufficient conditions that F (x) should be an 
indefinite integral have been stated by Lebesgue in another form: 

The necessary and sufficient conditions that F (x) should be an indefinite 
integral in (a, b) are that (1), it should be of bounded variation in (a, b), and 
(2), the total variation over any set of points of measure zero should be zero . 

By the total variation of a continuous function F (x) over any set E 
is meant the limit of the sum of the absolute variations of F (x) in a set 
of non-overlapping intervals A which contains E, as m (A) — m (E), when¬ 
ever such limit exists. 

It has been shewn that, if the condition of the former theorem is 
satisfied, the condition (1) is satisfied. Also the condition of absolute 
continuity ensures not only that the condition (2) is satisfied, but also 
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that it is satisfied uniformly for all sets of points of measure zero. Thus 
the conditions (1) and (2) are necessary. That the conditions are sufficient 
is shewn by the proof in § 406, if we apply (2) to the set <7, in which <f> (x) 
has not a differential coefficient equal to zero. 

The above theorem may also be stated in the following* form: 

If x — f (t), where f (t) is a continuous monotone function of t, and 
a — f (t 0 ), the necessary and sufficient condition that 

/«-/&)= [V V)dt 

Jto 

is that, to every set of points on the t-interval , of measure zero, there shall 
correspond a set of points on the x-interval, of measure zero. 

For the total variation of / (t) over any set of points G lt) , on the 
^-interval, is the lower limit of the measure of a set of non-overlapping 
intervals on the ^-interval which encloses the set G (x) of points that 
corresponds to G (t) ; and this lower limit is m e (G (x) ). Thus the condition 
that the total variation of / (t) over a set G {t) , of measure.zero, should be 
zero, is equivalent to the condition that m (G lx) ) should be zero. 

A continuous function of bounded variation is an indefinite integral, 
if both the monotone continuous functions of which it is the difference 
satisfy the condition of the above theorem. 

It will be observed that the functions which are indefinite integrals 
form a sub-class of the class of functions of bounded variation. 

A function which is expressible as the sum of an indefinite integral and 
of a bounded monotone (not necessarily continuous) function, and which 
is therefore of bounded variation, has been named by| W. H. Young an 
upper semi-integral , or a lower semi-integral, according as the monotone 
function is non-diminishing or non-increasing. 

It has been shewn by W. H. Young that an integral is both an upper, 
and a lower, semi-integral. It is accordingly sufficient, in order that a given 
function may be an indefinite integral, that it be less than some indefinite 
integral by a monotone non-diminishing function, and also greater than 
sojne indefinite integral by a monotone non-diminishing function. 

It can be shewn that the necessary and sufficient conditions that F (x) 
should be an indefinite 12-integral are that it should have bounded de¬ 
rivatives in (a, 6), and that a derivative should be continuous almost 
everywhere. 

408. If F x {x), F % (x) are indefinite integrals, and F (x) be the function 
that is equal to F x (x), for all values of x such that F x (#) s F 2 (x), and to 
F z (x) when F 2 ( x ) > F x (x), then F (x) is an indefinite integral . 

* See Hahn, MoncUshefte fUr Math. u. Phydik, vol. xxm (1912), p. 163. 
f Proc . Lond. Math. Soc. (2), vol. ix (1911), p. 294. 



407-409] Indefinite Integrals 595 

The theorem can be at once extended to the case of any finite number 
of indefinite integrals F x (x), F 2 (x), ... F n (x). The function F (x) has at 
each point x the value of the greatest of the given functions. 

To prove the theorem, it will be observed that, in any interval (x A , x 2 ), 
the variation | F (x 2 ) — F (x x ) | is equal to that of F x (x), if F x (x x ) s F 2 (x 2 ) 
and F x (x 2 ) s F 2 (x 2 ); it is equal to that of F 2 (x) if both the inequalities 
are reversed, and to | F x (x x ) — F 2 (x 2 ) |, or to | F 1 (x 2 ) — F 2 (x x ) |, if only 
one of the inequalities is reversed. Considering the case in which 
F x (x x ) as F 2 (^i), F x (x 2 ) s* F 2 (x 2 ); 

if F x (x x ) & F 2 (x 2 ), we have 0 ^ F x (x x ) — F 2 ( x 2 ) £ F x (a;!) — F x (x 2 ), hence 
I F i (*i) “ F 2 M | £ | F x (x x ) - F x (x 2 ) |; if F x (x x ) £ F 2 (x 2 ), we have 
0 s F, (x 2 ) - F x (x x ) £ F 2 (x 2 ) - F 2 (x x ), 
and therefore | F x (x x ) - F 2 (x 2 ) | £ | F 2 (x x ) - F 2 (x 2 ) |. 

A similar remark applies to the case in which 

F x (x x ) £ F 2 (x x ) y F x (x 2 ) z F 2 (x 2 ). 

Thus the variation of F (x) in (x x , x 2 ) is in any case not greater than the 
sum of the separate variations of F x (x), F 2 (x) in the same interval; hence 
the sum of the absolute variations of F (x), in any set of intervals, is not 
greater than the sum of the absolute variations of F x (x), and of F 2 (x), 
in the same set of intervals. It then follows that, if F x (x), F 2 (x) are 
absolutely continuous in (a, 6), so also is F (x), which is therefore an 
indefinite integral. 


409 . The criterion that a function F (x), defined in the interval (a, b), 
may be the indefinite integral of a function of bounded variation has been 
obtained* by F. Riesz. It is contained in the following theorem: 


The necessary and sufficient condition that the function F (x), defined in 
the interval (a, 6), may be the indefinite integral of a function of bounded 
variation is that the expression 

F (x T+1 - x r ) _ F (x r ) - F (ay,,) J 

where (x 0 , x x , ... x m ) defines a net fitted on to the interval (a, b), and x 0 = a , 
x m = b, should be less than a fixed positive number , independent of the 
particular net. 


That the condition is necessary follows from the fact that 


(av+i) - J_(Xt) = 

^r+l X r x r+l ' . ... 

where is some number between the upper and lower boundaries of / (x) 


1 ?Xr +1 

“ _ f(x)dx = p r ; 


Annales sc. de V&ole normals (3), vol. xxvin (1911), p. 36. 
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in the interval (x r , x r+1 ). When/ (x) is of bounded variation, the expression 
in the theorem is seen to be bounded for all possible nets. For the proof of 
the sufficiency of the condition, reference may be made to Riesz’s memoir. 


THE FUNDAMENTAL THEOREM OF THE INTEGRAL CALCULUS 
FOR A LEBESGUE INTEGRAL 


410 . Let F (x) be a continuous function, defined for the interval (a, 6), 
and let it be assumed that F (x) has, at every point of (a, 6), a differential 
coefficient F' ( x ), that is bounded in (a, b). It has been shewn, in § 401, 
that F' (x) is a measurable function; being bounded, it is consequently 
integrable ( L ). 


Let f n (x) = 


F (x + h n ) — F ( x) 


; where {h n } is a sequence of numbers 


converging to zero; then |/„ (x) | is bounded, for all values of n and x, 
as it has the same boundaries as F' ( x ). By the theorem of § 398, we have 


Thus 


(V (x) 

J a 


lim I f n (x) dx = [ F' (x) dx. 
n~cc J a J a 


dx 


lim . 

hn*' 0 "'n 


rx-\-hn ra+hn 

F (x) dx — F (x) 
J x J a 


dx 


= F (x) - F (a). 


The following theorem has been established: 

If F (x) be a function which possesses , at every point of the interval ( a , b), 
a differential coefficient F' (a;), bounded in that interval , then F' (x) possesses 
an L-integral in the interval (a, x), which differs from F (x) by a constant only. 

This theorem corresponds to the theorem (B), of § 343. For the 
/^-integral, the theorem is subject to the condition that F' (x) should be 
integrable ( R ); but it has here been shewn that the corresponding theorem 
holds without restriction, when Lebesgue’s definition of an integral is 
employed, so long as the differential coefficient is a bounded function. 


411 . The following theorem was given* by Lebesgue: 

If a function <j> ( x) y defined in a given interval , be such that one of its 
derivatives (say D+<f> (x)) is finite at every point , the necessary and sufficient 
condition that the derivative is summable in the interval is that the given 
function be of bounded variation in the interval. 

The indefinite integral of such a summable derivative is the function of 
which it is the derivative. 

To prove the theorem, we assume that, for each value of x in (a, b ), 
the derivative D+<f) (x), of the given continuous function <f> (x), has a finite 

* Lemons sur Vintegrationf-g. 123. For a discussion of his proof, and for corrections to its 
original form, see Atii dei Lincei, Rendiconti y (5) vol. xv (1) (1906), B. Levi, pp. 433, 551,674; (5) 
vol. xv (2) (1906), Lebesgue, p. 3, B. Levi, p. 35S. Also (5) vol. xvi (1) (1907), Lebesgue, pp. 92,283. 
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value. Let the unbounded interval (— oo, oo) be divided into intervals 
(c t , c t+1 ), where the integer i has all positive and negative values, including 
zero, for which we assume c 0 — 0; also let c l+1 — c t < e, for all values of i. 
Let e t denote that set of points x, in (a, b), for which c t < D+<f> (x) £ c tfl ; 
and arrange the sets e t in the order e 0i e Xi e~ lf e 2i e_ 2 , ... e n , e_ n , _ 

Let k 0 , k x , k_ x , & 2 , k_ 2 , ... jfc n , k_ n , ... be a sequence of positive numbers, 
all less than 1, so chosen that the limiting sum of 

K + h\ C 1 | + &-1 | r -l | +• ••• + k n I C n I + k-n | C -n | + ••• 

is less than e. Let the set e 0 be enclosed within non-overlapping intervals 
of a set \\ and the complementary set C ( e 0 ) within intervals of a set A 0 ', 
so that the measure of that set of intervals which is common to A () , A 0 ' 
does not exceed k 0 . Enclose e, in a set of intervals A 1? and C (e 0 -f- e x ) in 
a set of intervals A/; where A^A/ are both within A 0 ', and have in 
common a set of intervals of measure not exceeding k x . Proceeding in 
this manner, we enclose e p , where p is positive or negative, in a set of 
intervals A„, and (e 0 + e, + e_, + ... + e p ) 

in a set of intervals A/, so that A p , A/ are both interior to A,/, where q 
immediately precedes p, and such that A p , A,/ have in common a set of 
intervals of measure not exceeding k v . 

We have m (A p ) — m (e v ) < k V3 and A p has in common with all the other 
sets A a set of intervals whose measure is ■; k 0 f k x + k_ x + ... 4- k v , 
say K p . 

Since 2 | c v | m (A p ) — 2 | c p | m ( e v ) does not exceed 2 k p | c v |, or is 
p p 

< e, we see that 2 | c P | m (A P ) and 2 | c p | m (e P ) are either both divergent, 

V V 

or both convergent; and in the latter case the difference of their limiting 
sums is less than e. 

Since | c p | — e < | D+cf> (x) | < | c p | 4- £, in the set e p , we have 
2 | c P | m (e p ) — € (b- a) < f | /)+ <f> (x) | dx < 2 | c p | m (e p ) -f e (b - a). 

p J a p 

rb 

Hence, if 2 | c p | rn (e P ) is convergent, I | D+ <j> (x) | dx is finite; and 

p J a 

conversely. It has now been shewn that the necessary and sufficient 
condition that /■& 

| D+ <f> (x) | dx 
J a 

exists, as a finite number, is that 2 | c v | m (A P ) should be convergent. 

v 

Any point x, in (a, 6), belongs to one of the sets e p ; let be that 
interval, of the set A p , which contains x. Let (x, x -f h) be the longest 
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interval, on the right of x , contained in b p , of length not greater than e, 
and satisfying the condition c p — c £ I (x, x + h) £ c P+1 + c. It follows that 
h {| c„ | - 2e} < | <f> (x + h) - <f> (x) | < h {| c p | + 2c}. 

A unique Lebesgue chain, reaching from a to 6 (see § 78), may be 
defined by means of these intervals (x, x + h). For all left-hand end¬ 
points x a , of the chain, that belong to one and the same set e p) let B v 
denote the intervals of the corresponding part of the chain. Then 

S | <f> (x a + h) - <f> (x a ) | 

a 


is between the two numbers \c P \m (B P ) ± 2ew (B P ) ; therefore, for the 
whole chain, 2 | <f> (x + h) - <f> (x) | 

is between the two numbers 2 | c p | m (B v ) ± 2c (b — a). 

V 

Now 2 | c p | m (B P ) ^ 2 | c P | m (AJ, and therefore 2 | c p | m (B P ) con- 

p 

rb 

verges if I | D+ <f> (#) | dx is finite. 

J a 

It has now been shewn that, if D+<f> ( x) is summable in (a, b), 

2 | <f> (x + h) - <f> (x) | , 

the total variation of <f> (x) over the Lebesgue chain, is less than 

2 | c P | m (B P ) -f 2c (b - a ), 
v 

or than 2 | c P | m (e P ) -f c -f 2c (6 — a), 

v 

and this is less than 


We have also 


| D + (f> (x) | dx -f c + 3c (b — a). 


1 <f> (b) - <f> (a) | s E | <j> (x + h) - <f> (x) |, 

and therefore 

| <f> (6) — <j> (a) | < f | D+ <f> (#) | dx -f c -f- 3c (b — a). 

J a 

Since | D+<f> (x) | is integrable in any interval (a, ft) contained in (a, 6), 
the last inequality may be applied to the interval (a, f$). If (a, b) be 
divided into m parts ( a r> £ r ), where r = 1, 2, 3, ... m \ we have 

r - to rb 

2 | <f> (j3 r ) — <f> ( a r ) | < | | <f> (x) | dx + me + 3me (b — a ); 

r »1 J a 

and since c is arbitrary, we have 

r “To fb 

2 I <f> iPr) - <f> («r) I s | -D + <M*) I dx. 

r -X ./ a 

From this we see that <f> (x) must be of bounded variation in (a, 6), 
rb 

in case / Z>+ <f> (x) dx exists. 
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Next, let it be assumed that <f> (x) has bounded variation in (a, 6), then 
2 | <f) (x -f h) — cf> (x) | taken for the intervals of the chain is convergent 

0 O 

(see § 243), and therefore 2 \c P \m (B P ) converges to a finite number. 

p ~ i 

Take a fixed value P, of p. All points of A P that are not in an interval 
of B P necessarily belong to intervals of A q , where q 4= p, and their measure 
does not exceed K P ; hence 

P i P )c,\m (A„) - P i P | c P | m (B v ) \ K v \e,\. 

P“1 P “ 1 P -1 

P»P 

The numbers Jc can be subjected to the condition X K P | c P | < e, for 

p *-1 

p-p p -p 

every value of P; we then have X \c v \m (A*,) — e < X \c P \m ( B p ). 

p -1 v “1 

It follows that 


i, | c„ | m (e„) - e < r £ | c p \ m (B v ) < £ | c„ | m (B P ) -|- t, 

P *“1 P -1 P-1 

0 O p - P 

and therefore, if X | c,, | m (B p ) is finite, so also is X | c P | m (e P ), and 
p - j p ~ l 

this last sum is bounded for all values of P; it therefore follows that 

b 

| Z> + <£ (x) | dx exists. It has thus been shewn that it is a sufficient con- 

a 

rb 

dition for the existence of Z> + (f> (x) dx, that </> (x) should be of bounded 

J a 

variation in (a, b). 

In any interval of the chain, we have 

h (c P — e) £ <f> {x + h) — (f> (x) £ h (c p+1 -f e); 
therefore, for that part of the chain for which p £ P, 

X c p m (. B v ) — € (b — a) s X {cf> (x -f- h) — <f> (a:)} ^ X c P m (B P ) -f e (6 — a). 

p" i p - i 

p 

It now follows that X {<f> (x -f h) — <f> (#)} lies between the numbers 

p-p 

X c p m (e p ) ±(6-a+l)e. 

p » i 

P ~ P CO 

Now P may be so chosen that X c P m (e p ) differs from X c p m (e P ) by 

p-i p-i 

less than e; then X {</> (x -f h) — <j> (a:)}, for the whole chain, is between 
00 

the two numbers X c p m ( e p ) ± (b — a + 2) c. Therefore <f> (b) — <f> (a) is 

p-i 

between 


f D + <f> (x) dx ± (b — a + 3) c; 
J a 
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and since e is arbitrary, we have 

I <f> (x) dx — <f> (b) — <f> (a) ; 

• a 

and in general f D+ (x) dx — cf> (x) — <f> (a), 

a 

if x is in (a, b). 

The theorem may be proved for the case in which any one of the other 
derivatives is employed, in a precisely similar manner. We have then the 
following theorem : 

If <f> (x) be of bounded variation, and have its four derivatives finite at 
each point, we have 

<f> (b) — (f> (a) — f D v <f>[x)dx—{ D^<l>(x)dx—[ D <f>(x)dx~A I)_<f>(x)dx. 

J a J a J a -a 

This holds for every interval (a, ft) contained in ( a, b ); in every such 
rfi 

interval I {D^ (f> (x) — D + (f> (x)} dx — 0; hence (see § 394) we have 

D+ 4> (.X) = D + <f> (x), 

almost everywhere. Similarly, by taking the other derivatives in pairs, 
we see that, almost everywhere, all four derivatives are equal, and thus 
a finite differential coefficient exists. This is a particular case of the 
theorem, established in § 298, that any function of bounded variation has, 
almost everywhere, a finite differential coefficient. 

We have, further, the following theorem: 

If $ (x) be of bounded variation, and one of its derivatives D(f> (x) is finite 
at every point of the interval (a, b), then 

[ <f>’ (x) dx — <f> (b) — <j> (a ); 

J a 

the integral being taken over the set of points at which <f> f (x) exists, and is finite. 
In particular, the theorem holds if <f>' (x) exists at every point, and is finite. 

412 . The theorem of § 411 may be applied* to obtain a proof of the 
theorem already established, in § 298, that a function of bounded varia¬ 
tion, and in particular, any monotone function, has a finite differential 
coefficient almost everywhere. 

As any function of bounded variation is the difference of two monotone 
non-diminishing functions, it is sufficient to consider the case of a monotone 
non-diminishing function / (x). All the derivatives of f (x) are 2 0, hence 
those of / (#) -f kx = <f> (x) are all £ k, where k is any chosen positive 
number. It will be sufficient to prove the theorem for <f> (x). Let <f> (x) = 
and consider the inverse function x = F (£), regarded as defined on the 

* See W. H. Young, Quarterly Journal of Math. vol. xlii (1911), p. 79. 
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^-segment (<f> (a), <f> (b)), which is also monotone non-diminishing. Whether 
<f> (x) is continuous or not, F (|) is a continuous function of £. All the 
derivatives of F (£) are in the interval (0, k J ); and thus, as these are all 
finite, F (f) has a differential coefficient for almost all values of If K 
be any fixed number > k 1 , we have Ax < KAtj, and thus we have 
EA.r < KZ Af. From this it follows that, to any set of points of measure 
zero on the ^-segment, there corresponds a set of points on the ^-segment, 
of measure zero. Therefore x — F (£) has a differential coefficient F' (£), 
for almost every value of x; and thus (f> (x) has a differential coefficient 
almost everywhere. Each point on the ^-segment at which F' (£) — 0 can 
be enclosed in an interval A£, such that Ax < eAf, where e is arbitrarily 
chosen. It follows that, if E be the measure of the set of points on the 
£ segment at which F' (£) — 0, the measure of the corresponding set on 
the ^-segment is < e {m (E) f r/}, where rj is arbitrarily small; and since 
e is arbitrary, it is zero. Thus the points on the #-axis, for which F' (£) = 0, 
and consequently </>' (#) — 4- oo, form a set of measure zero. Therefore 
<}>' (x) exists, and is finite, almost everywhere in (a, b). 

413. There remains for consideration the case in which a continuous 
function f (x), of bounded variation, is such that one of its derivatives, 
say D [ (f) (x), is not everywhere finite. 

If </> (x) is continuous, and D<f> (x) is finite, except at the points of a 
reducible set G, and if 

D<f> (x) dx 

- a 

exists as an L-integral, then 

f D(j> (x) dx = <f> (x) — cf> (a). 

J a 

Consider the closed enumerable set G'. If (a\ x) be interior to an 
interval contiguous to G', then 

cf> (x) — (f> (a') = f D(f> (x) dx\ 

• a' 

and therefore 

<f> (x) — f Dcf> (x) dx — <f> (a') — f Df> (a:) dx. 

J a 'a 

It follows that the continuous function 

(f> (x) — f D<f> (a;) dx 

J a 

is constant in each interval contiguous to G', and therefore, since G ' is 
enumerable, it is constant in the whole interval (a, b), and equal to <f> (a). 
We have therefore the following theorem: 

If <f> (x) be a continuous function, and if one of its derivatives D<j> (x) be 
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infinite only at the points of a reducible set in (a, 6), and is summable in 
(a, 6), when those points at which it is infinite are ignored , then 

I Dcf> (x) dx = <f> (x) — <f> (a), in (a, b). 

J a 

The condition that D<j> (x) is summable may be replaced by the condition 
that <f> (x) should be of bounded variation. 

If the set G were irreducible, but non-dense in (a, b ), the set O' would 
contain a perfect component H. The reasoning given above would shew 
that 

</> (x) — | D<f> (x) dx 
J a 

is constant in each interval contiguous to H , but it would not follow that 
it would be constant in (a, b). In fact it may be a continuous function 
with an everywhere dense set of lines of invariability (see § 269). In this 
case the relation , x 

Dfi (x) dx = <f> (x) — (f> (a) 

J a 

does not in general hold good; but we have the following theorem: 

If <f> (x) be a continuous function , with one of its derivatives D<f> (x) 
infinite at the points of a non-dense set G f such that G' consists of a perfect 
set II and an enumerable set (which may be absent ), and if D<f> (x) possesses 
an L-integral in (a, b), then 

f Dcf> (x) dx = <j> (x) — <f> (a) -f U (x) — U (a), 

J a 

where U (x) is a continuous function with an everywhere dense set of lines of 
invariability. The condition that Dcf> (x) is summable may be replaced by 
the condition that <f> (x) should be of bounded variation. 

It has already been shewn, in § 405, that, if <f> (x) be continuous, and 
of bounded variation, D<f> (x) is summable, when taken over the set of 
points (necessarily of measure equal to that of the whole interval) at which 
it is finite. 

414. It has been shewn in § 405 that, for a function / (x) } which is con¬ 
tinuous, and of bounded variation, f (x) is summable in the interval for 

which f(x) is defined. The indefinite integral f /' (x) dx is however not 

~ a 

necessarily equal to f (x) — / (a), unless f (x) satisfies the further condition 
that it should be an indefinite integral. In accordance with § 407, this 
condition may be expressed in the form that the total variation of / (x) 
over any set of points of measure zero should be zero. 

In the general case of a continuous function of bounded variation 
which is not subject to this last condition, the relation of the function to 
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the indefinite integral of/' (x) is expressed by the following theorem, due* 
to de la Vall6e Poussin: 

If f (x) be a continuous function, of bounded variation in the interval 
(a, b), then r x 

/(*)-/ («) = /' (x) dx + Vj (x) - F 2 (x), 

J a 

where V x ( x) is the total variation of the monotone continuous function f 1 (x) 
over the set of points in (a, x), at which one of its derivatives Df x (x) is not 
finite , and V 2 (x) is similarly defined , relatively to f 2 (x); and f x ( x),f 2 (x) are 
two continuous monotone non-diminishing functions whose difference is f (x). 

It will be sufficient to assume that / (#) is monotone, either non¬ 
increasing or non-diminishing; the result will be obtained by considering 
the difference of two such functions. 

Let <f> N (x) = | Df (x) |, at all points at which | Df (x) | > N, and 
</> N ( x ) = 0, when | Df (x) | & N; thus <f> N (x) — Df (x) ^ — N. Let the set 
of points E, at which Df (x) is not finite, be enclosed in a set A, of non¬ 
overlapping intervals; the set A can be divided into a finite set A l5 and 
an infinite set A 2 , such that the sum of the variations, E {/ (x r ) — f (x r _ 1 )}, 
taken for the intervals (x r __ x , x r ) of A 2 is numerically < e; these variations 
are taken with their proper sign. 

Let V x (x) be the sum of the variations, each with its proper sign, of 
the function fjr 

f(x)~ {Df (x) - <f>j, (x)} dx, 

J a 

for the intervals of A x that lie in (a, x ); and let V 2 (x) be the sum of the 
variations of f(x), taken positively, for the intervals of A 2 that lie in 
(a, x); we thus have V 2 (x) £ V 2 (b) < e. If x is within an interval of A lf 
or of A 2 , the variation in the part of that interval on the left of x is counted 
in V x (x), or V 2 {x). 

Let us consider the function 

X (x) =f (x) - f {Df (x) - <f> N (x)} dx - V x (x) + V 2 (x). 

J a 

In an interval of tjie set A x , V x (x) and 

f{x) - ( {Df (x) - <f>y (x)} dx 

J a 

both increase by the same amount, or both diminish by the same amount, 
and V 2 (x) does not vary; thus such an interval is one of invariability of 
X (x)- In a finite interval complementary to A^ V x (x) does not vary, and 

( 4> N (x)dx, V t (x) 

J a 

* See the Cours d'Analyse, 2nd ed., vol. i, p. 269. 
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do not diminish. The function 

f(x)~ [ Df (x) dx 

J a 

has a derivative which is equa] to zero, for almost all values of x\ and it 
follows that the points where the derivative of y (#) negative form a 
set of measure zero, in case there are any such points. 

Moreover, Z>y (#) has nowhere the value — oo, for x ( x ) is the sum of 

[ i<t>N ( X ) ~ D f ( X )) dX ’ 

J a 

of which the derivative is ^ — N, and of 

/ (x) - V x (x) -f V 2 (x), 

of which the derivative is finite, except at points in A 2 , and at such points 
cannot be negative, as in any interval ( x , x -f h) contained in an interval 
of A 2 in which / (x) diminishes, / (x) 4- V 2 (x) does not vary, and also V l (x) 
does not vary. Since, in an interval complementary to A x , Dx ( x ) has 
nowhere the value — oo, and is certainly > 0 almost everywhere, it follows 
from the theorem in § 403, that it is nowhere negative, and therefore y (x) 
never diminishes; for its incrementary ratios are all £ 0. It follows that 
X (x) is a monotone non-diminishing function; thus y (x) ^ y (a), or 

f(x)-f (a) s f {Df ( x) - <f> N (a;)} dx - V 1 (a;) + V 2 (x). 

J a 

Since this holds for an arbitrary value of e, let e — 0, then V 2 (x) ~ 0, and 
V 1 (x) converges to the total variation of 

f(x)-( {Df (x) - f> N (x)} dx, 

J a 

over the intervals of A, each variation having its proper sign. Now let 
N ~ oo, then * x 

j <j> N (x) dx - 0, 

. a 

and V x (x) converges to the variation, taken with its proper sign, of 

f (x) — f Df (x) dx in A; 

J a 

this may be denoted by V A (x). Now let m (A) ~ 0, then the variation of 
j Df(x)dx i in A, converges to zero, and V±(x) converges to V (x) } the 

• a 

total variation, with its proper sign, of / (x) over the set E. We then have 

! (pc)-f (a) 2 f Df (x) dx + V (x) ; 

J a 

we can now change f (x) into — f (x), when V (x) changes to — V ( x); we 
then find that rx 

f(x)-f (a) s I Df (x) dx + T ( x). 

J a 
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From the two inequalities, we have 

/(*)-/ («) ^ [ Df (x) dx + V (x) = [ f (x) dx 4 V (x) t 
J a J a 

where / (x) is monotone, either non-diminishing or non-increasing. 

When / (re) is of bounded variation, and the difference of two monotone 
non-diminishing functions, f (a) f P (rr), N (x), we have 

/ (*) - f (a) = f> (*) I T 1 (x) - V, (x). 

. a 

THE TOTAL VARIATION OF AN INDEFINITE INTEGRAL 

415. it has been shewn in § 404 that the indefinite integral F (x) of a 
function f (x), summable in ( a , b), has its total variation 

Va F (x) - I | / (x) | dx. 

J a 

It can however be proved that 

V„F(X)= p|/(*)l dx. 

J a 

Let the set E 1 , of points at which / (x) & 0, be enclosed in a set of 
intervals Aj, of which the measure exceeds that of E x by an arbitrarily 
small amount; a finite set A x of these intervals can then be chosen such 
that the difference of m(\) and m (E x ) is arbitrarily small. Similarly the 
set E 2 , of points at which / (x) < 0, can be enclosed in a finite set A 2 , of 
intervals, so that m (A 2 ) and m (E 2 ) differ by an arbitrarily small number. 
Further, A l3 A 2 , can be so chosen that the measure of the common part 
of Aj, A 2 is arbitrarily small. 

We have/ (x) = / + (x) — f~ (x), where/+ (x) — 0, over E 2 , and f~ (x) — 0, 
over E x . Remove from A 1 , and from A 2 , the intervals of the set D (Ai, A 2 ), 
which they have in common; there remains then a finite set of intervals A, 
consisting of the remaining parts of the intervals of Aj and A 2 . The sum of 
the absolute variations of F (a;) over the intervals A differs from the sums, 
added together, of the absolute variations over Aj, A 2 , by not more than 

2 ( _ _ l/» I *x ', 

a,) 

and this is arbitrarily small. The sum of the absolute variations over the 
intervals of A x and A 2 is 

2 f / (x) dx + S f / (x) dx , 
h*>) J (&*) 
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where A x = {SJ, A 2 = {8 2 }; and this differs by an arbitrarily small number 

from r r 

I /+ (x)dx 4- I /" (x) dx , 

■/(A,) -V.) 

rb 

which differs by an arbitrarily small amount from | / (x) | dx. Therefore 

a set of intervals A can be determined such that the sum of the absolute 
variations of F (x) over them differs by an arbitrarily small amount from 


i/(*) i dx \ 

J a 


consequently this integral is, in accordance with the definition of § 243, 
the total variation of F (x) over (a, b), since that total variation cannot 
exceed the integral. We have further 


and thus 


Vl F (X) 

d 

dx 


= [ i /(*) 

J a 

V T a F (x) 


dx , 


exists, and is equal almost everywhere to | / (x) I. 

The variation of F (x), over any set E , has been defined (see § 252) as 
the lower boundary of the sum of the variations of P (x) and N (x) \ P (a:) 
and — N (x) denoting the positive, and the negative, total variation of 
F (x) in (a, x ), taken for a set of intervals (a n , f3 n ) which enclose the points 
of Ey when all possible such sets of intervals are taken into account. 

We can shew that the variation of F (x) over E is equal to 

I I / (x) I dx. 

J(E) 

For the integral of \f(x) | over (a ni fi n ) is the sum of the variations of 
P (x) and N (x). 

The above theorem holds also* in the case of functions of two, or 
more, Variables, defined in a rectangle, or cell. In the two-dimensional 
case, the total variation of F (x (1) , x (2) ) over a measurable set E is the 
lower boundary of 

E | {F (jS.™ j3 n ( *>) - F «n (2) ) - F («.<» jS n < 2 >) + F («.<», a™)} |, 
where the rectangles enclosing E are denoted by 


* Lebesgue, Annales sc. de Tdealt normale (3), vol. xxvn (1910), p. 383. 
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THE GENERALIZED INDEFINITE INTEGRAL 

416. If / (x) be measurable in a fundamental interval, or cell, in which 
it is defined, and E denote any measurable set of points in that interval, 

or cell, / (x) dx = <f> (E) may be called the generalized indefinite integral 

'(E) 

of / (x). It has been shewn in § 392 to be a function of E which converges 
to zero, as m (E) does so, uniformly for all such sets; and it has been 
shewn to be a completely additive function and also an absolutely con¬ 
tinuous function. 

Let E + BE be a measurable set which contains E, and let E — BE be 
one which is contained in E. A function cf> (E) is said to be continuous, for 
the particular set E , if 0 (E 4- BE), <f> (E — BE) converge to cf> (E), as m (BE) 
converges to zero. This is the case if, for every positive number e, 

| 0 (E ± BE) - 0 (jff)J < e, 

provided m (BE) < ; where 17 , depends upon c, and converges to zero 

as c does so. 

If, for every value of e, a value of rj € can be so chosen as to be the 
same for all measurable sets E in the given domain, the function 0 (E) 
is said to be uniformly continuous in the given domain. 

We obtain now the theorem that: 

If f (x) be a summable f unction, defined in a fundamental cell, or interval, 
f (x) dx is a uniformly continuous function of E. 

■ (E) 

For [ / (x) dx - f / (x) dx = [ / (x) dx 

J(E+SE) - (E) J (SE) 

and [ f (x) dx — f / (x) dx = f f (x) dx ; 

J(E) J(E-SE) J(6E) 

and the integral on the right-hand side is numerically < e, for all sets BE, 
of which the measure is less than a number rj f) dependent only on e. 

rx 

That f (x) dx, f (x (1) , x (2) ) d (x il) , x (2) ) are absolutely con- 

.'o .(a (1 ),oO)) 

tinuous with respect to x, or (x (1) , x (2) ), are particular cases of this theorem. 

417. The following theorem is the analogue of the property of a con¬ 
tinuous function of a single variable, given in § 214, that the function 
takes every value between its upper and lower limits in the interval for 
which it is defined : 

If (f> (e) denote the generalized indefinite integral f (x) dx, for all 

J (e ) 

measurable sets e, contained in a given bounded measurable set E, for which 
the summable function f (x) is defined, e can be so determined that 0 (e) has a 
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prescribed value lying between the upper and lower boundaries of <f> (e) for 
all sets e, in E. 

In the first place we see that, if E is a measurable set, in any number 
of dimensions, a component F , of E , can be so determined that m ( F ) has 
any prescribed value between 0 and m (E). For if x {l) be one of the co¬ 
ordinates which determine the position of a point in space, the section 
E (£), of E , between a plane x n) — £ and a fixed plane x (1) =- a, where 
those points of E for which x (1> < a form a set which has a measure Jess 
than the prescribed number, is such that its measure m {E (£)} is a con¬ 
tinuous function of and thus the theorem follows from the theorem of 
§214. 

We have now f (x) dx — \ f (x) dx -\- f (x) dx , where e 1 is a 
he) j(e j) J(e t ) 

component of E x , that part of E in which / (x) is positive, and where e 2 
is a component of E 2 , that part of E in which/ (x) is negative. The upper 

and lower limits of / (x) dx, in E , are clearly / (x) ax, and / (x) dx, 

he) hvi) h&») 

which may be denoted by rn ((f) and — m (G 2 ), respectively, where G x , G 2 
are sets whose dimensions exceed that of E by unity. Any number C 
between these two can be expressed by A — B, where 
A < m (G x ), B > m ((f). 


The set e x can be so determined that f (x) dx = A, and & 2 can be so 

heo 

determined that f 

f (x) dx — — B; 

he t ) 

then if e = e x -f e 2 , we have / (x) dx = C. It should be observed that 

he) 


G x and G 2 are unbounded in case / (x) is unbounded in E, but they have 
finite measures. 


THE INDEFINITE INTEGRAL OF A FUNCTION OF TWO VARIABLES 

418. It should be observed that the indefinite integral 

r(xM,a<V) 

F (x< X >, X< 2 >) = / (x* 1 *, X< 2 >) d (x<*>, X< 2) ) 

J (aU>, a<*>) 

can be expressed as the difference of two monotone non-diminishing 
functions, which are also quasi-monotone (see § 255). 

For, let f x (x (1) , x (2) ) = / (x (1) , x {2) ), at the points at which the function 
/ is & 0 , and let f 2 (x (1) , x (2) ) = — / (x (1) , x (2) ), at the points at which / is 
negative; thus/(x (1) , x (2) ) = f x ( x (1) , x (2) ) —/ 2 (x (1) , x (2) ), and 
F (x<!>, x< 2 >) - Fj (*<« x< 2 >) - F 2 (iX <*>, x< 2 >), 
where F x and F % are the indefinite integrals of f x and f 2 respectively. 
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The two functions F 1 (x {1) 9 x (2) ), F 2 (.r (1) , x i2) ) are both monotone and 
non-diminishing, in accordance with the definition of § 253. Again, with 
the notation of § 254, 




(x"',x {2) ) 


r(x<') t jf(«) 




j\ (x"\ X (2) ) (I (x< l \ X {2 ') 


with a similar result foh F 2 . It follows that the functions F x , F 2 are quasi¬ 
monotone, in accordance with the definition of § 255. 


419. If the function / (x {l \ ;r (2) ) be summable in the cell 
(a (1 >, a (2 >; 6 < 2 >) 

it has been shewn* by Fubini and Toneili that the indefinite integral 


denoted by F (x 
the value / (x (1 \ . 


jc*-) ) 


/ (x {1) , x {2) ) d (x {1 >, a;< 2 >), 


J(a<'K a< 2 >) 

(1) , x (2) ), has the property that 


d 2 F 


exists, and has 


dx (1) dx (2) 

r (2) ), at almost all points of the cell; and thus that 

02 02 J? 

_ _ _ __ — n r <i) 

dx {1) ax {2) r)x {2) dx {1) J ' ’ ' 


almost everywhere in the cell. This result has been however deduced! by 
W. H. Young from the more general theorem which he established, that 
any function F'(x (l) , x {2) ) that is of bounded variation in the cell, in 

02 f 

accordance with the definition in $ 254, has the property that 12 V <tj 

exists almost everywhere in the cell. This property we proceed to establish. 

It is sufficient to consider the case of a quasi-monotone function 
F (x (1) , x (2) ). 

dF 

Since F is monotone with respect to # (1) , exists, for each fixed 

value of x {2) , for all values of x a) , with the exception of those belonging 
to a set $ (x < 2 )) of linear measure zero. Let # <2) have all rational values in 
the interval (a {2) , 6 (2) ), then all the linear sets $ (r <>)) taken together form a 
set S, of linear measure zero. For every value of x (1) that does not belong 
dF 

to S , exists for all rational values of x {2) . 

dx a) 


Consider any value of x (1) that does not belong to S, then 
F{x<" + h,x {2) ) — F (x (l \ x {2) ) 
h 

is monotone non-diminishing with respect to x {2) , and the two functions 
DF (x (1) , x (2) ), £)F (x {1 \ x {2) ) which denote the greatest and least extreme 


* Rend, del Circ. Mat. di Palermo, vol. XL (1910), p. 295. 
t Comptes Rendus, Paris, vol. clxiv (1917), p. 622. 
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derivatives of F (x (1) , x i2) ), with respect to x U) , are also monotone non¬ 
diminishing with respect to x {2) . Consequently, for the fixed value of x {1) , 
they are both continuous functions of x {2) , except for an enumerable set 
of values of x {2) . For a value of x l2) for which they are both continuous, 
each of them is the limit of a sequence obtained by giving x (2) rational 
values only. But when x {2) is rational, DF ( x (1) , x (2 *) and RF (x {1) , x (2) ) are 
dF 

equal, since ^ exists at such points. Hence, for any value of x (2) , not 

belonging to the enumerable set, the two functions DF, RF are equal, as 
they are the limits of one and the same sequence of numbers; and therefore 
dF 

^ (1 - } exists. It thus appears that, for each value of x (1) , not belonging to 

dF 

a set S of linear measure zero, exists, for all values of x (2) , except 
those of an enumerable set. 


When £ (1) has a fixed value not belonging to S, the two functions 
DF (x"\ x (2) ), RF {x"\ x {2) ), 

being monotone functions of x {2) , have differential coefficients with respect 
to x (2) , for almost all values of x (2) ; thus 

DF (x (1) , x (2) 4 - k) - DF (x {1) , x {2) ) DF (x"\ x< 2) 4 k) - RF (x (1) , x (2) ) 
k ’ k 


both have definite limits, as k ~ 0 , if x {2) does not belong to a certain set 
of measure zero. A sequence of values of k, converging to zero, may be 
so chosen that none of the values of x (2) + k belong to the exceptional set, 


which consists of the enumerable set for which 


dF 
dx < x > 


does not exist, and 


of the set of measure zero at which the limits are not definite. For this 
sequence of values of k , D (# (1) , x [2) 4- k) — D (x {l) , x i2) 4- k); and also 
T>F (x { 1) , x l2) ) = RF ( x {1) , x {2) ), if x {2) does not belong to the exceptional 
set; hence the two sequences are identical, and their common limit is also 


lim j 

k~ 0 # 


dF (x ll \ x< 2) 4- k) dF x {2) ) 


dx™ 


dx (l > 


* > 


and therefore 


d 2 F 


dx (2) dx™ 


exists. 


When all values of x™ are considered, the exceptional sets all belong to 
a set of plane measure zero. The following theorem has now been estab¬ 
lished : 


If F {x {1) , x {2) ) be a quasi-monotone function, defined in a given cell , 
d 2 F (x™, x™) d 2 F (x™, x™) 
dx^dx^~~ y ' dx™dx™ 
both exist , almost everywhere in the cell. 

If F ( x a) , x {2) ) be an indefinite integral, it may be expressed (see § 386) 
as the difference of two functions F x (# (1) , x {2) ), F t (x {1) t x™) which are the 
indefinite integrals of two non-negative functions f x (x a) , x m ),f % (x {1 \ x (2> ). 
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whose difference is the function / (# (1> , # (2) ), of which F (x {l) , x m ) is the 
indefinite integral. 

Consider the quasi-monotone function 

rzO) rx<*) 

F 1 (x (1 \ x n) ) ~ (x' 1 ', x (2 >) dx'u.dx'*', (see § 255). 

J 0 (‘) 

7)F 

For a fixed value of x (2) , exists, and is equal to j (x (1) , x {2) ) dx (2) , 

for almost all values of x {1) ; and when this exists, its differential coefficient 
with respect to x {2) exists, and is equal to f x (x {l K x {2) ), for almost all values 
of x i2) . It thus follows from the proof of the foregoing theorem, that 
d 2 F, 

dx^dxf 1 * ex i s f s an d is equal to/ x (# (1) , x {2) ) almost everywhere in the cell. 

As the corresponding result holds for the function F 2 , we have the theorem 
that : 

If F (x (l \ x {2) ) - / (a; (1) , x< 2) ) d {x {1 \ x {2) ), where f (x"\ x {2) ) is 

J a (a >) 

summable in the cell (a (1) , a (2) ; 6 (1) , 6 (2) ), then , almost everywhere in the cell , 

d 2 F d 2 F 

... = . — f ( x <d x {2) ) 

dx™dx™ dx™dx™ J{ ’ '’ 

This is the analogue of the theorem for an integral of a function of a single 
variable, given in § 405. 

419 1 . The function F (a; (1) , x l2) ) being defined in the cell 
(a™, a (2) ; 5 (1) , 6< 2) ), 

let us consider at any interior point (x {1) , x (2) ), the ratio 
1 A (;cO>+fc, *<*>+*) ^ . ... i2) 

pA ( ^ UJ)) F (x {l) ,x i2) ) : 

where h is positive and such that (x a) + A, x i2> -f h) is within the cell 
(a (1) , a (2) ; 6 (1) , 6 (2) ). We may define the two numbers D++ F(x ), D++ F (x), 
by the relations 

D++ F (*) = iim -i F * (2, )> 

h~ 0 n 

Z) ++ F (x) = Urn p A$CX" +h) F (* (,, > * 12 ’)- 


/»M> 


In a similar manner we can define six numbers 
D+~F{x ), D + _ F (x), D~+F(z), D_+F(x), D ~ F (x), D „_F(x) ; 

for example, D + ~ F (a;) = lim —i, r A(^!> + j<«f ** F (x), 

h~Q — n 


D. + F(x) = hm ~ * +w F (x), 

fc-0 n 


D— F (x) = lim ~ A) F (a:). 
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The eight numbers, so defined, may be termed the eight special derivatives 
of F (z (1) , x l2) ) at the point (x (1) , a; (2) ). When, at a point, the eight special 
derivatives have one and the same value, finite or infinite, their value is 
said to define that of the special differential coefficient at the point. 

In the case of a ^-dimensional function, 2 P41 special derivatives can 
be defined in an analogous manner. 

If F (x (1 \ x {2) ) be continuous in the cell , the eight special derivatives 
(finite or infinite) are measurable functions of ( x {1) , x {2) ). 

For example, consider Z> ++ F (x), we may then denote the incrementary 


ratio 


1 A (jr(»+*, *<«+*> ^ . (1) 

y t 2 A U (1 U< 1 )) * \ x 


V (1 \ -C (2) ) 

by 1 (x, x -f h), where x symbolizes (x (1) , x {2) ). The proof of the corre¬ 
sponding theorem for functions of a single variable, given in § 401, is then 
applicable, without essential change, to the present case. 

It follows, as in § 401, that the set of points in which a special deri¬ 
vative I)F (:r a) , x {2) ) of a continuous function is finite is measurable. 

419 2 . It will now be proved that: 

If the function F (x (1) , x (2) ) is continuous and of bounded variation in 
the cell (a (1) , a l2) ; 6 (1) , 6 (2) ), a special derivative DF (z (1) , x (2) ) is finite except 
at the points of a set of plane measure zero. 

Consider the special derivative D ++ F (a? (1) , x (2) ); let K denote the 
total variation of F (x (1) , x (2) ) in the cell (a, b). Let it be assumed that, 
if possible, the measure of the set of points L at which D ++ F ( x a) , x {2) ) 
is infinite has a value A (> 0). Let the set of cells 


(« ( 


>; « (1 > + h, a {2) + h) 


all interior to (a, 6), in which h has the values in a sequence converging to 
zero, and o jr 

| A (a( i),„(.)) F (x {1) , x {2) ) I > -y h 2 , 

be denoted by C. It is clear that L belongs to the set G, the inner limiting 
set defined in § 136 1 , in connection with the set C. In accordance with 
Vitali’s theorem there exists an enumerable set of cells, all belonging to ( 7 , 
and no two of which have a point in common, such that the sum of their 
measures is £ A. Of these cells there are a finite number such that the 
sum of their measures is > £A. Let these be the cells 






( 2 ) 


; a 2 (1) + h 2 , a 2 {2) -f h 2 ),... 


We have 


(« n (3) , a n (2) ; a n {1) 4- h n ,a n < 2 > + h n ). 


S I 

r -I 


A^,r +W F (* a \ x™) 


2K „ 
> JA.-y- > K; 


and this is not possible, because K is the total variation of F (x w , ar <2) ) 
in (a, b). It follows that the measure of L must be zero. 
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A special derivative of a continuous function of bounded variation in the 
cell (a, b) is summable over the cell , the points at which it is infinite being 
neglected. 

If D (x {1) , x {2) ) be one of the special derivatives, let e n be the measurable 
set of points such that nh s | D (x (1) , x (2) ) | < (n -f 1 ) h, where h is a 
positive number and n is a positive or negative integer, or zero. Let 
n ly n 2 , ... be the values of the n y for which rn (e n ) > 0 ; and consider first 
the sets e ni ,e n2 , ... e np . Let € be a positive number less than all the 

i “P 

numbers m ( e n ), m (e n ), ... m (e n ); let P 2 | m t |, and let 77 = e/P. 

v t-i 

By a corollary to Vitali’s theorem (§ 136 1 ) a finite set of squares, C lt no 
two of which have a point in common, exists that enclose a part of e ni , 
of measure greater than m ( e ni ) — 77, such that the measure of C x is 
< rn (e ni ). The points of e ^ that am exterior to the squares of C l form a 
set of measure > m (e n2 ) — 77 ; and therefore a finite set of squares (7 2 , no 
two of which have a point in common, exists, all exterior to C ly which 
encloses a part of e na , of measure > m (e n2 ) — 277 , and such that 

rn (C 2 ) < m (e nt ) - 77 . 

We proceed in this manner to obtain finite sets of squares (7 3 , (7 4 , ... C v . 


Let 0/ denote the set of squares each of which is interior to one of the 
squares of C t , and such that, if (a (1) , a (2) ), (£ (1) , are its corners, 


a( fl d) | q<a)) 


F 


(a* (1 \ x< 2 >) 


(/3 (1) - a (1> ) 2 


— njx 


< h. 


The measure of the inner limiting set, corresponding to Of is greater than 
m (e n J — ltj, and less than m (e„ t ) — (1 — 1 ) 77 . Therefore an enumerable 
set C'\ of squares exists, no two of which have a point in common, all 
of which belong to <?/, and such that their total measure is > m (e n f — vr), 
and < rn (e nt ) — (t — 1 ) 77 . 


A finite set of squares belonging to C" exists, whose total measure* 
has a sum s t , between m (e n ) — 177 and m (e n ) — (1 — 1 ) 77 . Let these 
squares be denoted by a[y } ; ftf), where y = 1 , 2 , 3, ... A t ; and 
we consider these sets of squares, for all the values 1, 2, 3, ... p, of 1 . 


No two of the squares of the complete finite set have a point in common. 


If 


where I 6 iy I < 1 


v ly * ly > 


we have 


a:, 1 ')* 


nji -h 6 iy h y 


S A'%’ 7 2 ; F (*<» z< 2 >) = nM + OM, 

y “i ^ a tv * °iv 1 
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where | ft | < 1. It follows that 


i -p i-p 

h 2 n t s , -f - h 2, 8 t s t . 

L~1 l»l 


Since 


we have 


* ~p 

2 6 U 

i-i 


<m(A), the measure of the cell (a (1) ,a (2) ; 6 (1) ,ft (2) ),and also 
*• -p , 


L-P C-P 

2 7i t s t — 2 n t rn (e n ) 

-1 t-i 


< 7} 2 | m, | = P 77 , 


1 -Py-A t /d( 1 ) o(2K »-P 

2 2 A 7/ 7o/ JP (x (1 >, x< 2 >) - h 2 w.m (e„ t ) + ShPrj + Qhm (A), 


= l r l (a; 


L (l) „( 2 ) 


where | 8 | < 1, |0|< 1. 

Since the numerical value of the expression on the left-hand side is not 
greater than K , and rjP = £, where c is arbitrary, it follows that 

■ h 2 n t m (e n ) & K + hm (A). 

1 

Since this holds good for every integer p, it follows (see § 388) that 
DF ( x ) is summable over A. 

419 s . It will next be shewn that: 

If F (x (1) , x l2) ) is absolutely continuous in A, then the integral 
DF (*<”, x^) d (*«>, xW) = A <%>’ a(*)) F (*), 

.'(a<‘), o<*>) 

where DF (x {1) , x i2) ) is any one of the special derivatives of F (x {1) , x {2) ). 

This theorem and the preceding one were given* by Vitali for the case 
of a function of one variable. The proofs are applicable, without essential 
change, to the case of functions defined in a cell in any number of 
dimensions. 

If £ be an arbitrarily chosen positive number, since F (x (1} , x {2) ) is 
absolutely continuous, the sum of the moduli of the increments 

A F (x z< 2 >), 

in every set of rectangles, the total measure of which is less than some 
number rj , dependent on £, is < £. Using the notation employed in the 
proof of the last theorem, p can be chosen so large and e so small that 

i -p i —p i -p 

2 s t - 2 m (e ) < and m (A) - 2 m {e ) < \y. 

i-i i-i i-i 

The points of A that are not interior to some square of the set 

(«">,«!*>; p ( y, 

where y = 1, 2, 3, ... A ( , i = 1, 2, 3, ... p, form a finite set of rectangles 

* Aiti d. Torino , vol. xmi (1908), p. 238. The extension to the case of two or more variables 
is indicated, p. 244. 
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where r = 1, 2, 3 , q, such that 

>> I 

< £• 


Since 


r ~« jfr 1 ’’ fr 2> ) 
r 1 (at'KaW) 




- p *"*• t (ftfK 


r-Q (fid) fl( 2 >) 

A (&<*), 6 <*>) , w , 2U 
~ A(a< l >, a( 2 >) ^ (•£ > )> 


we have 


AgS!:S3^(* (i) , * (2) ) 


7^ 2 n t m (e n ) 

i-i 


< £ -f eh -f hm (A). 


The number 7/ can be chosen so small, and the number p so large that 

r 6 < 3 >) I 

DF (x^,x<v)d(x {1) ,xM) < C; 


( c b.) - r 

J («<*>, a< 2 >) 

also h may be taken so small that h (e -f m (A)) 


t-p 

h 2 n,m i 

t-i 


A|SaF«»u»: 


-I 


< 6 <»\ &<»>) 


£. We have now 


< 3£: 


AgSl;2S|F (*'»*'*>). 


DF (x« l >, *12)) d (*<!>, *( 2 )) 

J (at*), a<*)) 

Since £ is arbitrarily small, we have 
r( 6 <*>, b<*>) 

(x™, x (2) ) d (x' 1 ', x < 2 >) 

As 6 (1) , 6 (2) may be replaced by x (1) , x (2) , corresponding to any point in A, 
the general theorem has been established. 

Since the integral, over any cell contained in A, of the difference of 
any pair of the special derivatives of an absolutely continuous function 
is zero, it follows (see § 394) that the two special derivatives are equal 
and finite almost everywhere in A. It then follows that: 

A function F (x (1) , x {2) ), absolutely continuous in the cell A, has a finite 
special differential coefficient almost everywhere in A. 

It has been shewn in § 416 that the indefinite integral 

*<*>) 

f{x"\x™)d(x (l \xW), 

J («<*>, a< 2 >) 

of a function f (x {1) , x {2) ) that is summable in a cell (a, b ), is absolutely 
continuous in the cell. The converse theorem that an absolutely con¬ 
tinuous function is the indefinite integral of one of its special derivatives 
has been proved above. The two results combined give the theorem that: 

The necessary and sufficient condition that a function of two variables , 
defined in a cell A, should be the indefinite integral of some function , sum¬ 
mable in the cell, is that it should be absolutely continuous in the cell . 

The proof given above is applicable, without essential change, to the 
case of a function of any number of variables. Another very interesting 
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proof of the theorem, dependent upon the theory of functions of sets of 
points, has been given* by de la Vallee Poussin. 


INTEGRATION BY PARTS FOR THE //-INTEGRAL 

420. Let the two //-integrals j U (#) dx, f V (x) dx be denoted by 

u (x), v (x) respectively. The functions u (x), v (x) are absolutely con¬ 
tinuous, and of bounded variation in the linear interval (a, b) ; moreover, 
in accordance with the last theorem in § 400, the product uv is an inde¬ 
finite integral in (a, b). In accordance with the theorem established in 
§ 405, u (x) and v (x) have differential coefficients equal to V (#), V (x) 
respectively, almost everywhere in (a, b). Let E denote the set of points 

at which this is the case; then, since % U , are summable in E, u ^, v^T- 

dx dx dx dx 

are also summable in E , since u and v are bounded. We now have 


f d ( yv ) i [ dn , f dv . 

, dx — v y - dx a . dx; 

1(E) dx ] (/,;) dx ' dx 


and since m (E) 
this equation 


a, and uv is an indefinite integral, we may write 


uv 


b [ b du . 

~ v , dx + 

a la dx 


[ b dv , 


and this is equivalent to 
rb 


j U (x) jj V (x) dot^ dx — J (J (x) dx I V (x) dx ~ J V (x) jj U (x) dx 
which is the general form for integration by parts, for //-integrals. 


dx 


MEAN VALUE THEOREMS 

421. It is frequently of importance to be able to assign upper and 
lower limits between which the value of a definite integral lies, in cases 
where the exact determination of the value is not required. Such estimates 
of the value of a definite integral may frequently be made by the em¬ 
ployment of theorems known as mean value theorems; the most im¬ 
portant of these will be here investigated. 

If / (x) be a bounded function, summable in the measurable set of 
points E, we have, denoting by U and L the upper, and the lower, boundary 
of / (x) in E, r 

Lrn ( E) < f (a;) dx Um (E ); 

•' (E) 

and therefore f (x) dx — Mm (E), where M is some number which 
1(E)' 

satisfies the condition U ^ M £ L. 

This holds good whatever be the number of dimensions of the set E. 


InUgralts de Lebesgue (1916), p. 73. 
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In case / (#) is a continuous function in the linear interval (a, h), there 
must be some point in (a, b) at which / (x) = M. If that point be denoted 
b y a -f 6 (b — a), we obtain the following theorem: 

If/ (x) be continuous in the linear interval (a, b), then 

r b 

f (x) dx — (b — a)/{a -f 6 (b — a)}, 

J a 

where 6 is some number such that 0 s. 6 ^ 1 . 


Next, let / (#) and </> (x) be summable in the measurable set E, and 
suppose / (x) is bounded in E, and </> (x) everywhere ^ 0 in E. We find 
immediately from the definition of the integral of/ (x) </> (.r) in JEr, that 

L [ <f> (x) dx •' (* f (x) <f> (#) dx ■ U f 6 (x) dx, 

J (E) J (E) J(E) ' 

where (J and L are the upper, and the lower, boundary of / (x) in E. 


It follows at once that 


[ / (x) <f> (x) dx = M x [ <f> (x) dx, 

J (E) J(E) 

where M l is a number such that TJ i- M y ^ L. 

In case/ (x) be a function that is continuous in the linear interval (a, b), 
we obtain the following theorem: 

If f (x) be continuous in the linear interval (a, b), and </> (x) be a sum¬ 
mable function that is 0, 

[ f (x) (f> (a?) dx - / {a + d l (b - a)} [ (f> (x) dx, 

•J a J a 

id here 6 X is some number such that 0 ^ 0 X 1 . 

This theorem, including also the more general case in which the inte¬ 
gration is over any measurable set E in which / (x) is continuous, is known 
as the First Mean Value Theorem of the Integral Calculus. 

An extension to the case in which <f> (x) is not necessarily everywhere 
of the same sign, but has a finite lower boundary, is obtained by applying 
the theorem to <f> (x) 4- C, where C is such that <f> (x) -f C s 0 in (a, b), or 
in the set E. 


For the case of a function of two variables, where / (x {l) , x m ) is con¬ 
tinuous in the cell (a (1) , a (2) ; b {1) , b {2) ), and (f> (x (1) , x (2) ) is summable, and 
^ 0 , in that cell, the theorem may be written in the form 

r(b^\ 6 <*)) 

/ (x™, X {2) ) <f> (x™, X< 2 >) d {x™, X< 2 >) 

J (a* 1 ), a<*>) 

r(fe(*>. 

= /{a<*> + e x (6 (1) - a (1) ), a< 2 > + e 2 (b l2) - <z< 2 >)} 6 (x"\x (2) )d(x"\x™) y 

J(ttC), a<*)) 

where d lf d 2 satisfy the conditions 0 ^ £ 1 , 0 £ 0 Z s; 1 . 
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422. If the bounded f unction f \ x) be monotone and non-increasing in the 
linear interval (a, b), and everywhere a 0, and. if <i> (x) be summable (whether 
bounded or not) in (a, b), then 

[ f (x)4> (x) dx =■= / (a) I <f> (x) dx, 

J a J a 

where £ is some number such that a £ :» b. 

Also, if f (x) be monotone and non-diminishing in (a, b), and everywhere 
S 0, then , 6 r b 

/ / (*) (*) dx —f (b) | f ( x ) dx, 

J a J £ 

where £ is some number such that a ^ £ £ b. 

This theorem was first given* by Bonnet, for the case in which <f> (x) 
has an i?-integral, and was applied by him to the theory of Fourier’s series. 

Another form of the theorem was obtained by Weierstrass, and also 
by Du Bois-Reymondf, for the case in which <f> (x) has an /^-integral, and 
it is generally known as the Second Mean Value Theorem. When general¬ 
ized, so that <f> (x) is only restricted to be summable in (a, b), the theorem 
may be stated as follows: 

If f (x) be monotone and bounded in the linear interval (a, 6), and if 
<f> (x) be summable (bounded or unbounded) in that interval , then 

f f(x)4> (x) dx =f (a) J <f> (x)dx +f ( b) J <f> (x) dx, 

where tj is some point in (a, b). 

The theorem in this form is deducible immediately from Bonnet’s 
theorem, by writing f (x) — / (6), or / (x) — / (a), in the two cases, instead 
of / (x). Bonnet’s theorem is however not immediately deducible J from 
the second theorem. 

rb 

It is clear that, since the value of / (x) <f> (x) dx is unaltered by 

•' a 

changing the values of f (a) and /(6), we may, in the above statement, 
take, instead of f(a),f (b), any two numbers§ A, B which are such that 
the function 0 (x), defined by iff (a) = A, \p (b) = B, ifj (x) = f (z), for 
a < x< b y is monotone. We have thus the generalized form of the theorem: 

f / (z) <f> (x) dx = A I <f> (x) dx + B f <f> (x) dx , 

J a • a J £ 

where A £ / (a -f 0), B £ / (6 — 0), if f (x) is non-diminishing , and 

Azf(a + 0),Bzf(b-0), 

if f (x) is non-increasing. 

* Mim. Acad. Belg. vol. xxm (1850), p. 8; also Liouville's Journal , vol. xiv (1849), p. 249. 
t Crelle's Joumai, vol. lxix (1869), p. 81. 

| This was pointed out by Pringaheim, Mllnch. JBer. vol. xxx (1900), p. 210, where an 
account of various proofs of the theorem is given. 

§ Du Bois-Reymond, SchlomilcKa Ztitschr. vol. xx (1875), Hist. Lit . Abtg. p. 126. 
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The value of £ depends in general upon the chosen values of A and B. 
In this generalized form, the theorem includes Bonnet’s theorem as a 
particular case. For we may take A = 0, B = f (6), if / (x) is positive and 
non-diminishing, or A — / (a), B = 0, in case / (x) is positive and non¬ 
increasing. 

Various proofs* of the theorem of Weierstrass and Du Bois-Reymondf 
have been published. In these proofs the function <f> (x) has usually been 
restricted to change its sign only a finite number of times, and sometimes 
to be differentiable; but a proof fi^ee from the former restriction was given 
by Du Bois-Reymond. A proof has been given by PringsheimJ in which 
<f> (x) is not restricted to be a bounded function, but may be any function 
such as possesses an absolutely convergent integral, or in certain cases it 
may have an integral that is not absolutely convergent. 

423. The following proof of the second moan value theorem, in its 
general form, in which the only restriction imposed upon the function 
<f> (x) is that it is to be summable, whether bounded or unbounded, was 
given§ by Hobson. 

Let (f> (:r) be a function which, whether it be bounded or not, is sum¬ 
mable in the interval (a, b). Let f (x) be monotone and non-increasing in 
the interval, and suppose it to have no negative values. 

Let € t be an arbitrarily chosen positive number, less than 

/(a + 0) —/(* — 0); 

and let the function f r (x) be defined for the interval (a, b) as follows: 

An interval (a, x x ) can be so determined that / (a -f 0) — f (x) < e r , 
for a s x< x lf and so that / (a + 0) — / (x x ) s c r . In case x x is a point of 
continuity of / (x), we shall have f (a + 0) — / ( x x ) = c r , but this will not 
be the case if x x is a point of discontinuity. Next, an interval (x lf x 2 ) can 
be so determined that f (x x + 0) — / (x) < e r , for x x g x< x 2 , and that 
f (x x -f 0) — f (x 2 ) ^ €>. Proceeding in this manner to determine intervals 
(x 2 , x 3 ), ( x 8 , x 4 ), , for some finite value of n, not exceeding 

{/(a + 0) —/(6 — 0)}/c r , 

we must have 

/ (Zn -1 + 0) -/ (x) < e r , for x n £ x < b; 
we then take x n to coincide with 6. 

Let f r (#) =f(a -f 0), for a s x< x x ; let f r (x) =f(x 1 -\- 0), for x x zx< x 2 ; 
and in general let/ r (x) = f (x s + 0), for x 8 ^ x< x g+l . The function f r ( x ), 

* For example, by Hankel, Schldmilch's Zeitschr. vol. xiv; by Meyer, Math. Annalen , vol. vi 
(1873), p. 313; by C. Neumann, Kreis- Kugel - und Cylinderfunctionen , Leipzig, 1881, p. 28; by 
Hdlder, Oottinger Anzeiger, 1894, p. 519; by Kowalewski, Math. Annalen , vol. lx (1905), p. 151. 
f CreUe's Journal , vol. lxxix (1875), p. 42. See also Kronecker’s Vorlesungen , vol. i. 
t Munch. Ber . vol. xxx (1900), p. 218. 

§ Proc. Lond. Math. Soc. (2), vol. vn (1909), p. 14. 
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so defined, has only a finite number of values in the interval (a, b ); it is 
monotone and non-increasing, and is s 0 . Moreover, we have 

0 — fr (X) -/(*)< 

for every value of x except for the values a, x r , x 2 , ... x n _ 1} b, at which it 
is not necessarily the case. 

We have now 

f f r (x) (f> (x) dx = / (a ~f- 0) 1* <f> (x) dx + / (x x 4- 0) f </> (x) dx 4- ... 

J a Jfl JXi 

+ f(*n-l + 0 ) f <f>(x)dx. 

J JTft-l 

Denoting </> (x) dx by F (x), we have 

J a 

f fr (x) <f> (x) dx - {/ (a + 0 ) - f(xj ■! 0 )} 7 1 (a:,) 

+ {f(x 1 + 0)~f(x 1 + 0)}F(x i ) +... 

H“ {/(^n -2 + 6 ) — f (x n 1 + 0 )} (x n x ) -f / (x n x -f 0 ) .F (b). 

Since /(a -f 0 ) -/fo + 0 ), /(*, + 0 ) - / (x 2 4 - 0 ), ...J (x n + 0 ) are 
all ^ 0 , the expression on the right-hand side will be unaltered in value if 

F (a^), F ( x 2 ) y . F (x n _ 1 ) y F (b) be all replaced by a definite number N 

which lies between the greatest and the least of these numbers. The ex¬ 
pression then becomes Nf (a + 0 ). Moreover, since F (x) is continuous in 
the interval (a, b), some value f r , of x, exists such that N = F (£ r ). It has 
therefore been proved that 

f fr (*) <f> (x) dx = / (a + 0 ) ( <f> (x) dx, 

J a J a 

where f r is some point in the interval (a, b). 

Also I [ f r (x) (f> (x) dx - [ / (x) <f> (x) dx I < e r [ \ <f> {x) | dx. 

It follows that 

| J a f (*) <t> (*) dx - / (a + 0 ) J <f> (x) dx (< ij r , 
where rj r = e r f | <f> (x) | dx. 

J a 

Let r — 1 , 2 , 3, ..., where e 1? e 2 , e 3 , ... form a sequence which converges 
to zero; then also rj lt tj 2 , rj 8 , ... form a sequence which converges to zero. 
The set of points £ 3 * ••• has a t least one limiting point; and it is 

clear that the sequence {e r } may be so chosen by neglecting, if necessary, 
a part, that the sequence {£.} has a single limiting point f. 

We have then 

If f(x)</>(x)dx-f(a + 0 ) [*<f>(x)dx < rj r + / (a + 0 ) I f <f> (x) dx 

l J a J a I J 
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If £ be an arbitrarily chosen positive number, a value r x , of r, may be 
so chosen that rj r < |£, and / (a -}- 0) | J <f> (#) dx 
Then we have 


< provided r 


£; 


f / (a:) 6 (a:) dx - f (a -}- 0) [ (j> (x) dx 
• a J a 

and therefore, since £ is arbitrarily small, we must have 

| f (x)<f> (x) dx = / (a + 0) [ f <f> (x) dx . 

J a .’a 


.(i). 


In a precisely similar manner, when / ( x) is non-diminishing in (a, b), 
and is never negative, it may be shewn that 

I f( x )4> (*) dx - / (6 - 0) [ <(> {x) dx .(2), 

J (1 Jy 

where rj is some point in (a, b). 

In case / (a) — / (a f 0), f (b) — f(b — 0), these results are equivalent to 
Bonnet’s form of the second mean value theorem. 


Next, let / (x) be only restricted to bo bounded and monotone in (a, b ), 
but be unrestricted as regards sign. In case / (x) is non-increasing, we may 
apply the theorem (1) to the function / (x) - f (b — 0), and we thus have 

f / (x) cf> (x) dx = / (a -f 0) f <f> (x) dx + / (b — 0) [ </> (x) dx. 

In case / (x) is non-diminishing in (a, b), we may apply the theorem (2) 
to the function f (x) ~ f (a -j- 0), and wo then have 

[ f (x) cf> (x) dx = / (a -f 0) f (f) (x) dx -f / (b — 0) f (a?) da\ 

J a . a ' y 

It has thus been shewn that, if / (x) be bounded and monotone in 
(a, b), and </> (a;) bo summable in the same interval, 

( (x) dx = / (a + 0) ( <f> (x) dx -f / (b - 0) [ <f> {x) dx , 

Ja Ja JX 

where X is some point in the interval (a, b). 

In order to obtain the more general form of this theorem, let A and B 
bo numbers such that A ^ f (a -f 0), B s f (b — 0), when / (x) is non¬ 
increasing; or else let i s'/(a -f- 0), B ^ / (b — 0), when f (x) is non¬ 
diminishing. 

Consider an interval (a — A, b + A) which contains (a, b) in its interior, 
and let / (x) = A , for a — A £ x < a ; and / (a:) = B, for 6 < a; ^ 6 -f- A; the 
function / (x) being already defined for a ^ x ^ b. Let <j> (x) = 0, for 
a — \ s x < a, and for b < x ^ b -f A, where <£ (a;) has already been defined 
for a s x s 6. Now apply the theorem above established, for the interval 
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(a — A, 6 + A), for which / (a — A + 0 ) = A, and / (6 + A — 0 ) — B. We 
then have 

[ f (x) <f> ( x ) dx = A ( <f> (x) dx + Tt [ <f> (x) dx, 

J a J a J X 

where X is some point in (a — A, b 4 - A), and which clearly lies in (a, b). 

The general form of the second mean value theorem may now be 
stated as follows: 

If f (x) be bounded and monotone in the interval (a, b), and if <j> (x) y 
whether bounded or not , be summable in (a, b ), then , if A, B be numbers 
such that a a / (a + 0 ), B f (b - 0 ), 

or Aaf(a + 0), B 2/(6 - 0), 

according as f (a:) is non-increasing , or non-diminishing , in (a, b), 

[ / (x) <f> (x) dx = A f <f> (x) dx 4 - B I <f> (x) dx , 

J a -a J X 

where X is some number in (a, b). The number X will depend upon the values 
of A and B. In particular , we may have A = / (a), B = / (b) } or also 
A = f (a A 0), B — f (b — 0). 

In case the function / (x) is never negative in (a, 6 ), we may take Z? = 0 , 
if / (#) is a non-increasing function; and we may take A = 0, if / (x) is a 
non-diminishing function. We obtain thus the following generalized form 
of Bonnet’s theorem: 

If f (x) be a bounded monotone function, never negative in (a, b), and if 
<f> (x) be any function summable in (a, 6 ), then 


[ f (x) <f> (x) dx = A i <f> (x) dx 

J a J a 


where A is any number ^ f (a + 0), and X is some number , dependent on A, 
in the interval (a, 6), provided f (#) is non-increasing. If f (x) is non-dimin¬ 
ishing, we have 


j f (x) </> (x) dx = B J <f> (x) dx 


where B is any number ^ / (b — 0), and X is some number in (a, 6), dependent 
on the value of B. In particular , we may take A = f (a), B = / (6), in the 
two cases . 

424. If the function / (a) be of bounded variation in the linear interval 

, for all pairs of 


I A ^ 


dx 


(a, b), and M be the upper boundary of 
values of a, /J in the interval (a, 6), we may make use of the fact that 

/ (*) = P (*) - N (x), 

where P (x), N (x) are monotone non-diminishing functions. 
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We have 

[ {P ( x) — P (a)} <f> (x) dx — {P (b) — P (a)} [ cf> (x ) dx 

■' a 1 £ 


with a similar equation for N (x). By subtraction, we have 

f if ( x ) - / («)} 4> ( x ) dx = i P (b) - p (<*)} I 4>(x)dx 
la J £ 

— {N (b) — N (a)} [ <j> (x) dx. 

Hence we have* the following result: 

[ (*) dx -f (a) f <j> (x) dx I < M . V a b f (x), 

la la 1 


V a b f(x) denoting the total variation in (a, b); where f (x) is of bounded 


<f> (x) dx 


, for all 


variation in ( a , b ), and M is the upper boundary of 
intervals (a, fi) contained in (a, b). 

This result is of considerable use in estimating the value of the integral 
of the product of a function of bounded variation and a summable function. 


425. Let cf> (x) denote a function of any number of variables, that is 
summable in a bounded measurable set E\ and let / (x) denote a function 
that is bounded and summable in E , and is everywhere £ 0. If U and L 
denote the upper, and the lower, boundary of f(x), in E , let a 0 , a x , a 2 , ... a n 
be a set of decreasing numbers, where a 0 = A, a number s U, and L £ a n ^ 0. 
Let e r „x denote that set of points for which a r _ x ^ / (x) > a T , for 

r = 1, 2, 3, ... 7i. 


Let f (m) (x) — a r _ ls in the set e r _ lt for each value of r; we have then 
I f (m) (x) <f> (x) dx = a 0 cf> (#) dx-ha 1 </> (x) dx -b ... + a n _ x I <f> (x) dx 

1 (E) 1 (Co) J (Cl) j (en-i) 

~ (a 0 — a x ) (x) dx -f (a x — a 2 ) <f> (x) dx 

He •) 1 (e 0 + c,) 

+ ... + («„_ 2 - a„-i) I <j> ( x ) dx 

- (C* + Ci +... + Cn- # ) 

4- <f> (x) dx , 

- (Co+ei4 • •• + «*—j) 

and therefore f im) (x) <f> (x) dx = AM m , where M m is some number 

J(E) 

between the greatest and the least of the numbers I <f> (x) dx ; E a denoting 

1 ( E«) 

eo + e x + ... + e,_ x , 

for s = 1, 2, ... n. The set E a is that set for which A s/ (7n) (x) > a s . 

The numbers a 0 ,a l .a 2 , ... o n can be so chosen that the greatest of the 


* See Lebeegue, Annates de Toulouse (3), vol. I (1909), p. 37. 
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differences of the successive numbers is < e m , where {€ m } is a sequence of 
decreasing numbers converging to zero. We have now 

f f {m) ( x ) <f> ( x ) dx ~ f f (x) c/> (x) dx < €m ( | <f> (x) | dx, 

J (E) J (E) J (E) 

which converges to zero, as ra ~ oo. The numbers AM m consequently 

converge to I f (x) <f> (x) dx , asm - oo; and thus f (x) (a:) dx **. AM , 

J(E) HE) 

where M is between the greatest and the least of the numbers <f> ( x ) dx, 

He) 

and e belongs to a family of sets such that in any one of them / (a:) exceeds 
a certain number between A and zero. By the theorem of § 417, there 

exists a set E x of the family, for which / (f> (x) dx — M. 

J ast) 

The following theorem* has thus been established: 

If E be a set of points which is measurable and bounded,, and in any 
number of dimensions , (x), f (x) are summable in E, f (x) being bounded 
therein, and > 0, then 

I f (a;) cf> (x) dx — A f cf> (#) dx , 

HE) J (Ei) 

where A is any assigned number > the upper boundary of f (x) in E , and 
E\ is a part of E dependent on the value of A , and belonging to a family of 
sets of increasing measure , such that for any one of them f (x) is greater than 
some fixed number. 

This theorem may be regarded as the complete generalization of 
Bonnet’s theorem, for any number of dimensions, and for integration 
through any measurable and bounded set of points. 

If / (x) be no longer restricted to be > 0, we may apply the above 
theorem to the function / (x) — B, where B is any number s L , the lower 
boundary of / (a;) in E. 

The theorem then takes the form: 

[ / (x) (x) dx = A' I cf) (x) dx + B' [ cf> (x) dx, 

HE) J (Ei) HE Ei) 

where A' and B' are such that A' TJ, B' < L; (J, L being the upper, and 
the lower, boundary of f (x) in E, and E t is a part of E such that f (x) is 
therein greater than some number dependent on A' and B'. 

If we take the set E to consist of the points of a linear interval (a, b), 
and / (a;) to be monotone, non-increasing, and £ 0, the set E x will be the 

set of points of some interval (a, £), and thus M= <f> (x) dx, for some 

J a 

value of $ in (a, b). Similarly, iff (x) be ^ 0, and monotone, non-decreasing, 
we must take for E x some interval (£', b). Thus the first theorem given 
above reduces to Bonnet’s theorem. 


* See Lebesgue, Annales sc. de Vicolc normale (3), vol. xxvn (1910), p. 443. 
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426. It will now be shewn that, in case* the set E is the set of points 
in the plane rectangle (a (1) , a {2) ; 6 (1) , 6 (2) ), and when the function/ (# {1) , x (2) ) 
is quasi-monotone and non-negative, of any of the four types considered 
in § 255, the set E l , in § 425, may be replaced by the set of points in a 
plane rectangle with one comer at one of the comers of (a (1) , a {2) ; b n \ b {2) ). 

Let / ( x a) , x <2) ) be quasi-monotone, non-negative and non-diminishing 
with respect to (x il \ x i2) ), and also with respect to x m and x {2) . 

We have, from the theorem of § 448 1 

r (&<»>, b< 2 >) 

/ (x (1 >, x< 2 >) g (x (1) , x< 2 >) d (x">, x (2) ) 

- (a (1 \ « (2) ) 

,•(#><»), b<V) 

f(xM\x< 2 ')dG(x"\x {2) ), 

J («<*>, a (2) ) 

where G (x (1) , x {2) ) denotes 

ru<'>, jt^) 

x (2) )d (x (1 >, x (2) ); 

a< 2 >) 

and this may be transformed by means of the formula (Bj) of § 448, 
changing the two functions /, g , there employed, into G , /. The three 
integrals in this formula are all less than, or equal to, the values which 
would be obtained by substituting for the integrands the value U of the 
upper boundary of A^li)* G (x n \ x {2) ), and their values are greater than, 
or equal to, those which would be obtained by substituting for the inte¬ 
grands the lower boundary L of the same expression. Hence the sum of 
the three integrals is not greater than U {/ (6 m , b {2) ) — / (a (1 >, a (2) )}, and not 
less than L {/ (6< 1 >, 5< 2) ) - / (o«>, a (2) )}. 

Also the term / (a (1) , a <2) ) A^a)' «(*)) 6r (x a) , x <2) ) 

is g 17/ (a a> , a (2) ), and is £ Lf (a (1) , a (2) ). It now follows that 
;<&<»>, 6< a >) 

/ (x"\ z< 2 >) dG (xW, x< 2 ') 

J (a (1 K a< 2 >) 

is equal to M {/ (6 (1) , 6 (2) ) — / (a (1) , a (2) )} -f Mf (a (1) , a (2) ), 

or to Mf (b (1) , 5 (2) ), where AT is some number in the interval (L, U). 

Since *<'!!> G (x' 1 ', *<*>) 


assumes all values between L and U, it follows that 
M = A^i)) fli!) G (x ll \ x i2) ) 
for some point f (2) ) in the rectangle. 

We thus have the formula 

f(b (1 K b<*>) 

/ (x<», x < 2 >) g (# (1) , x™) d (x™, x < 2 >) 


J <a< l >, o< 4) ) 


/•(fed), &<«)) 

= /(6 (1) , b {2) ) g(x^\x (2) )d 

J (fO>, *<»)) 


{x {l) , x l: 


* See W. H. Young, Proc. Lond. Math . $oc. (2), vol. xvi (1916), p. 273, where the case of 
quasi-monotone functions of any number of variables is treated. 
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where (f (1) , £ (2> ) is some point in the closed rectangle (a (1) , a (2) ; 6 (1) , b (2) ). 
It is clear that in this formula / (6 (1) , b {2) ) may be replaced by any number 
A which is ^ / (6 U) , b {2) ), for this only involves a change in value of the 
function / at the point (b (1) , b (2) ), the condition that the function is quasi¬ 
monotone non-diminishing being conserved. 

In case / (x (1) , x {2) ) is positive, quasi-monotone and (non-diminishing, of 
one of the other types defined in § 255, we obtain in a similar manner, 
by employing the formulae (B 2 ), (B 3 ), (B 4 ) of § 448 
r (&<!>, 6< 2 >) 

/ (^ (1) , x {2) ) g (z (1) , x (2) ) d {x (1 \ x {2) ) 


r(( ( *>. £<*>) 

= f (a (1) , a < 2 >) | g (xM\ x < 2 >) d (x (1 >, z< 2 >) .(2), 


where / (x (1) i x {2) ) is non-increasing with respect to x a) , and with respect 
to x (2) . 

Also, when f (x {1) , x {2) ) is non-diminishing with respect to x ll) , and 
non-increasing with respect to x (2) , we have 
r(¥ l \ &<*>) 

I / (# (1) > x {2) ) g (a; a) , x (2) ) d (x (1) , x (2) ) 

J («<*>, a<*>) 

f<*>) 

= / (b (1) , a {2) ) g (x"\ x< 2 >) d (x"\ x {2) ) .(3). 

J (£<»>, a<*>) 

When f(x li) ,x {2) ) is non-increasing with respect to x (l \ and non¬ 
diminishing with respect to x (2) , we have 
&<*>) 

/ (x< l \ x< 2) ) g (x {1) } x (2) ) d (x (1) , x (2) ) 

J («<*>, «<*>) 

= / (a (1) , b {2) ) g (x' 1 ', x (2) ) d (x' l \ x (2) ) .(4). 

J (a<*>, £<*>) 

In case/ (x (1) , x (2) ) is £ 0 but is non-increasing with respect to (x a) , x l2) ), 
the above formulae are applicable, according to the type of the function 
in regard to monotony with respect to the separate variables x u) , x {2) . 


REPEATED LEBESGUE INTEGRALS 

427. The L-integral of a function of any number of variables has been 
defined as the limit of a sequence of simple sums; but it is convenient to 
represent such an integral as a set of repeated limits of sums with respect 
to the separate variables, and thus to express the integral as a repeated 
integral with respect to the separate variables, taken in any order. As 
an ly-integral with a non-negative integrand is the measure of a set of 
points in space of dimensions one greater than the dimensions of the space 
in which the function is defined, we first shew that the measure of a set 
of points, measurable in p + 1 dimensions, can be exhibited as the single 
.//-integral of the measure of the p-dimensional section of the set by planes 
perpendicular to one of the coordinate axes, or more generally as the 
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^-dimensional L-integral of the measure of a (p — q 4 * 1 )-dimensional 
section of the set. 

Let E denote a measurable and bounded set of points, in the (p 4- ])- 
dimensional space between the two planes x (1) = a (1) , x {1) = 6 (1) , perpen¬ 
dicular to the # ( 1 ) ~axis. Denote by E (x u> ) the ^-dimensional set which is 
the section of E by the plane corresponding to any fixed value of x a) in 
the interval (a (1) , b {1) ). 

The set E is contained in a set of closed cells A, of measure < m (E) f e, 
where € is an arbitrarily small positive number (see § 127). Giving to e 
the values in a sequence of decreasing numbers that converges to zero, 
we have a sequence {A t }, of sets of closed cells; and the sequence may be 
so chosen that each set contains the next. The inner limiting set of {AJ 
is a set E 1 which contains E , and is such that m (E x ) = m (E). 

Let A tl , A t2 , ... A tn , ... be the cells of the set A*; we have then 
m (A t ) — £ f m {A tn (x (1) )} dx {i) , 

n “1 J a 

where A in (a? (1) ) is the section of the cell A in by a plane perpendicular to 
the x ( 1 ) -axis, and the measure m {A tn (x (1) )} is the p-dimensional measure 
of A tn (x a) ). Since an L-integral is completely additive, we have 

m (A t ) - \ b m{ A t (x™)} dx™. 

J a 

In virtue of the theorem of § 398, we now have 

m (E) = m (E x ) = lim m (A t ) = f m {E x (x {l) )} dx n \ 
i~cc J a 

since lim m {A* (a; a) )} = m {E x (x a) )}. 

i^ao 

Again, the set E x — E , which is of measure zero, is contained in each 
set of a sequence of sets of cells {A/}, where A/ can be so determined as 
to be contained in the set A*. This sequence of cells has an inner limiting 
set E 2 which contains E x — E . 

We have now, as before, 

m (A/) = f m {A/ (# (1) )} dx {1) , 

J a 

and therefore, taking the limit of both sides of the equation, we have 
[ m {E 2 (x ( 1) )} dx a) = lim m (A/) = 0 . 

J a i—oo 

It follows that, for almost all values of x a) , m {E 2 (x (1) )} = 0; and since 
E 2 (# (1) ) contains the section of E x — E } m {E (# (1) )} exists, for almost all 
values of x {1) , and is equal to m {E x (# (1) )}, which exists everywhere, as 
E x (x {1) ) is the limit of a sequence of measurably sets. 
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We have now 


m 


(.E) — f m {E x (a; (1) )} dx {1) = I m {E (ar (1) )} dx {1) . 
J a J a 


The following theorem has now been established: 

If E be a bounded measurable set of points in (p + l)-dimensional space , 
its section E (x) is measurable , as a p-dimensional set , for almost all values 
of x. The function m {E (z)}, of x, defined for almost all values of x, is linearly 
measurable and its L-integral with respect to x is equal to m (E). The co¬ 
ordinate x may be any one of the p - hi coordinates which determine the 
points of E , 

Instead of taking x to be one of the coordinates x {1 \ x {2) , ... # (p+1) , it 
may be taken to be typical of a group (# (1) , a? (2) , ... x {Q) ), of these coordinates, 
where q has any value < p - hi. The section E (#) will then denote that 
(p 4 - 1 — g)-dimensional set which is the section of the set E obtained 
by giving x (1) ,x {2 \ ... x iQ) fixed values. This will then entail no essential 
modification in the above proof of the theorem ; and accordingly we 
obtain the following more general result: 


If E be a bounded measurable set of points in (p -f- l)-dimensional space , 
its section E (# (1) , x (2 \ ... x (Q) ), where q < p + 1 , is measurable as a set in 
(p 4 - 1 — q)-dimensional space , for almost all points (x (1) , x {2 \ ... x (0) ). The 
function m {E (x n) , x [2 \ ... a; (</) )} is measurable as a q-dimensional set , and 
its L-integral with respect to (x {1) , x (2) , ... x (</) ) is equal to rn (E). Instead of 
x n) , x {2) ,... x iQ) , we may take any qof thep-\- 1 coordinates x (1> , x (2) , ... x iv+i) . 

428. Let / (x {l) , x <2) , ... x (v) ) be any bounded function, defined in the 
cell (a a) , a (2) , ... a lp) ; b n) , b {2) , ... b ip) ), and measurable in that cell. We 
first assume that the function is at all points s 0 . The integral of the 
function may be regarded as the measure of a (p 4 - 1 )-dimensional set, 
defined in a (p + 1 )-dimensional cell (a (1) , a (2) , ... a (p+1) ; b {1) , 6 (2) , ... 6 (p+1) ), 
where a (p+1) is any number less than the lower boundary of the function 
in the cell, and 6 (p+1) is any number greater than its upper boundary. 


We may suppose the function to be defined to have the value zero at any 
point of the (p + 1 )-dimensional cell, for which it was not originally defined. 

We now have, from the foregoing theorem, 

/•(*><»>,&(», ... &<»»)> 


I («<»>, a<*>,... a<P>) 


for every value of q (< p). 


f(x"\ x < 2 >, ...x'*')d(x™, x i2) , ...x< p >) 

(6<ff+i>, #«+*),... fr<p>) 

/ (x (1) , s (2 >, ... x<*») 

(a<i+ l K «<*+*>,...a<*>) 

d (x (tf+1) , x (9+2) , ... # (p) )l d (x {1 \ x {2 \ 


r(fr< l >. fed),. 

I r 

J a<*>,. 

...«<*>) ( J 
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If we take q = 1 , we have 

[(b) rbl» ([(&*>, b<*>,...b<f») ) 

f(x)dx^~- ] f(x™,x™, ...x™)d(x™,x™, ... x™)[dx™. 

J (a) Ja< l > (J (a ( *>, a ( *>,.. a<*>) J 

/*(&) 

By repeated use of this result, we express I / (#) da; as a repeated 

J (a) 

integral , taken successively with respect to each coordinate; and it is clear 
that the order in which the integrations are taken is immaterial. 

In particular, we see that 


r (&<»>, fc< 2 >) 

/•&<»> 

f f&< 2 > ) 

il 

©J 

I < 

/ (a" (1) , x (2) ) dx {2) \ 


a<»> | 

J 


r&<*> | 

{ f b <*> ) 


J 

; f{x™ 9 x w )dx< l) \ 


a<*> 1 



dx<" 

dx (2) . 


If the function / (x (1) , ... x (p) ) is no longer restricted to be non-negative, 
it can be expressed as the difference of two measurable non-negative 
functions, and the general theorem may be applied to each of these 
functions. We have then the following result: 

If f ( x (1) , x {2) ) be defined in the cell (a U) , a l2) ; 6 (1) , b (2) ), and be bounded 
and measurable , the two repeated integrals 


rbw riit) rb (i) 

f(x {1 \x w )dx^dx {t \ f (x (1 >, x' 2) ) dx {2) dx"\ 

JaWJaW JaitoJaM 

[(b"\bW) 

both exist , and are equal to f (x (1) , x {2) ) d (# (1) , x {2) ). 

hat'KaW) 

[N l > fb<*> 

The form I f (x {1) , x (2) ) dx {1) dx i2) is employed to denote the re- 

.' a< l > J a(*) 


r&<‘> ( r 5 <*> \ 

peated integral « f (x {1 \ x {2) ) dx {2) [ dx {1 \ in which the integration 
J a* 1 ’ a< 2 > j 

is taken, first with respect to x (2) , and then with respect to x (l> . 


rbw 

It has been shewn that / (x (1) , x {2) ) dx {2) has a definite value for 

J «<*> 

almost all values of a: (1) . 

More generally we have the theorem that : 

The L-integral of a function of p variables , defined in a cell (a, b), and 
bounded and measurable in that cell , is equivalent to the repeated integral 
obtained by integrating the function successively with regard to the p-co~ 
ordinates , taken in any order. 


429. In order to extend the results of § 428 to the case of an unbounded 
function, it will be sufficient to consider the case in which the function is 
non-negative, since any summable function can be treated as the difference 
of two such functions. 

In the two-dimensional case, let / (x ll) , x (2) ) be a non-negative function, 
summable ovei the measurable set E. Let {k n } be a monotone sequence of 
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positive numbers without upper limit, and let /„ (x (1) , x ,2} ) = /(x (1) , x (2) ), 
when / (x (1) , x (2) ) s k n , and let/ n (x m , x (2) ) = k n , when f (x n) , x (2) ) > k„. 

We have then, from § 428, 

f /„ (x<», x< 2 >) d (x' 1 ’, x ,2 >) = J dx n) jf n (x' 1 ', x< 2 >) dx< 2 >. 

If [ / (r (1) , x i2) ) d (r (1) , x {2) ) exists, it is defined as 

'(E) 

lim [ f n (x ll) , x< 2 ') d (r< l >, r< 2 >), 

71—00 {K) 

which is equal to lim f dx a) j f n (x {1) , x {2) ) dx i2) . 

n~<x> J J 

Denoting J f n (x {1) , x {2) ) dx {2) by s n (r (1) ), we see that {«$„ (a; (1) )} is a monotone 

non-decreasing sequence, for each value of x {1) ; and on the assumption 

that [ f (x {1 \ x {2) ) d (x (1) , x {2) ) exists, we conclude that lim [ s n (x {1) ) dx a) 
J (E) 71-00 J 

exists, and has the same value as the integral. 

Applying the theorem of § 399, we see that {s n (z (1) )} converges for 
almost all points r (l) , and also that 

fs (x<») dx n) = [ f (z (1 >, x i2) ) d (x (l \ r< 2) ); 

J J (E) 

where s (x (1} ) = lim s n (x ll) ) = lim j* f n (r (1) , x {2) ) dx {2 K 

71—00 n— QO ' 

Applying again the theorem of § 399, we see that, for each value of 
x {1) for which lim f/ n (r (1) , x (2) ) dx {2) is finite, 

74 — 00 J 

[/ (x (1) , x (2) ) dx (2) = lim I f n ( x {1) , x {2) ) dx {2) . 

74 — 00 J 

Therefore j* dx (1) J f (r (1) , x {2) ) dx {2) 

exists, and is equal to 

f f (x (1> , x (2) ) d (x (1) , x (2) ). 

■’ (E) 

The complete result may be stated as follows: 

Iff(x {1) ,x {2) ) be any function, bounded or unbounded , that is integrable 
(L) over the measurable and bounded set E, then the repeated integrals 

I dx'" J f (*<« x< 2 >) <2x (2 >, J dx<*> f f (x* 1 ', x< 2 ») dx™ 

both exist , and hive the same value as the Ldntegral function over E. 

It is clear that this statement includes the case of a function of any 
number of variables, as x {1) and x {2) may each be regarded as typical of 
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a set of variables. The first proof of this result was given by Fubini*, 
who deduced the theorem from a definition due to Lebesgue, of the 
superior and inferior integrals of a function, which are not identical with 
Darboux’s upper and lower integrals, defined in § 331. The above proof 
of the theoremf was given by Hobson, who also gave the following 
extension : 

Iff (sc (1) , x {2) ) be measurable in the measurable and bounded set E , and if 
| dx (l) J | / (x u) , x (2) ) | dx {2) 

has a definite value , then / / ( x {1) , x (2) ) d (a; (1) , x (2) ) exists as an L-integraI t 

J (/?) 

and is consequently equal to the repeated integrals of f (x m , x {2) ). 

Let us consider the function J | / ( x (l) , x (2) ) | dx (2) and apply the last 

part of the theorem of § 399. Thus, if j s (x (1) ) dx n) exists, it follows that 

lim s n (# (1) ) dx (1) exists, and is equal to s (x (1> ) dx a) . 
n ~oo J J 

Also 

js n (x ll) ) dx {1) = J dx n) I \f n (x ll) , x i2) ) | dx (2) - J ^ | /„ {x"\ x™) | d (x™, x {2) ); 
where s n (x (1) ) denotes j | f n (x (1) , x {2) ) | dx {2) . 

If we assume that J dx ll) j | / (x (1) , x i2) ) | dx (2) exists, we see that 
f I f(xM\x {2 ') ! d (x (1 >, a:< 2) ), 

■ (E) 

which is defined as 

lim f f n ( x {1) , x (2) ) d (x {1) , x {2) ), or as lim f s n (x (1> ) dx (1) , 

n~oo J (E) n~<x> J 

exists, and is equal to js (x (1) ) dx (1) , or |* dx (1) f | / (x (1) , x {2) ) | dx (2) . 


It then follows that / ( x (1) , x (2} ) is summable in E . 

In the case in which / (x (1) , # (2) ) is bounded in E, the theorem is an 
immediate corollary from the preceding theorem. 

It may happen that the repeated integrals off (x {1) , x (2) ) exist, but that 

J dx ll) J | / ( x (1) , x (2) ) | dx (2) does not exist. In that case the L-integral of 

f(x a) ,x {2) ) does not exist, and the two repeated integrals of f (x {1) , x {2) ) 
may have unequal values. 


* Rend. del. Real. Accad. dei Lincei (5), vol. xvi (1910), p. 608. 
f Proc. Lond. Math. Roc. (2), vol. vm (1909), p. 22. 
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It has been proved* by Sierpinski that the existence, for a function 
f (x {1) , x {2) ), bounded in the square (0, 0; 1 , 1), of the integrals 

f f (x {1) , x {2) ) dx a) , for 0 s x {2) £ 1 , 

Jo 

and [ f (x (1> , x {2) ) dx l2) , for 0 £ s 1, 

J o 

does not necessarily entail the existence of the repeated integral 

f 1 dx (1 > l 1 fix'", x (2) )dx™\ 

Jo Jo 

but his proof involves the assumption of the truth of Cantor's hypothesis 
that 2 K o =■ The hypothesis is not required in the case of an unbounded 
function. 


1. The repeated integrals 




EXAMPLES 


/:/: 




n (. 

. 


(i) x 2 — y 2 
(x 2 4 


dydx 


X 2 •— )/^ , 

both exist, but have unequal values, the function - „ not being summable in the 

M {x l + y 2 ) 2 

square (0, 0; 1, 1). 

2. Let / (x, y) = 


1 


(* " *) 3 


for 0 < y < | x — £ |; and let / (x, y) = 0, for y ^ \ x — £ |, 


and for y — 0. The integrals f f (x, y) dx, f f (x, y) dy exist as ft-integrals and therefore 
i j J 0 J 0 j j 

as L-integrals; also 1 dy I f (x, y) dx exists, but I dx I f (x, y) dy does not exist. This 

in io J 0 J 0 

example was given by Sierpiftski. 


A FUNDAMENTAL APPROXIMATION THEOREM 

430 . The following theorem affords the means of relating the L-integral 
of a summable function with a sequence of integrals of continuous func¬ 
tions, in such a way that various properties of the L-integral may be 
obtained by considering the special case in which the integrand is a con¬ 
tinuous function: 

If f / (x) be summable in a given interval , or cell , (a, b ), a continuous 
function <f> (x) can be constructed which satisfies the condition that 

[ I / (*) - 4> ( x ) I dx 

. a 

is less than an arbitrarily prescribed positive number . Moreover , m case 
f ( x ) ^ 0, in (a, 6), the function <f> (x) can be so determined that <f> (x) £ 0, in 
(a, b). 

The proof of this theorem may be exhibited as a process which has 
several stages. 

* Fundamenta Math. vol. i (1920), p. 142. 
t See Hobson, Proc. Land. Math. Soc. (2), vol. xn (1913), p. 156. 
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(1) . Consider the case in which / (x) = 1, in the cell, or interval, (a, j3) 
contained in (a, 6), and is elsewhere zero. 

Let the continuous function <f> (x) be defined to be equal to 1 in the cell 

(« (1) ,« (2) , ...), 

and to be equal to 1 — 0 on the boundary of the cell 

(a (1) - 0e, a< 2 > - 06, f 06, £< 2 > 4- 0e, ...), 

for each value of 0 such that 0 ^ 0 ^ 1; and outside the cell for which 
0=1, let <f> (x) — 0. We may disregard all points that are not in the 
fundamental cell (a, b). 

The function <f> (x) has values between 0 and 1 in the set of points 
between the two cells (a, /?) and (a — 6, ft 4- e); and it is continuous in 
(a, b). Since / (x) — <f> (x) vanishes in the cell (a, /3), we see that 

f | / (x) - </> (x) | dx 

J a 

is less than Ac, where A is a fixed number independent of e; and this is 
arbitrarily small. 

(2) . Let f (x) = 1, in each cell of a finite set A 1} A 2 , ... A n , of non¬ 
overlapping cells contained in (a, b), and let f (x) = 0 at all other points 
of (a, b). Let <f> r (x) be the continuous function, defined as in (1), such 
that, if/ r (x) = 1 in A r , and everywhere else = 0, 

( I fr (*) - 4>r (x) \dx< l _ e; 

also suppose <f> r (x) defined for r — 1, 2, 3, ... n. Let 

<t> (x) = 4>1 ( x) + <Aj (x) + ... + <f> n (x); 

we have then 

f | / (x) - 4> (*) I dx < £ f | fr (x) - 4>r (x) \dx<€. 

J a r - 1 J a 

The function <f> (x) is continuous, and satisfies the condition in the 
theorem for the particular function / (z). 

(3) . Let f (x) = 1 in a measurable set of points e, contained in (a, b) ; 
and let / (x) = 0 in C (e). 

A set of non-overlapping cells A 1 , A 2 , ... A m , ..., of total measure 
exceeding m (e) by less than £e, can be so determined that they contain 
all the points of e either within them or on their boundaries. An integer n 

<30 

can be so chosen that 2 m (A r ) < Jc, and that 

r ~n + l 

0 < 2 m (A r ) — m (e) < ^6. 

r -1 

Let f n (;x) be the function that has the value 1 in each of the cells 

A,, A 2 , ... A„, 
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and elsewhere the value zero. In virtue of (2) a continuous function <f> ( x ) 
can be so determined that 

f I /«(*)-<£ (*) I dx < 

J a 
fb 

We have also \ f (z) ~ f n (x) \ dx < ; 

J a 
fb 

and therefore | / (x) — <f> (x) | dx < e. 

J a 

The continuous function cf> (x) satisfies the condition of the theorem for 
the particular function / (a:). 

(4). Let e lf e 2 , ... e m be m measurable sets, no two of which have a 
point in common, and all of which are in the cell (a, 6). Let numbers 
q, c 2 , ... c rn be prescribed, and let / ( x ) be the function that has the value 
c r in the set e r , for the values 1, 2, 3, ... m, of r, and has the value zero 
at points that do not belong to any of the sets e T . 

Let <f) r (#) be the continuous function determined as in (3), such that, 
if f r (x) = 1, in e r) and is elsewhere zero, 


J l/r {x) * r (x) \ dx< ] Cl | +1 c 2 1 f+1 c m i ; 

and let these functions (f> r (x) be constructed for the values r — 1, 2, 3, ... m. 
If 4> {x) = ( x) 4- c 2 <f> 2 («) + ...+ c m </> m (x), we have 

f b r=rn rb 

| / (x) - (x) I dx a S I c r I I fr (a:) - 4>r (*) \ dx < e. 

a r = 1 •' a 

Therefore <f> (x) is the continuous function required by the theorem, 
which corresponds to the function / (x). 

(5). ’ Let / (a;) be a bounded and summable function. In accordance 
with § 385, a function / (a:) can be determined which is such that 

\f(x)-J(x)\< v , 

and that / (x) has only a finite number of distinct values, each of which 
it takes in a measurable set of points. Let 4> (x) be the continuous function, 
constructed as in (4), such that 


We now have 
rb 


i. 


!/(*) 


</> (x) | dx < €. 

f lf(x)-<fi(x)ldxsl \f(x)-f(x) | dx + [ \f(x)-<f>(x) | dx 

J a J a J a 


< t jm (a, b) + e; 

where m (a, 6) is the measure of the fundamental cell (a, b). 

Since rj and € are arbitrarily small, a continuous function <f> (x) such 
as the theorem requires can be determined, corresponding to any bounded 
function that is summable in (a, 6). 
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(6). Lastly, let/ (x) be unbounded, but summable in (a, b). A bounded 
summable function f N ( x) can be determined, as in § 386, such that 

I l/(*) -/» (*) I dx 

■ a 

is arbitrarily small, say < then if <f> (x) be a continuous function, 
obtained as in (5), such that 

f \fn( x )- 4> (*) I dx < fc> 

J a 

fb 

we have | / (x) — <f> (x) \ dx < e. 

J a 

This completes the proof of the theorem, for the general case of any 
summable function / (x). It is clear, by examining the successive stages 
of the process of construction of <p (x), that if / (x) £ 0, in (a, b), then 
cf> (x) £ 0, in (a, b). 

We observe that 

If {/(*)- <t> (*)} dx\<\ | f (x) -<f> (x) | dx<e. 

] J a I J a 

If e have successively the values in a diminishing sequence {e n } that 
converges to zero, and if <f> r (x) be the continuous function that corresponds 
to the value e r , of e, we see that 

lim I (f> r (x) dx — f f (x) dx y 

r~co . a J a 

and also lim | / (x) — <f> r (x) | dx = 0. 

r~ao J a 

The following deduction from the approximation theorem is of im¬ 
portance in view of applications: 

If f (x) be summable in a given interval , or cell , (a, b), a function ip (x) 
which has a constant value within each interval , or cell, of some finite set 

rb 

into which (a, b) is divided, can be so determined that / \ f (x) — ip (x) \ dx 

J a 

is less than an arbitrarily prescribed positive number. 

On the boundaries of the cells of the finite set, / (x) may have arbi¬ 
trarily assigned values, as these do not affect the value of the integral. 
Taking, in accordance with the above theorem, the continuous func¬ 
i' 6 

tion <f> (x), such that | / (x) — <f> (x) | < Je, we may divide (a, b) into a 

. a 

finite set of intervals, or cells, such that, in each of them, the fluctuation 
of * (a:) is < (see § 217) - 

Let ip (x) have, within each such interval, or cell, the value of <p (x) at 
its centre; then I | cp (x) — \p (x) | dx < Jc. 

J a 
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fb 

It now follows that \ f (x) — ip (x) \ dx < e, and thus ip ( x ) satisfies 
the required condition. 

A sequence {f r (a:)} of functions of the type of ip (a:) exists, such that 

lim [ | / (x) - ip r (x) | dx = 0. 
r~ oo J a 

431. Various theorems of importance in the theories of integration 
and of series can be deduced from the theorem of § 430. 

Let / (#) be summable in the open linear interval (a, £), and let (a, 6) 
be a closed interval contained in (a, $). Consider 

P|/(* + 0 -/(*) I dx , 

a 

where t is < — b, so that the integrand is defined in (a, b). Let <p (x) be 

the continuous function, determined as in the theorem of § 430, so that 

r I / (*)-<£ (*) I dx < ie. 

J a 

The integral of | f (x + t) — f (x) | over (a, b) does not exceed the sum of 
the integrals of 

\f(x + t)-<f>(x + t)\, \4>(x + t)-<f>(x)\, and \ </> (x) - f (x) \ 
over the same interval. The first and the third of these integrals are each 
< |e, and the second integral is also < Je, provided t is less than a fixed 
number , dependent on e. Therefore 

( | / (x 4- t) - f (x) | dx < €, 

• a 

if t < ; and since e is arbitrary, it follows that the limit of the value of 

the integral, as t ~ 0, is zero. 

We have thus obtained the following theorem, which has been estab¬ 
lished in a different manner* by Lebesgue: 

If f (x) be summable in an open linear interval ( a , fi), and if (a, 6) be a 
closed interval interior to (a, /S), then 

fb 

lim | / (x + t) — f (x) | dx = 0. 

J a 

The corresponding theorem holds for a summable function defined in 
an open cell (a, ft) of any number of dimensions; thus, in the case of a 
function/ (x (1> , # (2) ) of two dimensions, we have 

rW'f, b<*>) 

lim I / (x<'> + *<*>, z< 2 > + *< 2 >) -/ (*<« x< 2 >) I d (*<*>, z< 2 >) - 0. 

No essential change in the proof of the theorem for the linear case is 
required to make this extension. 

* See the Lemons sur Its series trigonomtiriques, p. 15. 
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432 . The following theorem, also due to Lebesgue*, may be deduced 
from the theorem of § 430: 

If f (a;) be summable in a linear interval (a, b), then, for almost all points 
x 0 , in (a, b), \f(x)~a\ is, for x = x 0 , the differential coefficient of its 
indefinite integral , whatever value the constant a may have. In particular 

f I /(*)-/ (*o) I dx 

JXo 

has a differential coefficient at x = x 0 , equal to zero , for almost all values of x 0 , 
in (a, b). 

/•Jo + Zl 

The integral / \ f (x) ~ a \ dx 

J X Q 

is between the two numbers 

f xo+A 

\<f>{x)-a\dx±\ \f(x) - <f> (x) | dx, 

X 0 J Xq 

where <j> (x) is a continuous function which we may take to be such that 
[ | / (x) - <f> (x) | dx < e*. 

J a 

It follows from this inequality that the set of points at which 
\f(x)-<f>(x) I a e, 

has its measure less than <■; and therefore, if x 0 be any point of a certain 
set H t , where m ( H «) > b — a — e, the condition | / (a; 0 ) — </> (x 0 ) | < c is 
satisfied. 


Since | <f> (x) — a | is a continuous function, 

l rx'+h 

lim t | <f> (x) — a | dx 

k~ 0 tl J x 0 

exists, and is equal to | <f> (x 0 ) — a j, for all points x Q interior to (a, b). 

For almost all values of x 0 , 

I f*.+A 

lim 7 \f(x) - <f> (x) | dx 

h~Q ** J x„ 

exists, and is equal to | / (x 0 ) — <f> (x 0 ) |, since | f (x) ~ <f> (x) | is a summable 
function (see § 405). 

- 1 rx'+h 

We now see that lim r | / (x) — a | dx 
o A 

is between the numbers 


I <f> (*o) “ « I ± I / (*o) - <f> (*o) I . 

for almost all values of x 0 , whatever value a may have. In a set K t , 


such that 


m ( K t ) = m (H,) > b 


* Loc. cit. p. 13. 
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_ 1 [7o + h 

it follows that lira r | / (x) — a | dx 

h ~o h - ■ r - 

is between the numbers | / (x 0 ) — a | ± 2e. 

Let c have the values in a sequence which converges to zero, then m (K t ) 
converges to b — a ; we see then that, for almost all values of x 0 , 

l r^o+A 

lim v | / (x) - a | dx 

h~0 rl J X o 

exists, and is equal to I / (^o) ~ a I» 

whatever value a may have. The theorem has now been established. 

433. Let /j (x) be a function which is summable, and is ^ 0, in a cell, 
or interval, (a, b ); let us further assume that { f x (x)} 2 is summable in (a, 6). 
A continuous function fa (x), at every point s 0, can be so determined that 

P I {/i (X)} 2 - {4>1 (*)} 2 1 

a 

is less than c/4. We have, for every value of x, 

{A (x) - <f>i (x)Y £ 11/i (*)} 2 - {<A, (*)} 2 1; 

and it follows that f {/, (x) — </>j (a:)} 2 dx < e/4. 

a 

Next, let / (x) be no longer restricted to be £ 0, in (a, b) y but let {/ (x)} 2 
be summable in that cell, or interval; / (x) can then be expressed as the 
difference f x (x) — f 2 (x) of two functions, each of which is £ 0, in (a, 6), 
and such that the square of each of them is summable in (a, b). 

Let (f> 2 (x) be a continuous function, corresponding to/ 2 (x), such that 

f {/a (x) - 4>t (x)Ydx < e/4; 

. a 

and let <f> (x) = fa (x) — fa (x). 

We have now 

f {/ (*) - <t> (*)} 2 dx s 2 I {/! (x) - <j> y (a;)} 2 dx 

+ 2 f {/,(*) - <f>, (x)}*dx, 

J a 

and the expression on the right-hand side is < e. 

The following theorem has thus been established: 

If {/ (x)} 2 be summable (whether it be bounded or unbounded) in the 
interval , or cell , (a, 6), a continuous function </> (x) can be so determined that 

\ b {f{x)-<t>(x)Ydx 

• a 

is less than a prescribed positive number e. 

As in § 430, it can be shewn that the continuous function <f> (x) may be 
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replaced by a function ip (x) which has a constant value in each interval, 
or cell, of a finite set into which (a, b) may be divided. 

From this theorem a result can be deduced which has been otherwise 
obtained, for the case of a linear interval, by A. C. Dixon*. 

If {/ (x)} 2 be summable , in an open cell , or interval , (a, ft), and (a, b) be 
a cell , or an interval , contained in (a, ft), then 

lim ( {/ (t + x) - / ( t )} 2 dt = 0. 

.r~0 J a 

In the case of a cell of two dimensions , x represents a pair (x n \ x {2) ) of 
numbers , and the limit is taken as x {1) , a; (2) converge independently of one 
another to zero. 

We assume x to be such that t \ x is in (a, ft), for all values of t in 
(a, b). If <p {t) be a continuous function such that 

f \f(t)-<f>(t)) 2 dt<€, 

J a 

we have [ {/ (t -f x) — / ft)} 2 dt < 3 [ {/ (t -f x) — <p ft -f x)} 2 eft 

J a ia 

+ 3 [V (< -l- *)-</> (<)} 2 rf< 

Cf 

+ 3 \ l \4> (t) -/(<)}*• 

J a 

The first and third integrals on the right-hand side are each < e. The 
function cp (t) being uniformly continuous in a closed interval (a', l/) which 
contains (a, 6), and is in (a, ft), we see that if | x |, or in the two-dimen¬ 
sional case, | x (1) | and | x {2) |, be sufficiently small, the second integral 
is < c. It follows that 

\ b {f(x + t)-f(t)} 2 dt<9e, 

. a 

if | x | be sufficiently small. Since € is arbitrary, the theorem has been 
established. 

From this result, the following theoremf may be deduced: 

If {/ (x)} 2 and {cp (a;)} 2 be summable in an open interval , or cell , (a, ft), 
and (a, 6) be contained in (a, ft), 

[ 4> (l)f(l + *) dt 

J a 

is a continuous function of z, for all points x in (a, b). 

In the two-dimensional case , / (t -f x) denotes f ft a> -b x {l) , t m -b x (2) ). 

* Proc . Camb. Phil. Soc. vol. xv (1910), p. 210. 

f For the case o! a linear interval, this theorem has been proved in a different manner by 
A. C. Dixon, loc. cit., in the case in which the functions <f>, f are identical. 
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Let F ( x) denote I <f> (t)f(t + x) dt ; 

J a 

then, employing Schwarz’s inequality, we have 

{F (x x ) - F (*,)}* s f {<j> (<)}* dt I* {f (t + z x ) - f (< + x*)Y dt ; 

J a J a 

the second integral on the right-hand side is less than 

J a, 

if (a lt ft) is an interval or cell contained in (a, ft, and x' denotes x x — x 2 . 
This integral converges to zero, with x x — x 2 \ hence | F (x x ) — F (x 2 ) | is 
arbitrarily small, for all sufficiently small values of | x x — x 2 |, and there¬ 
fore F (ft is a continuous function. 

APPROXIMATE REPRESENTATION OF AN //-INTEGRAL AS A RIEMANN SUM 
434. Let / (x) be summable in the interval, or cell, (a, b) y and let f N ( x) 
denote the function that has the same values as / (ft at all points at which 
| / (ft | & N, and which has the value N, or -- N, at all other points of 
(a, b). Let the measure of the set ift of points at which | / (ft | > N, 
have the value e. By the theorem of § 430, a continuous function (f> (x) 

( b 

can be so determined that | | f N (ft — (ft | dx < e 2 . In a certain set 

• a 

of points L, such that m (L) < e, we may have | f N (x) — <f> (x) | £ €; in 
the complementary set C (L) we have | f N (x) — <f> (ft | < €. The function 
<f> (x) may be taken to be such that | <f> (x) | ^ N ; for if it have values 
numerically > N, these may be replaced by N or — N , without affecting 
the conditions that <f> (ft is to satisfy. 

A net can be fitted on to the interval, or cell, (a, 6), such that 

r m 

£ 4 , (f r ) m (8,) 
b V 1 

differs from (f> (ft dx by less than e; where ft, S 2 ,... 8 m denote the m 
Jo' 

meshes of the net, and the points {£.} are chosen in any manner so that 
is a point of ft. If there are meshes of the net such that all their points 
belong either to L or to E, the total measure of such meshes cannot 
exceed 2e. 

In all the other meshes there are points that belong neither to L nor 
to E. In these latter, the points ft can be so chosen that they do not 
belong to L> or to E. In a mesh ft, all the points of which belong to L 
or to 2ft we can choose ft so that | / (ft) | differs from the lower boundary 
of | / (ft | in ft, by less than c. Let this be done for all such meshes; then, 
denoting the set of all such meshes by A, where m (A) £ 2c, we see that 

I S/ (£r) rn (8 f ) I < e» (A) + f \f{x)\dx. 

J (A) 
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The sum on the left-hand side is here taken for all the meshes of the 
set A. We have also | 2<£ (f r ) m (8 r ) | < 2 I/e; where U is the upper 
boundary of | <f> (x) | in (a, 6). 

In the other meshes of the net we have 

I («,)!<€!; 

where l is the measure of the interval, or cell, (a, 6). 

Also ( / (a) dx differs from [ </> (a;) dx by less than e 2 + [ j f (x) \ dx; 

J a J a J (K) 

rb r *= m 

hence / (x) dx differs from 2 <f> (£ r ) m (8 r ) by less than 
J a r - 1 


e -f e 2 + 


f l / (*) 

J (K) 


Now 2 </> (f r ) m (8 r ) differs from 2 / (f r ) m (8 r ) by less than 

r-l r = 1 

U -f 2c 2 + 2 Ue + [ \f(x) | dx. 

J (A) 

As A ~ oo, we have e ~ 0, and then / \f(x)\dx and 1 I / (x) I 

j (A) J (K) 

converge to zero. Also 2 I/e (< 2Ae) converges to zero (see § 388). 


We now see that / (x) dx differs from the Riemann sum 

• a 

i f (ir) in (8 r ) 

r “ 1 

by an amount which can be made arbitrarily small by choosing A suffi¬ 
ciently large. 


We have thus proved the following theorem, of which Lebesgue* has 
given another proof: 

If f (x) be any function that is surnmable in the interval , or cell , (a, 6), 

rb 

a net can be fitted on to (a, b) such that f (x) dx differs arbitrarily little 

• a 

r-m 

from the Riemann sum 2 / (£v) m (8 r )> provided the points £ r be chosen 

r-l 

properly in the meshes 8 r of the net. 

The chief interest of this theorem arises from the fact that, if we have 
a finite set of functions / (1) (x),/ (2) (x), .../ (s) (x), all surnmable in the 
interval, or cell, (a, 6), it is possible to determine a net, and a definite set 
of points £i, £ 2 , ... in the meshes 8 t , 8 2 , ... 8 m , respectively, so that, 
for each of the s functions, the difference of the integral and the corre¬ 
sponding Riemann sum is less than an arbitrarily chosen number, the 
points f being the same for all the s functions. 


See Annates de Toulouse (3), vol. i (1909), p. 33. 




To shew that this is the case, only a slight modification in the fore¬ 
going proof is required. We have to deal with s sets E X ,E 2 , ... E s , corre¬ 
sponding to a fixed value of N; the number c can be taken to be the greatest 
of the numbers m (Ef), m (E 2 ), ... m (E a ). The meshes of the net which 
are such that every point of one of them belongs to one at least of the 
2$ sets m (Efj, m (E 2 ), ... m (E s ), m {Lf), m (L 2 ), ... m (L a ) have a total 
measure ^ 2s e; and these meshes must be considered separately from the 
others, as in the case s — 1; they form a set A, such that m (A) £ 2s e. In 

one of these meshes 8 r , we take £ r so that 2 \f (p) (£ r ) | differs from the 

7>~1 

lower boundary of 2 | f (p) (z) | in S r> by less than e; we then have, for 

p -1 

the set of meshes A, 


£/<*> (f r ) m (8 r ) | < em (A) 


\f w (x) 


for each of the values p — 1, 2, ... s, of p. The preceding proof may then 
be applied to each of the functions / (1) (x), .../ (s) ( x ), without essential 
change. 


We have now obtained the following theorem: 


If / (1) (x),f {2) ( x ), ... f (s) (x) be functions that are summable in the interval , 
or cell , (a, b), then a net can befitted on to (a, b ), and a set of points ^, ... 
can be determined in the m meshes of the net , so that, for each of the s functions , 

f b r-m 

f (x) dx differs from 2 / (f r ) m (8 r ) by less than an arbitrarily chosen 

a r -1 

number. 


435. By means of this theorem, various algebraical inequalities can 
be extended so as to obtain corresponding inequalities which involve 
Z-integrals. The simplest case we shall consider is that of the well-known 
inequality 

{a 1 b l -f a 2 b 2 -f ... -f d m b m ) 2 £ (a x 2 -f a 2 2 + ... 4- d m 2 ) ( b x 2 -f b 2 2 -f ... -f b m 2 ). 

If we write 

<h =■/"’ (li) {m (8,)}*, a, = /“> (&) {m (8 2 )}* ; ... =/“> (U {m (8 m )}*; 

= / <2 ’ tfi) («!)}*, b, =/< 2 ’ tf.) {m ...b m = /<” (£ m ) {m (8 m )}*, 

we have 

{ r-m 2 r - m r - m 

x f n) (tr)f i2 '({r)m(8 r )\ s S [{f">($r)} 2 m(8 r )] Z [{/«*> (&)}•» (8,)]. 

r-1 r-1 r =1 

Employing the theorem, we have at once the Schwarzian inequality 
(see § 396), 

{/V* (x)f m (X) dxf S £{/«» (a;)} 1 dx f" {/<*> (*)}* 
since the expressions on the two sides are given to any required degree 
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of approximation by the corresponding expressions in the preceding 
inequality, provided £>, ... £ r are so chosen that the theorem of § 434 
is applicable to the three functions {/ (1 > (x)} 2 , {/< 2 > (x)} 2 ,f {l > (x )/< 2 > (x). 


The following more general theorems* may be obtained in a similar 
manner: 


[ f a) (x)f (2) (x)dx ^ I {f (l) (x)}*dx j {f< 2) (x)} 9 dx 
J a [_• a J L* a 


1/7 


where p and q are any positive numbers such that 1/p -f 1/q = 1; and 


f I/' 1 ’ (x) +/“’ (x) \ v dx 

a 

where p > 1. 


Up 


|/“>(*) I’ 


l Ip 


f I/' 2 ’ (*)!' 


dx 


l tv 


These are derived from the known inequalities 


l Ip 

> 

where p and q are positive numbers such that 1 /p + 1/q ~ 1. 

In the case of the first inequality we take a r = / (1) (£ r ) {m (8 r )} 1/;) , and 
br =/ <2 ’ {(r) W 

This method has been applied*)* by Lebesgue to obtain a proof of the 
second mean value theorem (§ 423). 


r — m 


r-m 

1 Ip 

\ r-m 

2 a T b r 

r -1 


2 | a r |» 

_ r - 1 


2 1 br |« 

Lr-1 J 


i la 


r-m ~l ljp 

r » m 

ski- + 

2 \b r \ v 

_r-1 J 

L r~ 1 


435 1 . The above theorems hold good when the integrations are taken 
over any bounded measurable set E instead of over the interval, or cell, 
(a, b). For we may suppose the functions f a) (x),/ (2) (x) to have the value 
zero at all points of the complement of E relatively to an interval, or cell, 
which contains E. In case E is unbounded, but of finite measure, it is 
the outer limiting set of a sequence of bounded measurable sets {E n } y 
each of which contains the next. The theorems may be applied to integrals 
taken over E n , and then the limits of the integrals, as n ~ oo, may be 
taken; from § 392 it then follows that the theorems hold good when the 
integration is taken over E. In case E is measurable, but of infinite 
measure, employing the definition of an integral over such a set E , given 
in § 437, we can take E to be the outer limiting set of a sequence {E n } 
of bounded measurable sets, of which E is the outer limiting set. It then 
follows, applying the theorems above to integration over the sets E nf 
and taking the limit, as n ~ co, that, when the integrals on the right- 
hand side of the inequalities exist, the theorems still hold good. It has 

♦ See F. Riesz, Math. Annalen, vol. lxix (1910), p. 456. 

f Annales de Toulouse (3), vol. I (1909), p. 36. 
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thus been proved that, if E be a measurable set, bounded or unbounded, 
such that the integrals on the right-hand side exist, 

{ f f a) (*)f (2) (*) dxV £ f {/ (1 > (a?)}* dx f {/<*> (x )} 2 d*...(l), 

L/ v (x)fw (x) dx I s [L 1 /u> (x) |p dx \ !v [/<„ |/<a) (x) |e dx ] llQ 

.(2), 

where p and q are any positive numbers such that 1/p + 1 jq— 1; 

| j I /"» (x) + (x) |* a. [ f | /“> (*) |» (fa llP 

+ 17 I/- 2 * (*>|’(faP.(3), 

Urn J 

where p > 1. 

The following relations*, which are of importance in view of applica¬ 
tions, hold also whether m (E) be finite or not: 

f |/ (1) (x) + /< 2) (x) \ k dx^X k \\ | fn (x) \ k dx + f | r (x) I* dx\ 

J (E) l J(E) J (E) ) 

.(4), 

where k > 0, and X k has the value 2*~* or 1, according as & > 1, or 0 < k £ 1; 

| f (/<!>(*) | k dx- [ \f^(x)\ k dx 

\HE) HE) 

^ * {/( J K)I^ <I> (*) -/<2> {x) \ kd *] vk 

V(r ](k-l)lk ( r ) {k - l)/fc~| 

X LI J(E ) 1 fn) (x) 7 + 1 J(E ) 1 f<2) {X) |AC *7 J (6) ’ 

where k > 1. 

The inequality (4) follows from the known theorems that, 

(a + b )* s 2 k ~ 1 (a k + b k ), 

when k > 1, and that (a -f b) k £ a k + b k , when 0 < k & 1; where a and b 
are positive numbers, by letting a — | / (1) (x) |, 6 = | / <2) (x) | ,* and ob¬ 
serving that | fin (x) + f <2) (x) \ & \ / (1) (z) | + | f (2) (x) |. 

In order to prove (5), we observe that, if a and b are positive numbers 
such that a> b, we have 

“aTT * = k[b + 6 (a - b)?~\ 

where 6 is a number within the interval (0, 1), (see"§ 262). Hence 

(a k - b k )/(a - b) 

is less than ka k ~ 1 i or &6*~ x , according as k ^ 1; and it is in either case less 
than k (a* -1 -f 6 fc ~ 1 ), provided a and b are unequal, whichever be the 
* See Hobson, Journal of Land. Math. Soc. vol. I (1926), p. 213. 
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greater of the two, for k > 0. We have accordingly, writing a = | A |, 
b = | B |, and observing that \ A — B \ n \ a — b\, the inequality 
\\A\ k — \ B\ k \&k\A~~B\{\A I*- 1 + | B 
Prom this inequality it follows that 

f | !/»)(*) |»- \r»(x)i*\dz 
• {E) 

*t[ |/ (1) (X) ~/ (2) (*) I [|/ (1 > (X) I *- 1 + |/< 2) (*) I k ~ l ]dx. 

J (E) 

Also, if k > 1, employing (2), the expression on the right-hand side 
does not exceed 


If l/ (1) 


(*) ~/ (2) (*) I* 


ilk 




l J(E) 

+ 


r 

i , 

J (E) J 


where l/k -f Ijk' — 1. From this, the inequality (5) follows immediately. 


436. Another method has been indicated* by F. Riesz of expressing 
the L-integrals of functions approximately by finite sums, in such a manner 
that the above extensions of known inequalities to the case of integrals 
can be carried out. 

It will be sufficient to consider the case of the summable functions 
/ (1) ( x ), f {2) (#), as the extension to the case of any finite number of such 
functions will then be obvious. 


It has been seen, in §§ 385-388, that the integrals 



dx differ respectively from two finite sums 



dx y 


a x m (e x ) -f a 2 m ( e 2 ) + ... + a P m (e p ), 
a y m (e/) + a 2 m («*') -f ... + a p m (e p ), 
by less than an arbitrarily chosen positive nurfiber, provided the increasing 
set of numbers a Jf Og, ... a p is properly chosen, and e r , e/ are the sets of 
points for which a r & f (1) (x) < a r+1 ; a r s / (2) (x) < a r+1 , respectively. 

If we denote by E rs the set D (e r , e/), we have 


s-p r-p 

e r = 2 E„, e,' = 2 B„; 

8-1 r-1 

all the sets E„ can be arranged in order as a single sequence {F Q }, and the 
two finite sums then take the forms 

A k m (F k ) + A % m (F k ) + ... + A k m (F k ), 

B 1 m (F k ) + B s m (F t ) + ... + B k m ( F k ), 
where F t is the set E r , , and A q — a r ,B, — a,. In the set F„, we have 
a r s /<»' ( x ) < a r+1 , a, s: /'« (x) < a, +l . 


* Loc. cit. 
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It is clear that A 1 B 1 m (F t ) + A 2 B 2 m ( F 2 ) -f ... -f A k B k m ( F k ) is an ap- 
f b 

proximation to / (1) (x) / (2) (x) dx . 

J a 


LEBESGUE INTEGRALS OVER AN INFINITE FIELD 

437. The definition, in §§ 385-388, of the L-integral of a summable 
function / (x) over a measurable set E , is applicable when E is unbounded, 
provided it have a finite measure, in accordance with the definition, given 
in § 134. of the measure of such a set. We proceed to consider the case 
in which E is a set such that the portion of it in each finite cell, or interval, 
is measurable, but in which E has not a finite measure. The set E is then 
said to be measurable, but of infinite measure. Let / (x) be a function, 
defined over the set E, and everywhere s 0; and let it be assumed that 
/ (x) is summable in every finite cell, or interval, A. 

If / (x) dx converges to a definite number, as the distances from the 

•'(A) 

origin of all the boundaries of A increase indefinitely, in any manner, that 

number is said to define the L-integral / (x) dx, of / (x), over the un- 

J (E) 

bounded set E. 

If the set E be in p dimensions, / (x) dx is the measure of the 

• (A) 

(p + l)-dimensional set of points (x (1) , x {2) , ... x {p) , y), where 

a (r) x (r) b (r) , 

for r = 1, 2, 3, ... p, and 0 £ y ^ f (x {1) , x m , ... x {v) ); the cell A being 
(a (1) , a (2) , ... a (v) ; 6 (l) , 6 (2) , ... b {p) ). At points x, not in E, we take/ ( x) = 0. 

It is clear that, when a sequence of cells {A n } is taken, each one of 
which is contained in the next, and such that, as we progress in the 
sequence, all the numbers b {1) , b {2) , ... b (v) become indefinitely great, and 
are positive, whereas the numbers a {1> , a {2) , ... a ip) become indefinitely 

great, and are negative, the values of / (x) form a monotone sequence. 

J ( A«) 

Assuming that this sequence has a finite upper limit, that limit, f(x) dx, 

J(E) 

is the measure of the (p -f 1)-dimensional unbounded set for which 
— oo < x {r) < oo, (r = 1, 2, 3, ... p), 0 £y&f(x); 
it being assumed that f (x) = 0 at all points that are not in E. 

It can be shewn that the limit / (x) dx, when it exists, has its value 

J (E) 

independent of the particular sequence {A n } of cells, each of which is con¬ 
tained in the next. For we may consider two such sequences {A^}, {A^ 2) } 
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of cells. For a value of n there is a value of n' such that A^ 1 * is contained 
in Aand thus 




■ K') 


Since n f must increase indefinitely with n, we have 

lim I m f dx S lim I (! <i / ( x ) dx - 

n-aoJlA^ 1 *) n~oo 

Since A^ 11 and A ( „ 2> may clearly be interchanged in this relation, we see that 
lim ( 2 ) / (*) dx = lim (1) / (x) dx. 

n~cc J (A n ) n~ooJ(A n ) 

If denotes the set of points of E which is contained in A n , {E n } is 
a sequence of bounded measurable sets, each of which is contained in 

the next, and of which E is the outer limiting set. Thus f (x) dx is 

the limit of the sequence of integrals of / (x) taken over the bounded and 
measurable sets {E n }, of which E is the outer limiting set. 

If / (x) is not ^ 0, at all points of E, it may be expressed as the differ¬ 
ence /+ (x) — f~ (x) of two functions / f (x), / ' (x), each of which is ‘^0; 

where f ~ (x) = 0, at any point x at which f (x) is positive. 

In case the two integrals /+ (x) dx, f~ (x) dx both exist, in ac- 

J (E) J (E) 


cordance with the above definition, the Z-integral / (x) dx is defined 


• (K) 


to be the value of /+ (x) dx — f (x) dx \ and / (x) is said to have 

. {E) J (E) 


<K) 


J 

J(E) 


an absolutely convergent L-integral over the unbounded set E. The 
existence of the absolutely convergent integral entails the existence of 

the integral |/(x) | dx. We have then, as before, 

J (E) 

f (x) dx = lim / (x) dx. 

>) . {E n ) 

It may however happen that, although f + (x) dx, f~ (x) dx 

J (An) (An) 

become indefinitely great as the cell, or interval, A n becomes indefinitely 
great, {/+ (x) — f~ (x)} dx converges to a finite limit, independent of 

J (A n ) 

the particular sequence {A n }. In that case, the limit is said to define the 
non-absolutely convergent ^-integral of / (x) over the unbounded set E. 


When f (x) dx exists as a non-absolutely convergent integral, 

J (E) 

( I / (*) I dx 

HE) 

does not exist, because {/+ (x) + (x)} dx does not converge. 

Ue) 
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438. In the case of integration over a linear interval f (x) dx is the 

J a 

f b 

limit, as 6 — oo, of I f (x) dx; it being assumed that / (x) is summable in 
J a 

(a, 6), for all values of 6 (> a). If | / (x) | dx exists, the integral 

J a 

[ f(x)dz 

• a 

is said to be an absolutely convergent L-integral over the unbounded 

f X) ('00 

| / (x) | dx is not finite, J / (x) dx is said to be a non- 

• a -a 

absolutely convergent L-integral over the unbounded interval (a, oo). 

r°° r o 

If both f(x)dx and | f(x)dx exist, their sum defines the value 
J 0 J oo 

of / (x) dx , which is absolutely or non-absolutely convergent over 

(— oo, co), according as | / (x) | dx is finite or not. 

J -ao 

The following theorem provides a criterion for the existence of an 
Z-integral over an unbounded interval which is frequently of use: 

If f (■x ) and f (x) <f> (x) be both summable in every interval ( a , x), and if 

I f(x)dx exists , and <f> (x) be monotone and bounded in (X, oo), for some 

J a 

r°° 

value of X (£ a), then I / (x) <f> (x) dx exists . 

j a 

n * £ I 

/ (x) dx < €, for all 


positive values of h y where e is an assigned positive number; and this 
value of $ can be chosen to be ^ X. We have then, by the second mean 
value theorem, 

f$+h r£+0h rt+h 

f(x)<f> (x) dx - (f) (f) / (x) dx + <f> {g + h) f (x) dx, 

-f • £ .'£ + »* 

where 0 has some value in the interval (0, 1). If | <f> (x) | < k , for every 
value of x in the interval (X, oo), we have 

r£+ft I 

f(x) <f> (x) dx j < 2/ce; 

and since c is arbitrary, the integral j*°°/ (x) <f> (x) dx exists. 

A similar criterion for the existence of an absolutely convergent integral 
is the following: 

If * f (x) and f (x) cf> (x) be both summable in every interval (a, x), and if 


Be© Riemafm s Werke, 1st ed., p. 229; also Pringsheim, Math. AnncUen, vol. xxxvii (1890), p. 591. 
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|*f (x)dx exists as an absolutely convergent L-integral, and \ <f> (x) \ be 
bounded in ( X , oo ), for some value of X (s, a), then the integral 

J°°f(x)<f> ( x ) dx 

exists , as an absolutely convergent L-integral . 

For, since f (x) dx is absolutely convergent, a number £ X) can 

•'a 

rf+ft 

be so determined that | / (x) \ dx < c, for all positive values of h ; 
• f 

where e is any assigned positive number. 

We have then 

I n+h [t+h rt+h 

f(x)<f)(x)dx £ \f(x)<f>(x)\dx<K] 1 / (x) | dx < *e, 

l-'f h h 

where k is the upper boundary of | <f> (x) | in (£, oo). Since e is arbitrary, 
the condition for the existence of f | f (x) <j> (x) | dx is satisfied. 


439. An important class of integrals over an unbounded interval, of 
which the convergence is not necessarily absolute, is that of the integrals 


*oo rcc 

<f> (x) sin xdx, <f> (x) cos xdx , 

. n J n 


where </» (^) is monotone, from and after some fixed value of x , and con¬ 
verges to zero, as x is indefinitely increased. 

rf rs. 

We have <f> (x) sin xdx = </> (x!) sin xdx + (f> ( x 2 ) sinarcfa, where x 1 
J*x 'tx ■£ 

is so large that <f> (x) is monotone for x & x x , and $ is some number in the 

interval (x Xi x 2 ). 

From this we have 

[ <f> (x) sin xdx £ 2 | <f> (x x ) I + 2 | </> (* 2 ) |. 

I • Xx 

Since | <f> (x) | converges to zero, x x may be so chosen that | <f> (x x ) | and 
| <f) (x 2 ) | are both arbitrarily small; hence the condition is satisfied for the 

existence of f 6 (x) sin xdx , as an L-integral which is not necessarily abso- 
io 

lutely convergent. The case of the second integral may be treated in the 
same manner. 


CHANGE OF THE INDEPENDENT VARIABLE IN A LEBESGUE INTEGRAL 
440. The transformation to a new variable, of the integral f / (x) dx, 

J a 

of a function / (x) that is summable in the linear interval (a, b) will now 
be considered. The following theorem* will be established: 

* See Lebesgue, Annales de Toulouse (3), vol. i (1909), p. 44, 
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lfj(x) be summable in the linear interval (a, b), and if <f> (£) be a monotone 
function of £ which is an indefinite integral a + \ x (£) d£, where the interval 

J a 

( a , ft), of £, is made to correspond to the interval (a, b), of x , by means of the 
relation x = <f> (£), then f f{x) dx = f F (£) x (i) d£, where F (f) denotes the 

J a 'a 

function f {<f> (£)}. 

We may assume, without loss of generality, that <f> (f) is non-dimin¬ 
ishing in (a, ft). It has been shewn, in § 407, that the necessary and 
sufficient condition that the monotone function <f> (f) should be an in¬ 
definite integral is that, to any set of points of measure zero on the 
^-segment, there should correspond a set of points of measure zero on the 
x-segment. 

The above theorem is equivalent* to the statement that, when the 
condition here referred to is satisfied, then the equality 

f / (*) dx = [* F (£) x (£) d£ 

• Oi ' a 

holds for every summable function / (x). 

Let A n (x) denote a set of non-overlapping intervals in (a, 6), and 
the corresponding set of intervals in (a, P ); we have then 

m (A n <*>) =[ X (f) dt 

If E ix) be the inner limiting set of a sequence of sets of intervals, each 
of which sets contains the next, there corresponds to E (x) a set E^ ] f the 
inner limiting set of the corresponding sequence of intervals in the 
^-segment. We have then 

m (EM) = lim m {A n {x) ) = lim f x (£) <?£-**[ X (f) <%• 

If m (E {x) ) = 0, it follows that x (f) = 0, at all points of E ( t\ with the 
possible exception of those which belong to a component of measure zero. 

The set of points on the ^-segment that corresponds to a set of points 
of measure zero on the x-segment has not necessarily the measure zero, 
and is conceivably not a measurable set; but in any case that component 
of the set at which x (£) is not zero is of measure zero. 

A set of points on the x-segment which is measurable (B) corresponds 
to a set on the £-segment that is also measurable (B). Any measurable 
set E ix) , on the x-segment, contains a set E x (x) , which is measurable (B), 
and such that E ix) — E x lx) has measure zero. If E {x \ E 1 ( ® > correspond to 
E^\ E x ^\ on the ^-segment, contains the measurable set E x { &, and 
at all points of the set Ew — E x ( t\ with the exception of a component of 
measure zero, the condition x (£) = 0 is satisfied. Thus Ew is the sum of 
a measurable set, and of a set of points in which x (£) = 0. 

* See Hahn, Monatshefte f&r Math . u. Physik, vol. xxm (1912), p. 167. 
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If E (x} is the set of points at which / (x) > A , E { is the set of points 
at which F (f) > A. If F 1 (f) is a function defined only at those points 
of the interval (a, ft) in which x (£) ^ 0, and is equal to F (£) at the points 
for which it is defined, the set of points at which F 1 (£) > A is measurable. 
It follows that F x (£) x (£) is a measurable function, defined over that set 
of points at which x (€) =£ 0; and thus F (£) x (£) is measurable over («, p). 

Let us first consider the case in which / (x) is continuous in (a, b) ; then 
F (£) is continuous in (a, p). Let a net (a, z lf x 2 , ... x n _ lf b) be fitted on 
to (a, 6); and let (a, £ lt £ 2 , ... f n _ l5 p) be corresponding points of (a, p). 
If U ri L r are the upper and lower boundaries of f (x) in (x r _ 1} x T ), or of 
I (£) in (ir-i , it), we have 

U r f' x (£) d£ £ f' F (f) x (f) df a L r f' x (f) df. 

•fr-l -Ur-1 Jfr-1 

Therefore 

S t/ r (a;, - %r-i) 2 f F (£) X (f) <*£ 2 S L r (z r - a; r _,). 

r-1 r -1 

If the net be one of a system of nets fitted on to (a, 6), the two sums 

converge to f / (x) dx, when the nets of the system are taken successively. 
J a 

It then follows that 

[**(£) X(f)#= f7 (*) dx. 

J a J a 

Since b and p may be replaced by x and the theorem has been estab¬ 
lished for the case of a continuous function / (x). 

Next, let / (x) be any summable function, bounded in (a, b). A con¬ 
i' 6 

tinuous function/, {x) can be so determined that | / (x) — /, (x) | dx < € 2 , 

a 

and so that the upper boundary of | /, (x) | does not exceed that of | f (x) | 
(see § 430). Denoting by F t (f) the function that is equivalent to /, (. x ), 
we have 

f {I tf) - j*. m X tf) <efx (€) « + 217 f x (£) d£; 

Ja Jo J(H>) 

where U is the upper boundary of | f (x) | in (a, 6), and H x is that set of 

points at which x (£) / 0, and which is a component of the set that 

corresponds to the set H on the a;-segment, in which | f (x) — /, (x) | > €. 

The measure of H cannot exceed e; and if we assign to e successively the 

values in a diminishing sequence that converges to zero, the set H itr) 

which corresponds to a value e r in the sequence contains the set H { * r+ ' } . 

The inner limiting set of the sequence {H Ur) } has the measure zero, and 

therefore the inner limiting set of the corresponding sequence {H 1 (<r) } has 

the measure zero; as it contains no points at which x (£) = 0. 
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It. follows that m (H,) ~ 0, as t ~ 0; and thus we have 

P f(£)x (£) = lim I .F. (|) * (f) = lim I /«(a?) dz. 

J a e-'-O e'—O • a 

It has now been shewn that 

(" F (?) x (?) d? = f f(x)dx, 

J a J a 

where f (x) is any bounded summable function. 

Lastly, let/ (x) be unbounded and summable; since it can be expressed 
as the difference of two non-negative summable functions, it will be 
sufficient to consider the case in which / (a;) £ 0, in (a, b). Let fa (a) = / (x), 
at points where / (x) ^ N, and let f N (x) = N, at points where f (x) > N ; 
the positive number N being arbitrary. We have then 

\ b f N (x)dx= [ F N (?) x (?)d?, 

where F N (£) is equivalent to f N (x). Since the integral on the left-hand 
side is non-diminishing as N is increased, and since it has the finite limit 

f / {x) dx, as N ~ oo, it follows from the theorem of § 399 that 

a 

f F (?) x (?) d? -)/(*) dx. 

J a J a 

We may replace b by any point x , in the interval (a, b), and by the 
point in (a, /?) that corresponds to x. 

If E be any measurable set in (a, b), and E 1 be the measurable com¬ 
ponent of that set in (a, £) which corresponds to E , and in which y (£) ^ 0, 
we may replace / (x) by a function which has the same value as / (a;) in E , 
and is zero in C (E ); we have then 


f f (x) dx = [ F (fix (()#. 

J ( E) J (L\) 

In accordance with the theorem of § 405, since x = a + we have 

J a 

dx 

= x (£)> f° r almost all values of f in (a, /?). Henoe the above theorem 


may be written in the form 




dx 

it being assumed that ^ is integrable (L), in (a, /?). 


The more general case of substitution in which the function <f> (£) is 
not necessarily monotone but is absolutely continuous, and is consequently 
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an indefinite integral, has been treated* by de l$t Valtee Poussin. He has 
shewn that the equality 

f/(*)<** = r f (€) 4>' (i) d4, 

where a = <f> (a), b = (ft), F (£) = / {<f> (£)}, is valid provided either ( 1 ), 

F {<f> (f)} is absolutely continuous, where F (x) denotes j f (x) dx; or ( 2 ), if 

F (£) <f>' (f) is summable. The case has been considered by Fichtenholz 
in which <f> (£) is not necessarily absolutely continuous, but is continuous 
and monotone. He has shewnj that the formula is still valid when the 
function F {cf> (£)} is absolutely continuous. 

441. The extension of the theorem of §373 to the case in which the 
function / (x, y) is any summable function will here be considered. 

It will be assumed, as in § 372, that a ( 1 , 1 ) correspondence between 
the points of a perfect connex domain //, in the plane of (x, y), and the 
corresponding domain H , in the plane of (£, tj), is defined by x = f x (£, rj), 
V = I 2 (€> v)’ w fi ere the partial differential coefficients of f x and f 2 are con- 

tinuous in H, and the Jacobian = J does not vanish in H. It 

0 (s, v) 

was shewn, in § 372, that, to a closed set h , in H y of cbntent zero, there 
corresponds a closed set h , in H , of content zero. 

If 8 denote a closed cell in H , and 8 the corresponding closed domain 
in H , by taking / (x, y) = 1 , in H , we have, in accordance with the result 
obtained in § 373, 

m ( 8 ) = ( / (x, y) d (x, y) = [ F (£, 17 ) | J \ d (£, r,) = km ( 8 ), 

•'(«) -'( 6 ) 

where k is some number between the upper and lower limits of | J | in o. 

If 0 be any connex perfect domain of which the frontier has its plane 
content zero, O being contained in H, and if G , in the plane of 77 ), 
correspond to G , we see that, | J | being continuous and never vanishing, 
it has a lower limit 1 /A, in G y where A > 0 ; therefore m ( 8 ) < Am ( 8 ), for 
every cell 8 , contained in G. 

If A be a set of non-overlapping cells, contained in G y and A be the 
set of domains in G that corresponds to A, we have m (A) < Am (A). 

Let E be any measurable set of points in G } then E is contained in a 
set E lf of measure m (E) y which is the inner limiting set of a sequence 
{A (n) }, of sets of non-overlapping cells. Let {A (n) } be the corresponding 

* Cours d'Analys einfinitdsimale, vol. I (1914), pp. 280-284, also Trans. Amer. Math. Soc. 
vol» xvi (1915), p. 466. For a critical remark on the proof in this memoir, see Fichtenholz, 
Bulletin de VAcad. roy. de Belgique (1922), p. 441. 
f hoe. cit. p. 443. 
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sequence of non-overlapping domains in G. The set E x — E = F, which 
has measure zero, is contained in a set F x , also of measure zero, which is 
the inner limiting set of a sequence {A' (n) } of sets of cells. Let F x be the 
set, in G, which corresponds to F x ; then 

m (F x ) ~ lim m (A' (n) ) s A lim m (A /(n) ); 

n-'-co n—cc 

hence m (F x ) — 0 . The set E is contained in E x , and contains E x — F x , 
and hence the set E, corresponding to E, is contained in E x , and contains 
E x — F x . Since m (F x ) = 0 , and E x is measurable it follows that E is 
measurable, and 

m (j E) -= m (E x ) ^ Am (E x ) ^ Am (E). 

It has thus been shewn that the set E, in G, that corresponds to a 
measurable set E, in G, is measurable, and that its measure does not 
exceed Am (E). 

If / (x, y) be a measurable function, defined over the set G , the function 
F (£, 77 ) which has the same value at a point (f, 7 ?), of G , as / ( x , y) has at 
the point ( x , y) that corresponds to (£, rj) is measurable in G. 

First, let / ( x , y) be bounded in G ; then a continuous function <f> e (x, y) 
can be so determined (see § 430) that 

| I f(x,y)~ 4>. (*> y) I d ( x, y ) < e 2 . 

• ((?) 

In a set L, of points of measure > m (G) — c, we have 

I / (*, y) - <f>. (*> y) I * «• 

Moreover </>, ( x , y) can be so defined that its numerical value never exceeds 
the upper boundary of | f (x } y) | in G. 

From the theorem established in § 373, we have 

I <f>. ( X, y) d ( x , y) = [ F. (f, v ) | J | d (£, rj), 

Ua) J(o > 

where F t (f, rj) = <f> € (x, y) } at corresponding points. 

To estimate the value of 

I f F ({,*!)] J\d(£, V )~ I F.(f, v )IJId(f, v ) 

1J (O) • ( o) 

we divide the set G into the two parts L and G ( L ), where, in L , the 
condition | F (£, rj) — F t (f, 77 ) | ^ c is satisfied. The absolute value of the 
difference is then seen to be 

<ef \J\d(£ tV ) + 2u\ \J\d(i >V ); 

■<3) J C(T) 

where U is the upper boundary of | / (x, y) | in G. 

Since m {G (L)} ^ Am {G (L)} < Ac, we see that this expression is arbi¬ 
trarily small, if c be taken small enough. 
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It has thus been shewn that 

km / _ (£> V) I J I d (£> v) = f I (£> l) I I I d (if V) ‘ 

e ~ 0 J (O) -'(G) 

and therefore we have 

f f ( x > y) * fa y) = f F (£, v) \ J \ d (f» v)- 

j (O) J (G) 

Next, let / (x, y) be unbounded. Since it can be expressed as the 
difference of two summable functions, each of which is non-negative, it 
will be sufficient to assume that / (. x , y) £ 0 . Let N be an arbitrarily chosen 
positive number and let f N (x, y) — / (x, y ), at all points where / (x, y) s N , 
and elsewhere let fa (x, y) -= N. We have then, since fa (x , y) is bounded, 

I /jv (*, y) d, (x, y) = f jFV (£ 1 ?) | j \d (£, v ), 
ha) l(o) 

where Fw(£,i 7 ) has the same value as fa(x y y), at the point (x, */) which 
corresponds to (£,77). 

Since lim I f N (x, y) d (x, y) = I / (a;,«/) <2 (a:, y), 

JV~co J (G) J (G) 

it follows that lim J^jv (f, 17 ) | J | d (f, 17 ) exists, and is equal to 

N~coJ ( 77 ) 

I f(x, y) d (X, y). 

• «?) 

Since F N (£, 77) | J | is monotone as A is increased, it follows, by applying 
the theorem of § 399, that 

f F (f, ■n) \J \d (f, v) = [ f (*. y) <2 (*» y)- 

The transformation obtained in § 373 has now been extended to the case 
in which / (x, y) is any function that is summable in G. 

The extension, given in § 374, to the case in which, in a closed set of 
points of content zero, the Jacobian either vanishes or is indefinite, is 
applicable when / (x, y) is a summable function. Moreover the case in 
which one of the domains G , G is unbounded, or in which both are un¬ 
bounded, can be considered, as in § 375. 

442. A more general treatment of the transformation of a double, or 
of a multiple, integral has been given by W. H. Young, in which it is not 
assumed that the Jacobian is necessarily of one sign, or that the corre¬ 
spondence of the points (x, y ), (£, 77) is necessarily a ( 1 , 1 ) correspondence. 
He has established the following theorem: 

Let x = x (f, 77), y = y (f, 77) be functions of (f, 77) possessing the property 
of having all their partial derivatives with respect to (£, 77) bounded for all 
values of (f, 77) in the fundarnental rectangle (a lf ft; a 2 , ft); and lei A be 
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dg\ d£; 


the area of the curve in the (x, y)-plane which is the image of the perimeter 
of this fundamental rectangle. Then 

A _ r* f ( Pi1 <L( X > V ) 

* _ n lU d(i,r)) , 

, dx dx dy dy , _ . , _ . 

where ^, g^, g^ represent any of the partial derivatives of x and y with 

respect to £ and 77. 

More generally, the same is true, if the partial derivatives are not bounded , 
provided only the following conditions are satisfied: 

(!)• x (f, 77) is an integral with respect to and an integral with respect 
to rj ; 

( 2 ) * y (£, rj) is an integral with respect to and an integral with respect 
to 77 ; 

'dy dx 

(3) . g^ g- are, except for a set of values 0/77, 0/ measure zero, Zess 
<Aan summable functions of 77 alone, say p (77) and if (77); 

(4) . The same condition as (3) is satisfied, with £ and 77 interchanged, or 

more generally, when p (y)d (f, 77), if (77) d (£, 77) as absolutely 
convergent integrals. 

For the proof of this theorem reference may be made to a memoir* 
by W. H. Young. 

For the transformation of the integral/ (x, y), he has given the following 
theorem: 

Let x = x (f, 77), y — y (£, 77), where the functions are continuous with 
respect to (£, 77), in a fundamental cell. Let it be assumed that any cell in the 
(£, 7})-space has for its image in the (x, y)-space, a portion of that space whose 
boundary divides it into two distinct portions, an exterior and an interior. 
Further let it be assumed that the area of a portion of the (x, y)-space is given 
f d {x, y') 

by J d (£ Tj) an absolutely convergent integral, which may be positive , 

negative , or zero. Then Jf ( x , y) d (x, y), taken over the portion of the ( x , y)- 
space which corresponds to a cell in the (f, rj)-space, is equal to 


if 


{*<*. vh y (I, ^s{tv) d( -’ n) 


taken over the cell. 

For the proof of this theorem referencet may be made to a memoir 
by W. H. Young. 


* See Proc. Lond. Math. 80 c. (2), vol. xvm (1918), p. 339, “ On a formula for an area. 1 
t Proc. Royal 80 c. vol. xcvi (1920), p. 82. 
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harnack’s definition of an integral 
443. A definition of the integral of a function / (#), unbounded in the 
linear interval for which it is defined, was given* by Harnack, which in 
its original form depended upon the employment of Riemann’s integrals 
in sub-intervals of (a, b) in which such integrals exist. The definition given 
by Harnack can however, without essential change of form, be extended 
to the case in which Lebesgue integrals take the place of Riemann integrals. 

The set of points of infinite discontinuity of a function / (#), defined 
for the finite interval (a, 6), form a closed set G . 

It will be assumed that this closed set G is of content zero. Let G be 
enclosed within the intervals 8 J} 8 2 , ... S„, of a finite set A, such that each 
interval of A contains at least one point of G. The complement C (A) 
consists of a finite set of intervals, free at their ends, and in their interiors, 
from points of G. 

Let it be assumed that / (a:) is integrable (L) in each of the intervals 

of C (A), and thus that / (a;) dx exists as an //-integral. The integral 
Jr (a) 

of / (x), in ( a , 6), when it exists, is defined as follows: 

Let it be assumed that a number 1 exists such that , corresponding to each 
arbitrarily chosen positive number e, a positive number £« can be determined 
so that 

1 — [ f (x) dx < e 

JC( A) 

for every finite set A of intervals which satisfy the conditions given above , 
and which is such that m (A) < f e . The number I is then taken to define 
the value of b 

[ f (*) dx. 

J a 

It is convenient to define a function f A (x) which is zero in all points 
of the intervals A, and is equal in value to / (#), at all interior points 
of C(A). 

We have then, in accordance with the above definition, 

I / (x) dx = lim j f A (x) dx , 

Ja m(A)^0 Jo 

the convergence being uniform with respect to all sets A. 

The necessary and sufficient condition for the existence of the integral 
rb 

I f (x) dx, as a finite number, is that, e being assigned, can be so deter- 
J a 

mined that b b 

( / 4 (x) dx - I / a . (x) dx < 2e 

J a J a 

* Math. Annalen, vol. xxiv(1884), p. 220. See also Jordan, Couth d 1 Analyse, vol. ii( 1884), p. 50, 
where a similar definition is given, except that the condition that the set of points of infinite 
discontinuity should have content zero is omitted. 



658 The Lebesgue Integral [ ch . vii 

for every pair of finite sets A, A' of intervals, such that each interval of 
either set encloses within it at least one point of G y provided 
rn (A) < £ e , m (A') < 

and that this condition is satisfied whatever value € may have. 

It is easily seen that the above definition is equivalent to that of 
Cauchy (§ 352), in case the set G consists of a finite number of points. 

It will be shewn that the above definition is more general than that of 
Lebesgue, in that it defines integrals which do not necessarily exist in 
accordance with Lebesgue’s definition. It is clearly less general than 
Lebesgue’s definition, in that it applies only to the case in which the set 
of points of infinite discontinuity of the integrand is of content zero. 

As m (A) converged to zero, the number n, of intervals of A, will increase 
indefinitely. If any sequence A 1? A 2 , A 3 , ... corresponding to a sequence 
*i, € 3 > ••• > °f values of e, which converges to zero, be taken, a sequence 

> A„ 2 , A„ 3 , ... 

may be chosen out of the given sequence in such a way that A„ r contains 
A„ r+1 , for all values of r. For let G (A nj ) consist of, say, r ni intervals (a, ft) 
each contained in some interval (a, ft) contiguous to G. The number n 2 
can be so chosen that it is the smallest integer (> n x ) such that m (A nj ) 
is less than the least of all the 2 r x numbers a — a, f$ — Clearly then 
(a, ft) is within an interval complementary to the set A nz . The numbers 
7i 3 , /i 4 , ... may then be chosen successively by the same procedure. The 
sequence A ni , A nt , ... is therefore such that each set contains the next. 
r& fb 

When both / (#) dx and / \ f (x) \ dx exist, in accordance with the 

J a J a 

above definition, I / (a?) dx is said to be absolutely convergent. Those 

I a 

integrals which exist in accordance with the above definition, and are not 
absolutely convergent, will be considered in Chapter vm. 

444. It will be shewn that: 
rb 

If f (x) dx exists , and is absolutely convergent , the integral also exists 
J a 

as an L-integral , and conversely , that, if it exists as an L-integral, and the 
set of points of infinite discontinuity has content zero , it also exists as an 
absolutely convergent integral , in accordance with the above definition. 

First, let / (x) ^ 0, in the interval (a, 6). Taking the sequence 

Aj, A 2 , A 3 , ... 

so that each set contains the next, we see that the values of 

[ A r (x) dx, 

J a 

for r = 1, 2, 3, , form a monotone non-diminishing sequence of numbers. 
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and thus they either increase indefinitely, or they converge to a limit A, 
Assume that the latter is the case. 


Let A/, A/, A 3 ', ... be any other sequence of finite sets of intervals 
which satisfy the conditions in the definition, but not necessarily such 
that A/ contains A', +1 , for all values of s. We have 


/a'* (*) dx - [ f Af (x) dx < Pm (A r ); 

. a J a 


where P is the upper boundary of / (#) in the intervals of C (A/); for the 
above difference is less than the integral of / (x) over those intervals that 
are in A r and also in C (A/). If e be arbitrarily chosen, and s be fixed, for 
all sufficiently large values of r we have, on the assumption of the existence 

of the limit A, of |* f Ar (x) dx , 


[ h'. < x ) • 


hr ( X ) dx 


and therefore 


fvjfi) dx t ii+e; 


this holds for each value of s . Similarly, we see that, for a fixed value of r, 
and for all sufficiently large values of s , 

I h. (*) dx > | f Ar (x) dx - Je; 

J a J a 

and by choosing the fixed value of r so great that 

J hr ( x ) dx > A - £e, 


we have 


f h‘. ( x ) dx> A - e. 


for all sufficiently large values of s. As 


f h’r (x) dx 

J a 

lies in the interval (A — €, A + e) for all sufficiently large values of s, and 
since € is arbitrary, it follows that 

[b 

h( x ) dx 

J a 

converges to A. Moreover, since 

j f*r (x) dx - jh', (x) dx < P’m (A',), 
where P' is the upper boundary of / (x) in C (A r ); and also since 
[ hr (x)dx- [ h".‘ (x) dx < P'm (A",.), 

J a J a 

where {A'V} is any other sequence of intervals enclosing G; from these 
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inequalities, we see that, if m (A',), m (A" s .) are both less than $e/P', the 
integrals 

/ /a', (*) dx, / / A ",- (a?) dx 

are both > A — c, and they have been shewn to be ^ A + e; it follows 
that the integrals differ from one another by less than 2e, Hence the 

fb 

condition for the existence of the integral f (x) dx is satisfied, on the 

J a 

assumption that the integrals of f Ar (x) converge to a finite number as 
r ~ oo. 


Thus it has been shewn that : 

If f (x) be never negative , it is necessary and sufficient for the existence of 

the integral / f (x)dx in accordance with Harnacks definition that , if 
J a 

Aj, A 2 , ... be a sequence of finite sets of intervals each containing the next , 
such that each interval of each set contains within it a point of G, and such 
fb 

that m (A r ) — 0, then I f Ar (x) dx should be less than a fixed finite number , 

J a 

for all the sets A r . 

fb 

Let it now be assumed that I / (a;) dx exists as an L-integral; we 

J a 

have then b b 

[ /a (z) dx = [ / (*) dx - [ / (x) dx ; 

Ja Ja .'(A) 

and since (§ 392), / (x) dx converges to zero, uniformly as m (A) ~ 0, 

b U*) 

we see that f A (x) dx converges to / {x)dx\ and thus the integral also 

J a J a 

exists in accordance with Hamack’s definition. It appears moreover that 
in this case of a non-negative function the condition that each interval 
of A should contain a point of G is unnecessary. 


It has thus been proved that, if / (x) be non-negative, and have an 
L-integral, it has also an integral in accordance with Hamack’s definition, 
and the two have the same value. 


Next, let f (x) have both positive and negative values, in (a , 6), and 
assume that it is summable in the interval. 

If f (x) = f + (#) — / (x), /+ ( x ) and f~ {x) are summable, and therefore 
they have not only L-integrals but also Harnack integrals. It is clear that 

\ f A (x)dx=\ f A + ( x ) dx - I / A - (x) dx. 

J a j a J a 

Now, as m (A) converges to zero, the two integrals on the right-hand 

side converge to b b 

I / + (*) dx, I /- (x) dx, 

J a J a 
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as has been shewn above, although A is such that an interval of it does 
not necessarily contain a point both of G l and of G z , the sets of points of 
infinite discontinuity of /+ (x) and of f~ ( x ). 

f b 

It follows that / (x) dx exists in accordance with Harnack’s defini- 

a 

tion, and is equal to the L-integral. 

rb 

It follows immediately that \f(x)\dx exists as a Harnack integral, 

• a 

and is equal to the sum of the integrals of /+ (a;) and/ - (a:). 

Conversely, let us assume that 

[ f (a;) dx and [ \f (x)\dx 

J a J a 

exist as Harnack integrals. 

The points of infinite discontinuity of the two functions f (x), | f (x) | 
are the same, hence the Harnack integrals of the two functions are the 
limits of L-integrals taken over sets of intervals that are the same for 
the two functions. It follows that | f (x) | -f /(x), | / (x) | — / (x) have 
Harnack integrals, the sum and difference of those of | f (x) | and / (x); 
or the two functions /+ ( x),f~ (x) have Harnack integrals. 

It is then sufficient to shew that the existence of the Harnack integral 
of a non-negative function involves the existence of the L-integral of the 
same function. 

Let it therefore be assumed that f (x) ^ 0, and that the Harnack 
integral of / (x) over (a, b) exists. The points of infinite discontinuity of 
/ (x) can be enclosed in a finite set A, of intervals, where m (A) is so small 
that 


f / (x) dx 


is less than the Harnack integral 


( / (*) dx 


by less than an arbitrarily chosen positive number £. Let N be a positive 
number not less than the upper boundary of / (x) in G (A), and let fx (x) 
be the function corresponding to / (x) employed in de la Vallee Poussin’s 
definition (§ 387). Let another set of intervals A', all interior to intervals 
of A, enclose all the points of infinite discontinuity of / (x). The integral 
of / (x) over C (A') lies between the integral over C (A) and Harnack’s 
integral, and therefore differs from the last by less than £. 


It follows that 


[ f(x)dx<C, 

[ f N {x)dx< £. 


(A—A') 


and therefore 
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From this we deduce that 

[ f N (x) dx < £ + Nm (A'); 

and since this holds for an arbitrarily small value of m (A'), N being fixed, 
we have 

f N (x) dx s £. 

.'(A) 

It now follows that /jv (x) dx — f (x) dx s. £; 

Ja JC( A) 

and since £ is arbitrarily small, N being sufficiently increased, it follows that 

rb 

f N (x) dx 
J a 

has a definite limit, as JV ~ oo, and that this limit is Hamack’s integral 



It has thus been shewn that the existence of Harnack \s integral implies 
that of the integral as defined by de la Vallee Poussin, the integrals having 
the same value. By § 387, it follows that the ^-integral exists, and is the 
same in value as the Harnack integral. 

Accordingly, an absolutely convergent integral of a function such that 
the points of discontinuity form a set of measure zero, and which exists 
according to either Lebesgue’s definition, or that of Harnack, exists also 
according to the other definition, and has the same value for the two 
definitions. It has moreover been shewn that the condition that each 
interval of A must contain at least one point of G, the set of points of 
infinite discontinuity of the function, is unnecessary when the integral is 
absolutely convergent. 

THE LEBESGUE-STIELTJES INTEGRAL 

445. Let / (x) be a measurable function, defined for the linear interval 
(a, 6), and let <f> (x) be a bounded monotone non-diminishing function 
defined in the same interval. Denoting <f> (x) by £, we consider, as in § 252, 
the functional image E (n , on the ^-segment, of a set of points E {x) on the 
^-segment, such that to a point x ', at which cf> ( x ') is discontinuous, there 
corresponds the whole interval (<f> (x' — 0), <f> (x' + 0)) of points on the 
^-segment. It was remarked in § 252 that, if E (x) is measurable on the 
^-segment, it is not necessarily the case that E (f) is measurable, but that, 
if E {x) is measurable (B), then E { ^ is measurable ( B ). 

Let F (£) be the function defined, at every point of the ^-segment, by 
the condition F (£) =/(a;), where x is the point that corresponds to f; 
if x' is a point of discontinuity of </> (x), the interval (<f> (x' — 0), <f> {x f 4- 0)) 
is an interval of invariability of F (£), in which F (f) = <f> (x'). 
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ft 3 

Whenever the L-integral F (£) d£ exists , where a — <f> (a), = <f> (6), 

tte faZwe may be said to define that of the Lebesgue-Stieltjes integral , or 
LS-integral j* f (x) d<f> (x), of the measurable function f (x), with respect to 
the monotone function f> (x). 

In case f F (£) d£ exists as an jR-integral, the integral [ / (x) d<f> (x) 
J a J a 

may be an BS -integral (see § 378). It has been shewn, in § 377, that the 
necessary and sufficient condition that the integral may be an /^-integral 
is that the variation of <f> (x) over the set of points of discontinuity of f (x) 
should be zero. 

If the function f (x) is measurable ( B ), the function F (£) is measurable 
(B) on the ^-segment, and Lebesgue’s definition is applicable to define 

fW)#. 

J a 

In case <f> (x) is of bounded variation, and is therefore expressible as 
the difference of two monotone non-diminishing functions fa (x), fa (, x ), 
the ///^-integral of / (x) with respect to <f> (x) may be defined to be the 
excess of the iv/S-integral of f(x) with respect to fa (x) over its 2>$-integral 
with respect to fa (x), whenever these latter integrals exist. 

• Referring to the definition, in § 252, of the variation of the monotone 
function </> (x) over a set of points E in the x-segment, and denoting by 

e n that set of points at which c n £ / (x) < c n+J , we see that the L$-integral 

00 

/ (x) d<f> (x) is defined as the limit* of 2 c n V {en) </> (x) ; the set of numbers 

-oo 

{ c n } being such that c„ +1 — c n is less than a fixed number e, for all positive 
and negative values of n, and the limit being taken as c ~ 0; it being 
assumed that the variation V (en) (x) exists for all the sets e n , so that 
F (£) is measurable in (a, /3). 

The definition of an L$-integral, over the x-segment, as an Z-integral 
over the f-segment, may be employed to extend the properties of //-inte¬ 
grals to the case of //^-integrals. 

We have, for example, the extension of the property of L-integrals, 
that, if {/ n (x)} is a sequence of summable functions that converges to 
/ (x), and such that | /„ (x) | is bounded for all values of n and x, then 

[ f (x) dx = lim j f n (x) dx. 

J a n— oo J a 

The corresponding property of the LS-integral is that 
f / (*) d<f> (x) = lim f /„ (x) d<f> (x), 

J a n~co J a 

* See Hildebrandfc, “On integrals related to, and extensions of, Lebesgue integrals,’' Bull. 
Amer. Math . Soc. (2), vol. xxiv (1918), p. 191. 


~T 

j : 
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where the bounded functions {/ n (a:)} are such that their L$-integrals with 
respect to <f> (x ) exist. 

446. It has been pointed out by Hildebrandt (loc. cit.) that the defini¬ 
tion of an integral due to W. H. Young, referred to in § 389, may be so 
extended as to give rise to a definition of integration of a summable function 
with respect to a monotone function, more general than the definition 
given above. 

Let the interval (a, b) be divided into a finite, or enumerably infinite, 
set of parts e x , e 2 ,... e n , ... over each of which the function <f> (x) is 
measurable, and such that, over each of the sets e n , the function / (x) 
has finite upper and lower boundaries U n , L n . Consider the sums 
S x - </> (x), S t - </> (x); 

then the lower boundary of S 1 , for all such modes of division of (a, b) into 

fb 

the sum of sets, is defined to be the upper integral f (x) d(j> (x), of / (x) 

.' a 

with respect to </> (x), over the interval (a, b). Similarly, the lower integral 
/ (x) d<f> (x) is defined to be the upper boundary of $ 2 , for all such modes 

'a 

of division of the interval (a, b). When the upper and lower integrals have 

rb 

the same value, that value may be said to define the integral / (x) d<f> (x), 

J a # 

of / (x) with respect to cf> (x). In accordance with this definition any 
summable function / (x) will have* an integral with respect to <f> (x); the 
division of (a, b) into parts being restricted to be a division into sets that 
are all measurable ( B). The definition of § 445 is such that all functions 
that are measurable (B) have integrals, finite, or infinite, with respect to 
the monotone function <j> (x). 

Lebesgue has givenf two modes by which a Stieltjes integral can be 
reduced to an L-integral. A transformation of an L-integral into a Stieltjes 
integral has been givenJ by Van Vleck. If e (y) denotes the measure of 
the set of points for which L £ f (x) < y, where L is the lower boundary 

of / (x) in (a, b), the L-in tegral / (x) dx is equal to the Stieltjes integral 

J a 

[ u 

yde (y ), where U is the upper boundary of / (x) in (a, b). This includes 

J L 

the case in which one, or both, of the numbers 17, L are infinite. It has 
been shewn by Bliss § that an L-integral is reducible to an I?-integral. 
f u f u 

For, / yde (y) = TJ (b — a) — e (y) dy; and the last integral is an 
J L JL 

.K-integral, since e (y) is a monotone function of y. 

* See Hildebrandt, loc. cit. p. 191. 
t Comptes Rendus, Paris, vol. cl (1910), p. 86. 
t Trans. Amer. McUh. Soc. vol. xvm (1917), p. 326. 

§ Bull. Amer. Math. Soc. vol. xxiv (1918), p. 1. 
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447. From the theorem of § 440 we see that, if ® (x) — f <j> ( x) dx, 

- a 

where <I> (x) is a monotone function gf x } in an interval (a, b), then 

I / (x) <f> (x) dx 


is equal to F (£) d£, provided F (f) is summable in the interval (a, ft) 

J a 

that corresponds to (a, b), when the transformation £ a - f j (x) dx 

J a 

is employed, and F (£) = / (x). It then follows from the definition in § 445 
that x 

I f(x)4> (x) dx = I f (x) d<fr (x). 

* a J a 

This may be extended to the case in which <1> (x) is the indefinite integral 
of any summable function <f> (x). For (f> (x) may be expressed as 

<f>i («) " </>2 W- 

where <f> x ( x ), <f> 2 (x) are non-negative summable functions; and thus 


(x) — I (f) x (x) dx — j </> 2 (a;) dx 
- <f>, (x) - <I> 2 (x). 

In accordance with the definition in § 445, we now have 

f f(x)4> (x) dx — j / (x) do, (x) - f f (x) d<I> 2 (x), 
da -a J a 

or f / (x) 4> (x) dx = [ f (x) d<D (x), 

J a J a 

where <t> (x) denotes the indefinite integral of the summable function <f> ( x), 
Let f (x) be monotone and bounded in the interval (a, b), then 

f f (x) d<D (x) 

J a 

is an i?$-integral, since the variation of O (x) over the set of points of 
discontinuity of / (x) is zero, because <J> (x) is an indefinite integral. 

We then have (see § 376) 


( / (x) d$> (x) - \f(x) <D (x)\ ~ f O (x) df (x), 

• a L J a J a 

and thus the theorem for integration by parts may be written in the form 

fj (*) f> (x) dx - |/ (x) O (x)] a -/ a <D (x) df (x). 


Moreover, since / (x) is monotone, 


® (x) df (x) = <I> (/x) {/ (b) - f (a)}, 


where /x is some point in the interval (a, 6). 
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Therefore f(x)<f> (x)dx =/(£>) C> (6) - <h (/*) {/(ft) -/(o)} 

J a 

= / («) | («) dx + / ( 6 ) I <f> ( x) dx. 

a • M 

A proof of the second mean value theorem* has thus been obtained 
by employing the method of integration by parts, as applied to the 
i?$-integral. The more general form of the theorem, and Bonnet’s form, 
can be deduced, as in § 422. 

THE ^-INTEGRAL FOR FUNCTIONS OF TWO VARIABLES 

448. Let / (x a) , x (2) ), g (rr (1) , x {2) ) both be functions of bounded variation, 
in accordance with the definition of § 254. Let the functions be such that 
each of them has an P$-integral with respect to the other f, taken over 
the cell (a (1) , a (2) ; 6 (1) ,5 (2) ). Then these integrals satisfy the following relation: 


<A,f 


■(b<D,b(8)) 


a (2) ) 


g(x ( 1 ) ,x { 2 ) )df(x"\xW) 


= [/(*' 

rft (1 

J n( 1 


X (2) ) g (z a) , x< 2 ') 




.' a< J > 

r&<*» 


/ (# (1) , x {2) ) dg {x {1 \ x {2) ) 


(«<»), a<»>) 
6 <*> 


+ 


a <2> I 


/(^ A >, x™)dg(x^\x™) 


a< 2 > 

6<n 


a (>) 




f(x™,x (2) )dg{x (1 \x™), 


f(x (1) , x^)dg(x^, x< 2 >) 


fed) 


a (| ) 


/ (&<!>, x< 2 >) dg z< 2 >) - / (a (1) , z< 2 >) dg (a (1) , z< 2 >), 
r ~i (*><», &< 2 >> 

f (x"\ x< 2) ) g (x' l \ x' 2) ) 


where 
denotes 
and 

denotes 

/ (6 (1) , 6 (2) ) ^ (6 (1) , 6 <2) ) - / (6 (l) , a (2) ) g (b {1) , a< 2 >) - f (a (1) , 6< 2 >) ? (a* 1 *, 6< 2) ) 

+ f(a< 1 ',a< 2 ')g(a' 1) ,a< 2 '). 

In order to prove this formula, let the rectangle (a (1) , a {2) ; b {1) , b {2) ) be 
divided into (m — 1) (n — 1) partial rectangles, and let P Ti denote a corner 
of one of these partial rectangles, where i has the values 1, 2, ... m — 1, 
and j has the values 1 , 2, ... n — 1 . We have then the identity 

w-ln-l 

^ ^ ; (/t.i /»>1 ,1 ~ fi. j+l d~ 

fi+l, j+i tot.i 9i+l»i i-fl d - f7»+l.3+l)l 
m-1 n-1 

“ ^ /t +1,1 to,>1.1 ~~ l) + S / ltJ +1 tol, ;>1 “ 9l,j) 

i-1 ^’-1 

ra -1 n-1 

^ fi+l*n toi+1. n n) ^ fm,j+1 tom, i-fl ^m, j) 

i-1 1-1 

d~ {.fl,l9l»l fl,n9l,n fm,l9m,l d~ /m, n9m, n] > 

* See W. H. Young, Proc. Lond. Math. Soc. (2), vol. xvi (1916), p. 280. 
t This can be shewn to be the case if one of the functions is continuous. 



447 , 448 ] The RS-Integral for Functions of Two Variables 667 


where f it j 9 g if j denote the values of/ (z (1) , x (2) ), g (x a) , x {2) ) at the point 

P* 

If we assume that the functions / (# (1) , x l2) ), g (x {1) , x {2) ) are both of 
bounded variation in accordance with the definition of § 254, and that 
each of them has an 7?$-integral with respect to the other, as the numbers 
m and n are indefinitely increased, and the rectangular sub-division is 
such that the maximum diagonal of the partial rectangles converges to 
zero, the integrals in the formula (A) will be given as the limits of the 
summations in the above identity. 

The formula (A) may be reduced to a different form which is one of 
four different forms. 


It can easily be verified that 

/ (a?«>, 6 (2) ) dg (x< 1 >, z< 2) ) - / (x™, 6 (2 >) d 

.(«<!>, «,<*)) Ja'» 


g (x< l \ x' 2) ) 


&<*> 


a< 2 > 


where 


g (a; (1) , x { ‘ 


a< 2 > 


denotes g (x (l \ b {2) ) — g (# (1) , a (: 


Employing this result , we have 

rd<>> 


/ (x (1 >, x< 2 >) dg (x< l \ x (2) ) 

/(&<»>, fe< 2 >) 


6<2) 

a< 2 > 


(a<‘>, n < 2 >; 


f{x ll \ b< 2 >) dg (x {1 \ x (2 >) 


4* 


} \f (x< l \ b {2) ) - f (x"\ a (2) )] dg (. x <*>, a (2) ) 




(a<»,a< 2 >) 


/(x (1) , W 2) )dg(x {l) ,x v 


J a (,) 


+ [/ (6< 1) , 6< 2) ) -/(6 (1) ,u (2) )] 


f(x {1 \x {2) ) 

(a (1) , a< 2 >) 

17 (6 (1) , « (2) ) - 17 (a a) , a (1) )]- 


A similar expression for 


rb 
J a 


f ( x {1) , x (2) ) (x (1> , a; 


( 2 )\ 


b < l > 


can be obtained. On substituting these results in (A), we obtain the 
formula 


(B x ). £ (« (1) , tf (2) ) ^/(z (1) > ^ (2) ) 

J (a lt \a (2) ) 

r 

= ^(a (1, ,a ( «) /(*«>,*<«) 

J (a (1> » 


(6 (1> ,6< 2) ) 
a< 2 >) 


r(6C»), &<*>) r "KWU.ftW) 

+ x (2) ) rfgf(a; (1, ,x ( 

r&a> r "I 6 (2 >) 

+ / /(x (1) ,* (2) ) dg(x<",aM) 

J a<*> L J (*<*>,«<*>) 

/•6< 2 > r 1 (&<‘>, b< 2 >) 

+ /(* {l, ,* (2) ) dg (a (1) , # (2) ). 

J «<« L J («<’>, x< 2 >) 
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In a similar manner, or by interchange of the corners of the cell, we 
obtain the formulae (B 2 ), (B 3 ), (B 4 ). For example, the formula (B 2 ) is 


r(bi'KbW) 

i 

(a (1 >, a ( *>) 


g (z (1 >, x' 2 >) df {x™, x < 2 >) = g (a (2 >, 6< 2 >) 


f(x*\x™) 


JfaW.aW) 


[{U l \ b<*>) r 

/(^<1>,^ <2) ) (*<«, *<*>) 

J(aW,aW) L J («<*>, a<*>) 

rb(') r ~1 (x<*>, b<*>) 

- \f(z {V ,xW)\ dg (x ll) , 6 (2> ) 

.'a^L J(a< l >,a<*>) 

f6< 2 > r 1 <&<*>, x< 2 >) 

- dg(a'*>,x™). 

J a< 2 > L J (a (,) . a<*>) 

The formulae for functions of three variables have been given* by 
W. H. Young. 


448 1 . Let / (x (1) , z< 2) ) be a function of bounded variation, in accord¬ 
ance with the definition of § 254, and let 0 (x (1 \ x (2) ) denote the indefinite 


integral [ 


g (x {X \ x {2) ) d (# (1) , x {2) ), where g (x (1) , x (2) ) is any function 

-'(al'KaW) 

which is summable in the cell (a n) , a (2) ; b (l) , b {2) ). It will then be shewn 

/ (x (1) , x (2) ) g ( x {1) , x [2) ) d (# (1) , x (2) ) is equal to 

J (a<*>, a< 2 ') 


that the L-integral 
W l \bW) 


/: 


f(x {1) , x (2) ) dG (x n) ,x (2) ), which exists since G (x ll) , x {2) ) is absolutely 
(«<»>, «<*>) 

continuous . For its variation over the set. of points of discontinuity of 
/ #< 2 >), of measure zero (§ 308), is zero (see § 381). 

To prove the theorem, let it be assumed, in the first instance, that 
g (x {1) , x (2) ) is a non-negative function, so that G (x a) ,x (2) ) is quasi-mono¬ 
tone. 


We see that 


/ 

J (a 


or 


/ (x^, x {2) ) g (x™, x< 2 >) d (# a) , x< 2 >) 

(o (1 >,a (8) ) 

r»m /*(j8 r < , ),jSr< 2 )) 

S / (x«>, x (2) ) 0 z< 2 >) d (x (1) , z< 2 >), 

r — 1 j ( a rC), a 


, ar<*)) 


in accordance with the notation of § 381, lies between 


r - m r - m 

2 <7 (8 r ) A* <7 (a; (1) , z< 2 >) and 2 L (8 f ) A 6r G *< 2 >), 

r-1 r- 1 

r(&< l >,&< 2 >) 

both of which converge to / (a? a> , x {2) ) dG (x a \ x {2) ), as m is in- 

J (at*). a< 2 >) 

definitely increased. Thus the theorem holds when g (x (1) , x i2) ) is non¬ 
negative. Any summable function g (x {1) , x {2) ) can be expressed as the 
difference of two non-negative functions g 1 ( x {1) , ar (a) ) and g 2 (x (1) , a? (2) ), to 
each of which the theorem applies; it then follows that the theorem 
holds generally. 

* See W. H. Young, Proc. Lond. Math . Soc. (2), vol. xvi (1916), p. 281. 
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449. In connection with the theory of the reduction of a quadratic 
form involving an infinite number of variables to a canonical quadratic 
form, certain limits were introduced by Hellinger*, for the representation 
of which he used the notation of integration. A limit, of this species, may 
accordingly be spoken of as a Hellinger integral. It was shewn by Hahn 
that every Hellinger integral can be expressed as an L-integral. 

Let g ( y ) be a continuous non-decreasing monotone function, defined 
for the linear interval (a, P ); and let / (y) be a summable function, defined 
for the same interval, and such that it is constant in any interval, con¬ 
tained in (a, p), in which g (y) is constant. 

Let a net be fitted on to the interval (a, p ); and denote its meshes by 
(y<s, Vi), (Vi> Vi). ••• (ym-i,y m ), where y 0 = a, y m = p. 


v 1 {/ (: yr) f (yr-l)} 2 


If the sum D LLkJz — l . } ias a finite upper boundary , when all 

J r -1 g (yr) - g (y r ~i) J y 

possible nets are taken into account, that upper boundary is denoted by 
re \df (v)l 2 

j dgldf) ’ an d a Hellinger integral. In the above sum , any term 

in which the denominator, and consequently the numerator, vanishes , is 
omitted. 

Let the new variable x be defined for the interval (a, b), where a = g (a), 
b = g (P), by the definition x = g (y). 

The sum employed in the definition of the Hellinger integral becomes 

£ -— ( Xl Tr 1 ^ . f where x r corresponds to y r , and F (x), or F {g (y)}, 

r -1 x r — x r _ x 

is identical with / (y). 

If f x (,V)j /« (y) are ^ wo summable functions, defined in (a, P), and each is 
constant in any interval in which g (y) is constant, then if the sum 

r if {/l (yr) ~ fi (yr-l)} {ft (yr) - /, (yr-l)} 
r~ 1 9 (yr) - 9 (y r -l) 

has a definite limit , the same for all systems of nets fitted on to (a, p), this 

limit is denoted by f —— , and is also called a Hellinger integral. 

Ja dq (y) 

The sum here employed is equivalent to 

r £ {Fi (x r ) - F 1 (x r _ x )} {F 2 (x r ) - F 2 

r * 1 x r — x r~\ 

where F, {g (y)} =/, (y), and F 2 {g (y)} =/ 2 (y). 

* See the dissertation “Die orthogonal invarianten quadratischen Formen von unendlich 
vielen Veranderlichen,” Gottingen, 1907; also a memoir in Crelle's Journal , vol. cxxxvi (1909), 

p. 210. 
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450. The following theorem will be established : 

If cf> (x) denote a function , bounded , or unbounded , in (a, b ), such that 
{<f> (x)} 2 is summable in the interval , then 

l IL * w *’}' ■ + 5, {L * <*> +-+ S 1 i L * w *}'• 

where S lf 8 2f ... 8 m are the intervals of a net fitted on to (a, b ), converges to 
I {<£ ( x )} 2 dx, / or an y system of nets. If {(f) (x)} 2 is notf summable , the above 

J a 

sum diverges , /or a system of nets. 

In case </> (x) is integrable (ft), the truth of the theorem is obvious; for 
<f> (x) dx is equal to 8 r multiplied by a number lying between the upper 

J (S r ) 

and lower boundaries of <f> (x) in S r ; and thus the expression reduces to 

ib 

the Riemann sum employed in the definition of {<f> (a:)} 2 dx. Thus the 

J a 

theorem holds, in particular, for a continuous function <f> (x). 

Next, let <f> (x) be any function that is non-negative, summable, and 
bounded, in (a, 6). A continuous function f (a:), also non-negative, can 
be so determined that 


f {<f> (*)}* ~ U (*)}* \ dx < e, 

J a 


is an 


The sum 


r ”77i l 

Si = 2 i 

r-1 o r 


r 

and consequently that {<f> (x) — / (a:)} 2 dx < e, (see § 433); where e i 
J a 

arbitrarily prescribed positive number. 

r-m l ( f ) 2 

2 * « / <f> (x) dx[ may be expressed by + S 2 -f S 3 , where 

r-lO r (J(Sr) ) 

f {<£ (*) - / (*)} dxi , = S ^ { f f(x)dx l , 

J(Sr) J r-lMJ(5r) j 

r-m l r r 

S» = 2 2 k- {<f>(x) ~f (*)} / (*) <£c. 

r-1 

By using the Schwarzian inequality (§ 396), we have 

If {<f> ( x ) -/ (*)} * S r j {<f> (x) - f (a;)}* 

( J (Sr) ) J (Sr) 

and hence we find that S 1 < e. Also, since f (x) is continuous, we have, 
for all nets of sufficiently large order, in any system of nets, 

( {/ (*)} 2 dx > £ 2 > [ {/ (x)} 2 dx - €. 

.'a J a 


and 


We also have 

I S t I < jj t (x) -/(*) I dx< 2U [(6 - a)fj<f> (x) -f(z))»dx j ; 

and thus | S 3 | < 2U (b — a)* €*; where U denotes the upper boundary of 
<f> (x); since the function / (x) may be so chosen that its upper boundary 
does not exceed that of <f> (x). 
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Since e is arbitrary, S 1 and S 3 are arbitrarily small; and S 2 differs from 
rb 

{(f> (x)} 2 dx by less than 2e. It now follows that 
r s"g|f 4> (X) dx\ 2 

b r-1 °r ( - (fir) ) 

converges to {</> (x)} 2 dx , as the summation is taken successively over 

J a 

the nets of any system of nets. 

Next, let <f) (x) be unbounded, but everywhere £ 0; and let fiy* (x) 
have the same value as <f) (x), at a point where <f> (x) s N s , and have the 
value N Si at a point where <f> ( x ) > N s . 


Denoting by a m3 the number 


r-m l (r 
r 1 d r (-'(fir) 


t (x) dx j- ; as s has the values 


in an increasing sequence such that N s ~ oo, and m increases indefinitely 
as we proceed through the nets of a given system, the numbers a ms form 
a monotone double sequence. For a sub-division of any net increases the 


Pi , Vi 


value of a ms , as is easily seen from the inequaUty ™< V - + , . 

4 ' ^ 1 ^2 

If either of the repeated limits lim lim a m3 , lim lim a ms exists, then also 

QO .S~ OO ,S 00 }/i~ QO 

the other exists, and the two have the same value (see § 388). 

Now lim a m3 = f {</>n 8 (#)} 2 dx, and lim lim a ms = f {<f> (a;)} 2 dx; 

lit ~ oo Jo .w~ao ///~QO Jo 

hence lim lim a m# = [ {(f> (a:)} 2 dx , 

rn~oo «~ao J a 

and thus the limit, as we proceed through a system of nets, of 


r~m 1 

£ * 
r-1 d r 


I <f> (x) dx 1 , is [ {(f) (a:)} 2 dx. 


Lastly, let <f) (a;) be any summable function whose square is summable, 
and which may have both positive and negative values. It may be ex¬ 
pressed as the difference of two summable functions </> x (x), <f> 2 (, x ), each of 
which is £= 0, for all values of x. 

Since 2 l T f {<j> x (x) -f <f> 2 (^)} 2 dx converges to f {<f> x (x) + <f> 2 (a ;)} 2 dx, 

°r u (fir) J a 

that is to {<£ x (a:)} 2 dx 4- {<f> 2 (x)} 2 dx + 2 I </> x (a;) <f) 2 (x) dx ; and also 

. a - a *' a 

2 i | j* <f> x (x) dx | converges to j {<f> x (x)} 2 dx, 
and 2 * j [ <f> 2 (x) dx 1 converges to P {<^ 2 (x)} 2 dx, 

°r (j (&r) ) 'a 

it follows that 2 ~ f cf 1 (x) dx ( <f> 2 (x) dx converges to f (x) <f> % (x) dx. 

t>r.'(8r) ' (fir) Ja 
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It now appears that 

Z ~ j f {fa (a;) — fa (a:)} 2 dx converges to f {fa ( x) — fa (a;)} 2 dx; 

d r (' (f r) ' a 

and thus the theorem is established generally for every function <f> (x) 
whose square is summable. 

The following theorem, which includes the theorem established above, 
may now be stated: 

If fa (a;), fa (x) be any functions whose squares are summable in the 
interval (a, 6), the sum 

r~rn I r r rb 

Z 5- / fa (x) dx fa (x) dx converges to I fa (x) fa (x) dx , 

r~ 1 °r J Ur) J(&r) J(Z 

as the summation is taken successively over the nets of any system of nets 
fitted on to (a, b). 

To prove this theorem, we observe that fa (a;), fa (x) may be replaced 
by fa + (x) — fa~ (x), fa+ (x) — (f> 2 ~ (x), respectively, where the four func¬ 
tions in these expressions are all non-negative. As the theorem holds for 
each of the integrals 

(b rb 

fa+ (x) fa* (x) dx , fa+ (x) fa~ (x) dx , 

J a J a 

rb rb 

(x) <f> 2 + (x) dx, <£,- (x) <f> 2 - (x) dx, 

J a J a 

it clearly holds also for 

((*) - <Ar (*)} {^ + (*) - <k~ (*)) 

J a 

rb 

that is for fa (x) fa (x) dx. 

a 

451. Let x = g (y), a = g (a), b = g (ft), where g (y) is a continuous, non- 
diminishing function of y, in the interval (a, ft ); also let / (y) = F {g (y)}, 

where F (x) = j <f> (x) dx. 

J a 

The sum employed in the theorem of § 450 then becomes 
r ~£ {f(yr)-f{yr- 1)} 2 
r-1 9 (Vr) ~ 9 (y r -l) ’ 

where the net (y 0 , yj, (y l9 y 2 ), ... (y^, y m ) corresponds to the net 
(x 0 , Xj), (x 2 , x 2 ), ... (x m _ lt x m ). 

It will now be shewn that: 

The necessary and sufficient condition * that j**should exist is 

that , if x = y (y), the function F (x) =/ (y) should be the indefinite integral 
of a function </> (x) whose square is summable in the interval (a, b), of x. 

* This theorem was given by Hahn, see Monatehefte fur Math. u. Physik, vol. xxiii (1912), 
p 172. Another proof was given by Hobson, Proc. Lond. Math. Soc. (2), vol. xvm (1918), p. 258. 




450, 451] 


Moreover 
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]2 f& 

— |{c/> {x)} 2 dx; the Hellinger integral being thus 


and thence we see that 


{< f> {x)Ydx, 


A0 (7/); 


expressed as an L-integral. 

To establish the sufficiency of the condition, let it be assumed that 
F (x) = <f> (x) dx , where {</> (x)} 2 is summable. 

J a 

[ Xr 

We have then F ( x r ) — F (x r _ x ) = <f> (x) dx , and thence we see that 

* Xr-\ 

{F (x T ) - F (* r _ i)} 2 i (x T - x r ^) j {(f> (z)} 2 dx; 

and therefore 'if a {x y t dx 

for any net fitted on to (a, b). It follows that the limit of f exists. 

Ja dg (y) 

Next, let the existence of the Hellinger integral be assumed. It is 

evident, from the definition, that f exists, for any value of y in 

. '« dg(y) J 9 

the interval (a, /?), and that it is a monotone increasing function of y , 

say 0 ( y ). 

We have then, in any interval of y, contained in (a, yS), 

(4/ (y)|* s Al p ( y 

(y) 

and thus 

I F (x r ) - F (x r _,) | s [(x r - x^) {x (x T ) - x (x r _j)} |i, where <f>(y)=x (*)• 
From this, we have 

r - m r «■ m 

S | F (x r ) - F (*,_,) | - x r _j) {x (x r ) - x to-,)}]* 

s [(6 - a) (x (b) - x (a)}]*; 

and thus the function F {x) is of bounded variation in (a, b ); from which 
it follows that F' {x) exists for almost all the values of x. In order to 
shew that F (x) is the indefinite integral of F' (x), it is sufficient to shew 
that the variation of F (x) over any set of points of measure zero vanishes 
(see § 407). Such a set can be enclosed in the intervals (8) of a non-over¬ 
lapping set of which the measure is arbitrarily small. The variation of 
F (x) over this set of intervals is, as above, £ [£S {* (b) - * (a)}]*, where 
£8 is arbitrarily small; hence the variation is zero. Denoting F' (x), at 
every point at which it exists, by <f> (x), we may suppose <f> (#) to have 
the value zero at all points at which F' (x) does not exist. The function 
<f> (x) being summable in (a, 6), the theorem of § 450 is applicable, and it 

follows that | {<f> (a;)} 2 dx exists, and is the limit of the sum 
r ” m [F (x r ) - F (x r _j)] 2 
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f 


2 dx = {F' (a;)} 2 dx , 


for any system of nets fitted on to (a, b). Thus we have 

dg ( y) J <i ( )} 

and accordingly the theorem has been established. 

It is sufficient for the existence of the Hellinger integral 

f dfi ( y) dfi (y) 

Jo dg (y) 

that the functions F 1 (x ), F 2 (x) should be the indefinite integrals of two 
functions (a:), <j> 2 (a:) whose squares are summable in ( a , 6); where F x (a?), 
F 2 (x) are the functions which are equivalent to f x (y),/ 2 (y), and x = g (y). 
In accordance with the second theorem of § 449, we then see that 

\jAx) <f>, = 

452. The theorems of § 450 can be generalized so as to relate to a 
function cf> ( x ), such that {</> (a;)} 1+A is summable in the interval (a, b), 
where A is a positive number. The proofs of the theorems in the extended 
form are essentially similar to those of the original theorems. Thus we 
obtain* the theorems: 

If cf> (x) be a function that is non-negative in the interval (a, b), and such 

that {(f> (a;)} 1 ** is summable in (a, b), where A denotes some positive number , 

then the sum r , m , (f )1+ * 

S sJ <H x )dx\ 
r 1 O r [J (g r) J 

[b 

converges to {</> (a;)} l+A dx, as the sumi is taken successively over the nets of 

■ a 

any system of nets fitted on to (a, b). In case A is a positive integer , <f> (x) 
may be any summable f unction, not necessarily of one sign in (a, b). 
Hellinger’s definition may be extended to apply to the integral 

{* W (y)] l+K 
Jo [dg (y)] A ’ 

defined as the limit, when it exists, of the sum 

r if [/. (y<-) ~/(yr-i)? +x 

r-\ [g (y r ) - g{y r ~i)Y 

In this case: 

f0 \df ( V )11+A 

The necessary and sufficient condition that ~=-should exist i 

Ja f dg(y)f 

that the function F (x) (= / (y)) should be the indefinite integral of a function 
<f> (x) such that [<f> (x)] 1+A is summable in (a, b). Moreover 

m-&**»*■ 

Other generalizations of the Hellinger integral have been made by 
Radonf and by E. H. MooreJ. 

* See Hobson, loc. cit. p. 263. See also F. Riesz, Math. Annalen , vol. lxix (1910), p. 462. 
f See the memoir “Absolut additive Mengenfunktionen,” Wiener Sitzungsber. vol. oxxn*, 
(1913), p. 1361. $ See Hildebrandt, Bull. Amer. Math. Soc. (2), vol. xxiv (1918), p. 198. 
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CHAPTER VIII 

NON-ABSOLUTELY (CONVERGENT INTEGRALS 


453. The definitions of Hamack and Lebesgue have been shewn 
(§ 444) to lead to the same values in the case of absolutely convergent 
integrals when the points of infinite discontinuity of the function form a 
set of measure zero. We proceed to consider the integral of a function 
/ (#) over a linear interval, in accordance with Harnack’s definition, 
when that integral is non-absolutely convergent, that is, when | / (x) | 
is not integrable over (a, b) in accordance with Harnack’s definition. 
A point of (a, b) which is such that / (x) is not summable in any interval 
which contains the point within it, or at an end-point, may be called a 
point of non-summability , or a Hamack point , for the function / (#). It 
is clear that the points of non-summability of / ( x ) form a closed set H , 
for if P be a limiting point of H , any interval that contains P contains 
points of the set, and consequently f (x) is not summable in the in¬ 
terval. The points of H are points of infinite discontinuity of / (x), 
but H is not necessarily identical with the set G , employed in § 443, 
of all the points of infinite discontinuity of f (x). In any interval in¬ 
terior to a contiguous interval of H the function / (x) is summable, 
though it is not necessarily bounded. The definition of § 443 will be 
modified, by employing the set H instead of G ; so that the L-integrals 
employed in the definition are not necessarily L-integrals of a bounded 
function, those points of infinite discontinuity of / {x) which are not points 
of non-summability being interior to the contiguous intervals of H . Thus 
the definition takes the following form: 


If there be a non-dense closed set H , of content zero , of points of L non- 
summability of f (x) in (a, 6), and H be enclosed within intervals of a finite 
set A, such that each interval of A contains at least one point of H, then if 


the L-integral I / (x) dx exists , and is such that , corresponding to an arbi- 

J CU) 

trarily chosen number c, a number which converges with € to zero , exists 


such that 



< c, 


for all such pairs of sets of intervals A, A', provided m (A) < m (A') < 
then the limit of f (x) dx, as m (A) ~ 0 , defines a number which is denoted 

JCf(A) 
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by I f (x) dx, and is called the Harnack-Lebesgue, or HL-integral, of f (x), 
J a 

in (a, 6). 

The i/L-integrals of functions contain as a sub-class the Harnack- 
Riemann, or /LR-integrals, in which the integral of the function over any 
interval interior to a contiguous interval of H is an R-integral. 

It will appear that the condition that each interval of A must contain 
a point of H is an indispensable condition. It has been shewn (§ 444) 
that, in the case of absolutely convergent integrals, the corresponding 
condition relating to G is unnecessary. When this condition is a necessary 
one, the integral has been termed* by E. H. Moore a narrow integral. In 
the contrary case the integral is called a broad integral; and it will appear 
that a broad integral is necessarily absolutely convergent, so that the 
set H does not exist. The L-integral in the definition may be quite general. 

It will now be shewn that, if / (x) have an //L-integral in (a, 6), it has 
an //L-integral in any interval (a', b') contained in (a, b). 

Taking two sets of intervals A, A', as in the definition given above, 
we have, for every value of e, 

] f f u (x) dx - ( f A . (x) dx < (, 

| J a J a 

provided m (A) < £«, m (A') < £ e ; where f A (x) = f (x), in the intervals C (A), 
and/ A (x) = 0, in A; with a corresponding definition for/ A , (x). Assuming 
that this condition is satisfied for every value of c, it will be shewn that 

I [ /a (x) dx - f (x) dx < e, 

j a' •' a' 

provided A, A' satisfy the more stringent conditions 

m( A)<K., m(A')<K«. 

Let it be assumed that, if possible, A and A' can be determined so as to 
satisfy these last conditions, and so that 

I / A (x) dx - [ ( x ) dx z e. 

It will then be shewn that finite sets of intervals A, A' can be determined 
which satisfy the condition that each interval of either set contains at 
least one point of H , and such that 

m(A)<£«, m( A') < £., f f~ A 

• a 

and since this is contrary to the hypothesis made above, the impossibility 
of the assumption will have been demonstrated. 

JFo define A, A' we take each interval of A, in (a', 6'), as an interval 
of A, and each interval of A', in (a\ b'), as an interval of A'. Further, we 
• Trans . Amer. Math. Soc. vol. n (1901), p. 296. 


rb 

(x) dx — fz (x) dx sc; 
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take for the parts of A, A' in (a, a') and (&', 6), the set of those intervals 
which are common to the parts of A and A' that lie in (a, a') and (b' y b). 

In case a is contained in intervals (a, fi), (a\ £') of A, and of A', respec¬ 

tively, we take (a', £), (a\ fi') as intervals of A and A' respectively, where 
a' > a. A similar specification will refer to b f . 

It is now clear that f A (x) = / A (x), and f A -(x) = / A > (x), when x is in 
(a', 6'); and that, in (a, a') or in (b', b), we have f A (x) = (x). It follows 

that 

( /a (*0 dx - f fc (x) dx ) f- (x) dx - [ f A * (x) dx, 

and hence that f A (x) dx — fe (x) dx €. 

I J a .'a 

Moreover it is clear, from the mode of construction of A, A', that 
m (A) < £, and m (A') < 

The impossibility in question has therefore been demonstrated. 

Since, for every pair of numbers a', b\ such that a € a < 6' £ 6, corre¬ 
sponding to an arbitrarily chosen e, the number ££« can be so chosen that 

I [ /a (*) dx - I f A . ( X) dx < f 
I J a' j a' 

for every pair of sets A, A' that enclose the set H narrowly, and such that 

rb' 

m (A) < m (A') < J£,, it follows that / (; x) dx exists. 

. a' 

Moreover, since £, is independent of a' and b\ we have established the 
following theorem: 

rb t rb' 

If f (x) dx exists as an HL-integral, then f (x) dx also exists , where 
J a «' 

a ^ a' < b's b; and the convergence of this integral is uniform with respect 

to a' and b\ 

The last part of the theorem expresses the fact that 
I I f(x)dx - [ f A (x) dx < «r, 

I J a' J a' 

provided m (A) < £,, for every value of a' and £', the number £, depending 
on c- 

454. It will be shewn that: 

For the HL-integral, the theorem 

f / (x) dx = ( / (x) dx + f / (x) dx 

J a J a .c 

is valid . 

This follows from the corresponding theorem for the Z-integral of 
f A (x). For it appears that, employing the last theorem, the expressions 
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on the two sides of the equation differ from one another by less than 2e ; 
and since e is arbitrary, their equality is established. 

Since the existence of the iTX-integral of / (x) in any sub-interval 
(a\ b'), of (a, 6), has been shewn to be a necessary consequence of the 
existence of the HL-integral over (a, 6), it is clear that the integral of 
/ (x) taken over any finite set of non-overlapping intervals, contained in 
(a, b), also exists; being the sum of the integrals taken over the separate 
intervals. However, if a non-finite set of non-overlapping intervals be 
taken in (a, b), it is not in genera] true that the sum of the integrals taken 
over these intervals converges to a definite number, unless the integral 
of / (x) is absolutely convergent, which case has been treated in connection 
with the Z-integrals. It will in fact be shewn, by means of an example, 
that the property in question, that / (#) is integrable over every non-finite 
set of intervals in (a, 6), does not appertain to non-absolutely convergent 
integrals, and must be regarded as peculiar to absolutely convergent 
integrals. This does not however seem to be a sufficient reason for re¬ 
fraining from applying the term “integral” to non-absolutely convergent 
integrals, as defined above. 

455. The following property of an indefinite /iX-integral will be 
established: 

The HL-integral [ / (x) dx is a continuous function of the upper limit x. 

• a 

Denoting the indefinite integral by F (z), we have, in accordance with 
the addition theorem proved in § 454, 

f jc-4- h 

f (x) dx. 

. x 

Employing the function / A (x), which vanishes at all points of the 
intervals of the set A enclosing the set of points H, we have 

rx+h 

J f(x)dx- | / A (x) dx < €, 

provided m (A) < £ c , for every point x , and for all values of h , such that 
x + h is in (a, b). The integral [ / A (x) dx being a continuous function 

. a 

rx+h 

of x , the numerical value of / A (x) dx in the interval (0, h x ), of h, is 

J X 

less than €, if 7q be properly chosen. Hence the numerical value of 
/ (x) dx is less than 2e, if h is in the interval (0, h x ). Since € is arbitrary, 

X 

this shews that F (x) is a continuous function of x, on the right, in the 
interval (a, b). Since F (x) — F (x — h) = / (x) dx , it can be shewn, in 

Jx-h 

a similar manner, that F (x) is continuous at x, on the left. 
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In an interval of the set C (A), f / (:r) dx differs by a constant from an 

J a 

Z-integral, and therefore / (x) dx possesses, almost everywhere in the 

J a 

interval, a differential coefficient equal to / (x). Since the measure of C (A) 
converges to b — a, when A belongs to a sequence {A n }, such that m (A^) 
converges to zero, we have the following theorem, which is the extension 
to the HL- integral, of the property of the Z-integral proved in § 405. 

The indefinite HL-integral ( f (a;) dx has, almost everywhere in (a, 6), 

J a 

a differential coefficient of which the value is f (x). 

THE //Z-INTEGRAL OVER A FINITE SET OF INTERVALS 

456. An extension of the theorem that the integral of / (z) over any 
interval contained in (a, b) necessarily exists, if the HL- integral over (a, b) 
exists, will now be made. 

Let 1) denote a finite set of intervals, each one of which is contained 
in an interval complementary to the set //, and no two of which are con¬ 
tained in any one such interval. 

Using the notation of § 453, as regards A (1) , A (2) , such that rn (A (1 M < |8 e , 
m (A (2) ) < £8 e , it will be shewn that 

/a<’> (x) dx - / / a < 2 > (x) dx < e. 

J (D) J (fJ) 

For, assuming that the expression on the left-hand side is > c, we can 
define A (3) , A (4) as follows: 

The parts of A (3) , A (4) that are interior to C (D), which is a finite set 
of intervals, we take to consist, in each case, of those intervals that are 
common to parts of A (1) and A (2) interior to C (D). If intervals (a, /?) 
(a', /?'), of A (1) and A (2> , contain an end-point a y , of an interval of D, we 
take (a' y /?), («', /?'), when a ' > a , or (a, /?), (a, ft), if a' < a, as intervals 
of A (3) and A (4) respectively. No interval of A (1) or A (2) is interior to an 
interval of D. 

We have now m (A (3 >) < |8 f , m (A (4) ) < and 

/ a <3> (x) = /*(.) (x), / A <« (x) = / A (») (a;), 

where x is in Z>; also, in C (D), we have/ A (*> ( x ) =/ A («> (x)\ it then follows 

that rb rb 

/ A(S ) (x) dx - / A <*> (x) dx £ c, 

J a J a 

which is inconsistent with the conditions that 
m (A (3) ) < jS., m( A (4) )< 

/ A (d (x) dx — I / A (t) (x) dx < €. 

(D) J(D) 


Hence 
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It follows that [ f (x) dx — (* / A (x) dx 

J(D) HD) 


< c, provided m (A) < £8,. 

. (D) I 

It has thus been established that : 

If D denote a finite set of intervals , each of which is in one of the com¬ 
plementary intervals of H, and no two of which are in one and the same such 
complementary interval , then if e is prescribed , there exists a number £8,, 
such that 


I j 

J(D) 


f (x) dx 


HD) 


/a (X) dx 


< € 


for every such set D, and every set A enclosing H narrowly , such that 

m (A) < £8.. 

This is a particular case of a somewhat more general theorem given by 
E. H. Moore, which allows more latitude as regards the set of intervals D. 

Since the function f A (x) is summable, we see that, in accordance with 
the property of summable functions established in § 392, 


I /a (X) dx 

' mi 


I ■' ( D) 


provided m ( D ) is less than some number 8/ which converges to zero with e. 
Also, provided D satisfies the condition of the above theorem, we have 


, ( / (x) dx - [ / A (x) 

I J (D) Hn\ 


dx 


It now follows that 


(D) 


HD) 

f (x) dx I < 2c. 


< €. 


We have now the theorem that : 

If D be a set of intervals , finite in number , each of which is in an interval 
complementary to the set H, of points of non-summability , and no two of 
which are in the same such interval , then 


I (D) 


f (x) dx 


< € 


for all such sets D , provided m (D) < 8/'; where c is arbitrary , and 8/ 
depends upon e, and converges to zero as e does so. 


THE CONDITIONS FOR THE EXISTENCE OF AN #Z/-INTEGRAL 

457. The following theorem, due to E. H. Moore*, contains the 
necessary and sufficient conditions for the existence of the H jk-integral 
of a function / (#), defined in the linear interval (a, 6), in which H is the 
set of points of non-summability of / (x). The content of H is assumed to 
be zero: 

The complementary intervals of II being denoted by (a yf b y ), the necessary 

rb 

and sufficient conditions for the existence of f (x) dx are: 

J a 

* Trans . Amer . Math. Soc. vol. ii (1901), p. 324. 
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(1), that all the integrals | f (x) dx shall exist, each such integral being 

J a v 

rK~ 

defined as the limit of I / (x) dx, when e, e converge independently to the 

J a v +« 

limit zero, and, 


(2), that a -f <o 2 4- + co v shall converge to a definite number, as v is 

indefinitely increased; where <o p denotes the fluctuation of | f (x) dx in the 

. a v 

interval ( a v , b P ). 

Moreover, when the conditions (1) and (2) are satisfied , the sum 


v Cbr 

2 / (x) dx 

r-1 j Or 

is convergent, and its limit, as v ~ co, is I f (x) dx . 

J a 

To shew that the conditions are necessary, we assume that J / (x) dx 

rK 

exists; it then follows from the theorem of § 453, that I / (x) dx exists. 

J a v 

If if, if' he the points of (a„, 6„) at which / (:r) dx attains its maximum 

f f (*) dx. 


and minimum values we have oj„ 


The number p may be chosen so large that S (6„ — af) < 8/'. For 

some values of v we may have if > if', and for others if < if' ; one, or 
both, of these two sets of values of v will be an infinite set. If £co„ is not 

m' 

convergent, a number p' (> p + 1) can be found such that £ co„ > 2 <e. 

V-fL+l 

One at least of the two sets of those values of v between p - f 1 and p' for 
which if > if' , and for which if < if' , must be such that the sum of 
the corresponding values of a> y is > €, and the sum of the corresponding 
intervals | if' — if | is < 8,". We have thus a finite set of intervals D, 
whose sum is < 8f', all contained in complementary intervals of 0, no 
two of which are in the same such interval, and such that 



dx 


> €, 


which is contrary to the last theorem of § 456. 


It follows that £<o„ must converge. 

Further, let (a v , /?„) be interior to (a v , b v ) and such that 

fir a v — ^1 ^ {by af). 
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We may choose m so that E (b v — a v ) > b — a — r), where r\ is positive, 

V “ 1 

and arbitrarily chosen. Then 

v-m / l\ 1 

E ()S„ — a„) > (b — a — rj) ^1 — > b — a — rj — ^ (b — a). 

If we take A to be the set of intervals complementary to ( a v , j 8 „), for 
v = 1 , 2, 3, ... m, we have ra (A) < 77 4- ^ (6 — a) < S e if 17 and & are properly 
chosen. It then follows that 


rb v”°m v 

l f(x)dx - E I / (x) dx 
Ja *~1 J a 


< 6 . 


Also k may be so chosen that SI / (x) dx differs from S 

— 1 K 

by less than e; we have then 

rb v~vi r^v 

f (x) dx — E / (x) dx 

.'a i-l Ja, 


m r^v 

• /(*) 
1 d a„ 


dx 


It now follows that S 


rb v 

I /(*) 


< 2e. 


dx converges to the value of 


[ / (x) dx. 


1 dx 


< 


To prove that the conditions (1) and (2) are sufficient ; let v be such 

00 

that S a>„ < then if ( a v , /3 r ) denote any interval contained in (a„, b„), 
s+i 

we have . * 

S f(x)i 
-v'+l Ja y 

Let the finite set A enclose H narrowly; the complementary set C (A) 
is a finite set of intervals interior to a finite set of the intervals C (H) 
complementary to H. Assume that these intervals C ( H ) are arranged in 
descending order of magnitude. Those of them that are of length > m (A) 
must each contain an interval of G (A), and others may also contain 
intervals of C (A). Let the first s of the intervals (a„, 6„) be each of length 

CD 

> m (A). The convergence of E 
have then 


if 

1J a v 


f (x) dx follows from that of ha> v . We 
fh (X) dx - S f y f(x)dx 

■ a v — 1 J o„ 


v-8 

^ E / (x) dx 

v ~1 J a v 




dx 


+ s 

v~8 +1 | J a. 


f (x) dx 


+ E 


[fin 

| /(*) 


dx 


where n* (> $) has a finite set of values, and (a„, /S„) are the intervals of 
C (A). 

00 

We may suppose s to be so chosen that E a >„ < €, and thus that 

v-S 


E 

n’>8 


fin 

fix) 

J a m 


dx 


< e, 
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A 

/(*> 

J a„ 


dx 


< e. The number s having been so 


and also so that 2 

V - s 4 1 I 

chosen, we may suppose A so chosen that m (A) is less than each of the 
first s intervals of C ( H ). Also m (A) may be chosen, still smaller if neces¬ 
sary, so that 


v~8 rP y 

2 f (x) dx — \ f (x) dx 

1 . a,, j a v 


< e. 


We now have I f f A (x) dx — 2 P f (x) 

l.'rt y~l Ja v 

less than some number depending on c; and since this holds for every 

rb 

value of €, we see that f A (a;) dx must converge to the value of 

s f 6 / (x) dx, 

V* 1 J a v 

rb 

as m (A) — 0; and thus the existence of / (x) dx has been established. 

a 

(b 

It has been here established that, if / (#) dx exists as an /2X-integral, 

J a 

it is necessary that 2 f (x) dx should converge, where ( a„ , b„) denotes 

'a lf 

a contiguous interval of the set H of points of non-summability of / (x), 
it being assumed that m (H) — 0, and / (x) dx being determined as 

• a v 

lim / (x) dx, where (a, p) is interior to (a v , b v ), but it is also necessary 

«~a„ j a 

p-K 

to assume that, a> y denoting the fluctuation of f(x)dx in (a„, b v ), the 

• a v 

series of positive terms 2 co v should be convergent. 

rb 

458. A more general definition of I f (x) dx has been suggested by 

J a 

Lebesgue*, as a development of a definition given by Jordan. This is 
obtained by substituting for the condition that 2o> n should converge, the 

should be convergent. It is 


dx 


< 3c, provided m (A) is 


[bn 

less stringent condition that 2 f (x) dx 

J am 


clear that an I/L-integral is also an integral in accordance with this 
definition of Lebesgue, but the converse is not necessarily the case. It 
can be seen that the substitution of Lebesgue’s definition for that of 
Harnack involves the introduction of a limitation in the selection of the 
sets of intervals A that include the points of //. 


* See his “Remarques sur les theories de la mesure et de 1’integration,” Annates sc. dc Vicolt 
normale (3), vol. xxxv (1918), p. 204. 
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This limitation must be such that, in the proof of sufficiency in E. H. 
Moore’s theorem, the 

that of 2 I f (x) dx 
*-ll Ja v 

In case the sum of the integrals over the intervals (a„, b ¥ ) were not 
absolutely convergent, the sum of the integrals themselves, if convergent, 
would have a sum which would depend upon the order in which the 
intervals are taken, and thus would not afford a suitable definition of the 
integral over (a, b). 


convergence of 2 j f (x) dx can be inferred from 

v — 1 J 


459. A method will now be given of constructing a function f (x) 
which is continuous at every point of the interval (<z, 6), except at the 
point b , at which the function has an infinite discontinuity of such a 

[b 

character that f (x) dx converges non-absolutely. 

J a 

Let a sequence of intervals (a lt b x ), (a 2 , b 2 ), ... (a n , 6 n ), ... be defined 
in the interval (a, 6), such that no two of them overlap, and that b is the 
limiting point of each of the sequences 

(a 1} a 2 , ... a n , ...), (b ly b 2 , ... b ny ...). 

Let «! + «, + ...+«„ + ... denote a non-absolutely convergent arithmetic 
series; that is, the series is to be convergent, but not the series 
I u i | + | u 2 | 4- ... 4- | u n | -f .... 

In (a n , b n ), let / (x) be defined so as to be continuous, and everywhere of 
the same sign, and let / ( x) be zero at a n and b n . Further, let / (x) be so 

rbn 

chosen, in ( a n , b n ), that f (x) dx — u n . At all points of (a, 6), external 

j an 

to all the intervals (a n , 6 n ), let / (x) = 0. 

The function f(x), so defined in (a, b), is continuous, except at the 
point b. 

In (a n ,6 w ), the function | / (x) | has a maximum value greater than 
I |/(6 n — a n ), and therefore f (x) has indefinitely great positive and 
negative values in every neighbourhood of the point 6. 


For, if there existed a positive number k, such that | u n \/(b n — a n ) < k, 

r~n r-n 

for all values of n, we should have 2 | u n | < 1c 2 (6 r — a r ), and thus 

r-1 r-1 

oo 

the series 2 | u n | would be convergent, which is not the case. 

r-1 


We have now 


J! /w ' 


r-» 

■■ z u n + 6u n+1 , 

r-1 


if x lies in the interval (6 n , 6 n+1 ), where 0 & 6 £ 1. 
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Also the integral f f (x) dx is defined as lim [ / (x) dx; and its value 

Ja r~bJa 

r-n 

is therefore lim E u ni which, by hypothesis, has a definite value. 

r - 1 

[b 

It is further clear that ] / (x) | dx is not finite, since the series 

J a 

| u x | + | u 2 | + ... H- | u n | -f ... is not convergent. 


This result may be employed to illustrate the fact that the non-abso- 
lutely convergent integral is not necessarily the limit of the sum of the 
integrals taken over a set of intervals which, in the limit, converges to 
the whole interval of integration; and thus that such an integral is not 
a broad integral. 


Let the integral of / (x) be taken over the intervals (a, b m ), (a Pl , b Pl ), 
(a Pi > ... ( a Pr , b Pr ), where p 1 , p 2 > • • • Pr are increasing integers, all greater 

than m, and such that u Pl , u Pz , ... u Pr are all of the same sign. It is clear 

( bm 

that m may be so chosen that I f (x) dx is arbitrarily near in value to 

J a 



dx; then, for such a value of m, the integers p l7 p 2 , ... p r may be 


so chosen that u Vy -f u v% -f ... -f u Vr is as large as we please, since the 
series l>u n does not converge absolutely. As m is increased indefinitely, 
the set of intervals (a, 6 m ), ( a Vl , b Pi ), ... (a Pr , b Pr ) converges to the whole 
interval (a, b), the complementary part of (a, 6) diminishing indefinitely, 
and yet the sum of the integrals of / ( x) taken over the intervals of the set 
does not converge. 


460 . The construction here given of an //L-integral, with a single 
point of non-summability, may be employed to illustrate the fact that, 
in the definition of such an integral given in § 453, the condition is indis¬ 
pensable that the set A of intervals must be such that there is at least 
one point of non-summability in each interval of A. This fact differen¬ 
tiates the non-absolutely convergent //L-integral from an absolutely con¬ 
vergent integral, in which / (x) dx converges to / (x) dx , for every 

JC(A) ■ a 

set of intervals A, when m (A) ~ 0. It is not in fact true that, in defining 
the #L-integral, the set of points a lt a 2 , ... a n , ... b lt b 2 , ... b n , ... 6, which 
is of content zero, may be excluded by enclosing these points in a set of 
intervals of arbitrarily small sum. 

For we may include all the 2m points a l9 a 2 , ... a my b X) b 2i ... b m which 
occur in (a, b m ) in a finite set of intervals, so that when these are excluded 

fbm 

from the interval (a, b m ), the integral / (x) dx is altered by an arbitrarily 

J a 

small amount. Again, we may shorten each of the intervals (< a Pl , 6^), 
(a p%t b Pi ), ... (a,^, b Pr ) at each end, so that the sum of the integrals of / (x) 
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taken over these intervals is diminished by an arbitrarily small amount. 
All the points a lt a 2 , ... b x , b 2 , ... are now included in intervals of a finite 
set, such that the integral of / (x) over the complementary intervals is 
arbitrarily great. These complementary intervals consist of those intervals 
which have been obtained by shortening the intervals (a Pl , b Pl ), (a P2 , b Pi ),... 
(a Pr , b Pr ), and of the parts of (a, b m ) which remain when the points 

«i, V a 2 > b 2y ... a m , b m 
have been included in a suitable set of intervals. 

Let <f> (x) be a function for which I </> (x) dx exists as an 7/L-integral, 

a 

with the points a 1 , a 2 , ... a n , ... b t , b 2 , ... b n , ... 6, as the set of points of 

infinite discontinuity, in the neighbourhoods of which <j> (x) is not sum- 

fb 

mable. Also the integral / (x) dx exists, as constructed above, with its 

. a 

single point of non-summability at b. It appears however that 

[ {/ (*) + </> (*)} dx 
J a 

does not exist as an HL- integral. The set //, of points of non-summability 
of the function/ (x) -f (f> ( x ), consists of the points 
a i j bn a 2 j b 2 , • • • & n i b n) ... t 
and in general of the point b. 

We may now define a set A, of intervals enclosing the points of H, to 
consist of the finite set defined above for the case of the integral of / (#). 

For a sequence of such sets A, f (x) dx increases indefinitely, as 

•'C(A) 

m (A) ~ 0, whereas <f> (x) dx has a finite limit; thus 
JC(A) 

f {/ (X) + <f> (tf)} dx 

JC(A) 

rb 

increases indefinitely as rn (A) ~ 0, and therefore I {/ (x) -f- <f> (#)} dx does 

J a 

not exist. It will however appear in §§ 472 473 that the integral* exists 
as a Z)-integral. 

It will be proved that: 

If f (x), <f> (x) have HL-integrals in (a, 6), and the sets H lt H Zi of points 
of non-summability of the two functions have no point in common f then 
f (x) -f <f> (x) has an HL-integral in (a, 6), given by 

f {f( x ) + <f> (*)} dx = [ / (x) dx + f <f>{x)dx. 

Ja Ja 'a 

Denoting f (x) + <f> («) by i/j ( x ), it is clear that the set of points of non- 
summability of if* (x) is K ~ H 1 -h H 2 . The two closed sets H lt H z have 
a finite distance between them, and therefore H z must be contained 



dx, and this is 
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within the intervals Z) 2 of a finite set, each interval of which is within 
one of the intervals contiguous to H 1 . The part of H 2 in each such interval 
is closed. If the set K is enclosed narrowly in a finite set of intervals A, 
then, provided m (A) is sufficiently small, A is the sum of a set A x of 
intervals enclosing narrowly the points of II 1 , and of a set A 2 of intervals 
contained in D 2 , enclosing narrowly the points of II 2 . The integrals 

f /a (#) dx, f f Al (x) dx differ from one another by [ f (x) dx , and this is 
J a J a J (a 2 ) 

an 2,-integral which converges to zero with m (A); therefore, in defining 
f b 

f (#) dx, we may employ the set K instead of the set H x . Similarly, it 

j a 

fb 

is seen that, in defining </> (x) dx, we may employ the set K instead of H 2 . 

J a 

Since (a:) dx — f (x) dx -h <j> (#) dx 

J a JC( a) JC( A) 

and since the limits of the integrals on the right-hand side, as m (A) ~ 0, 
are the i/L-integrals of / (x) and <f> (x) respectively, the truth of the 
theorem follows. 


by f / (x) 
j (A,) 


When II 1 and II 2 have points in common, such a point may or may 
not be a point of non-summability of f (#) -f <f> (x). It can be shewn that: 

If f (x), (f> (x) both have HL-integrals in (a, b), and have one and the same 
set II of points of non-summability , then if the set of points of non-summa¬ 
bility of f {x) <f> (#) consists of the points of II with the exception of those in 

a dosed set L contained in H (which set L may be finite or absent), then 
f (x) f (f) (a;) has an HL-integral in (a, b), given by the sum of the HL- 
integrals of f (x) and <j> (x). 

The sets H — L and L both being closed, it follows, as above, that L 
is contained within intervals of a finite set D, each interval of which is 
interior to one of the intervals contiguous to II — L. It can then be 
shewn that, in defining the integral of i/j ( x), the set H may be employed 

rb rb 

instead of the set H — L. For j( x) dx, (x) dx differ from one 

J a -'a 

another by the L-integral of ip (a;) taken over the set A 2 ; where A = A x -f A 2 , 
and A x , A 2 enclose narrowly the points of the sets H — L and L respec¬ 
tively. The result then follows as in the last case, 
fb 

461. If <f> (x)dx is an HL- integral, for the function <f> (x), and the 

. a 

set of points of non-summability of <f> (x) is H, and / (x) is bounded and 
summable in (a, b), it may happen that some or all of the points of H 
are not points of non-summability of the product function / (x) (x). 

This will be the case, for example, if / (a;) has the value zero in a neigh¬ 
bourhood of a point of H. If, at any point £, of H, f (£ -f 0) exists and 
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is numerically greater than some positive number a, we have, provided 
e is sufficiently small, 

£ + « ft+f 

\ f (x) <f> (x) \ dx > a \ I <f> (x) I dx. 

£ •'£ 

It follows that if £ is a point of non-summability on the right, of <f> (#), 
then it is also a point of non-summability of / (#) <f> (x). A similar remark 
applies to the left of the point £. 

The following property* of an ////-integral will now be established : 

rb 

If <f> (x) dx is an HL-integral, the set of points of non-summability of 

• a 

<f) (x) being H , and if f (x) be a function with bounded variation in (a, b), 
such that f (x) <f> (x) is, non-summable at every point of //, with the possible 
exception of those belonging to a closed set K , of points of H, then 

[ f (x) <f> (x) dx 
J a 

exists as an HL-integral. The set K may be finite , or absent. 

The condition in the theorem is certainly satisfied, in particular, if 
there are only a finite number of points of H at which / (x) is continuous 
and has the value zero. An important example of the theorem is that, 
if <f> (x) has an ////-integral in (a, 6), then <f> {x) cos Ax , <f> (x) sin Ax have 
///^-integrals in the interval; A denoting any constant. For the only points 
of H which may not be points of infinite discontinuity of the product 
functions are the zeros of cos Ax , or of sin Ax. 

First, let it be assumed that all the points of H are points of non- 
summability of f (x) <f> (#). Let A, A' be two sets of intervals enclosing 
narrowly the points of H, and such that 

| j <f> A (x)dx-J <f> A .(x)dx <e, 

where m (A) < £,, m (A') < for all points x in (a, /). 

Let F (#) = (x) — ( x ), and suppose that / (x) = f x (x) — / 2 (#), 

where/* (x),f 2 (#) are monotone bounded functions. 

We have 

[ F (x)f x (z) dx = /* (a) f ( F (x) dx + /* (b) f F (x) dx , 

.'a Ja J£ 

with a similar equation for f 2 (x)\ £ is in each case some point in (a, 6). 
From this we have 

[ <£a (x)f(x) dx - f (x)f(x) dx 

J a. J a 

< 2e{|/x (a) | + |/, (b) | + | (a) | + |/ 2 (b) |}. 

* See Hobson, Proc. Lond. Math. 8oc. (2), vol. vn (1909), p. 22. 
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Denoting the number on the right-hand side by e', we see that, since 
£« converges to zero with e', the condition for the existence of 

f f (*) <f> (x) dx 

J a 

as an ///-integral is satisfied. 

In case there exists a closed set K, of points of //, which are not points 
of non-summability of f (x) </> (x), we see, exactly as at the end of § 460, 
that iri defining the ////-integral of / (x) (x) we may employ the set //, 

instead of the closed set H — K, of points of non-summability of the 
function. Thus the theorem is at once extended to this case. If H and K 
are identical, the ///-integral becomes an //-integral. 

THE SECOND MEAN VALUE THEOREM FOR AN ///-INTEGRAL 

462. With a view to the extension of the second mean value theorem 
given in § 422 to the case of ////-integrals, it will be first proved that: 

If f (x) has an H L-integral, and f (.r) is bounded and monotone in the 
interval (a, b ), then f (#) (x) has an HL-integral, or an L-integral, in the 

same interval. 

Let II and 77 be the sets of points of non-summability of the functions 
c f > (x),f ( x) f ($) respectively. Any point x, of //, at which neither/ (x + 0) 
nor f (x - 0) has the value zero is also a point of H . If there be no line 
of invariability of / (x) for which the function / (.r) has the value zero, 
there may be a single point c, in (a, b ), at which / (c f 0) or/ (c — 0) has 
the value zero, or at which both of them are zero. If c be a point of II 
it is not necessarily a point of //, and in that case it is an isolated point 
of H. In accordance with the theorem of § 461, f (x) <f> (x) has an IIL- 
integral in (a, 6), the set K consisting of the single point c. In defining 
the integral of / (x) <j> (x), the set H may be employed instead of H. In 
case there is a line of invariability (a, /3) contained in (a, 6), within which 
/ (x) = 0, any points of H within (a, fl) do not belong to H. The end¬ 
point a, or the end-point /?, in case it is a point of H , may, or may not, 
belong to 77. In the interval (a, /?), f (x) </> (x) has an L-integral, and in 
the interval (a, a), or (/?, 6) it has an H //-integral, or an //-integral, ac¬ 
cording as the interval does or does not contain points of II. In any case 
it follows that / (x) <j> (x) has an HL-integral (or an //-integral if all the 
points of H are within (a, jS)) in the interval (a, b)\ and in defining the 
integral the set H may be employed. 

It will now be proved that: 

The second mean value theorem 

(* / (a;) <f> (x) dx = A f *<f> (x) dx 4- B f <£ (#) dx , 

J a -'a J X. 
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as given in § 422, holds good when <f> (x) has an HL-integral in (a, b), the 
function f (x) being bounded and monotone in the same interval. 

rb 

In accordance with the last theorem, f (x) <f> {x) dx exists as an 

• a 

HL-integral (or as an jL-integral), and in its definition the set H , of points 
of non-summability of <f> (x), may be employed, whether or not all the 
points of H are points of non-summability of / (z) cf> (x). 

We have 


f f (z) <f>* (#) dx — A l A </> A (a:) dx -f B f <f> A (x) dx, 

J a J a J X A 

where A is a set of intervals enclosing narrowly the set H, and X A is 
some point in (a, b), dependent on A, A, B. 

[X A rX A rb rb 

Now <j> A (x) dx — I (x) dx, I <f> A (#) dx — cf> (#) dx 

Ja J a Jx A Jx A 

are both numerically less than an arbitrarily chosen positive number e, 
provided m (A) is sufficiently small; this follows from the uniform con¬ 
vergence of [ <f> A (x) dx to I <f> (x) dx. Also j f (x) (f> A (x) dx differs from 
J a J a J a 


rb 

I f (x) (f> (#) dx by less than e, if m (A) is sufficiently small. Hence we 


have 


f f (x) (f> {x) dx — A f A <j> (x) dx + B f cf> (x) dx 
J a J a X a 


f t). 


where | 77 | is arbitrarily small. From the continuity of 


I A <f> (*), I <f> (z) dx, 

Ja JX A 

we see, by reasoning similar to that in § 423, that 

f f (x) <f> (x) dx ~ A [ </> (x) dx 4 -B [ <f> (x) dx. 

J a J a J X 

Bonnet’s form of the mean value theorem may be deduced as in § 423. 

It should be observed that the only case in which the monotone 
function/ ( x ) is such that the condition that all the points of II, with the 
possible exception of the points of a closed set contained in H, are points 
of non-summability of / (x) </> (x) may not be satisfied is when / (x) is zero 
in the interior of an interval (a, j8) contained in (a, b). For example, if ft 
is a limiting point of II on both sides, and / (/3) = 0, f (P -{- 0) — 1, the 
point p is a point of non-summability of / (x) </> (x), but the points of H 
on the left of ft are points of summability of / (x) <f> (x). Thus, in this case, 
the set of points of H which are points of summability of / (x) <f> (x) is not 
a closed set; and thus the condition in the theorem of § 461 is not satisfied. 
In such a case, by changing / (x) into / (x) -f c, we have 
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the integral on the left-hand side existing as an #L-integral. We could 

rb rb 

only infer the existence of J / (x) </> (x) dx , by subtracting cxf> (x) dx 

• a J a 

from both sides, and employing the theorem of § 454. 


INTEGRATION BV TARTS FOR THE HARNACK-LEBESGUE INTEGRAL 


463 . The formula for integrating by parts, given in § 420, may be 
extended 5 *' to apply to the ease in which one of the integrals, f U (x) dx, 

■ a 

which may be denoted by u ( x ), is an //L-integral; and the other I V (x) dx, 

J a 

or v (x), is an L-integral. 

Let u n (x) — I U n (x) dx, where U n (x) — U (x), at every point not be- 

- a 

longing to a finite set of intervals A, enclosing the points H, of non- 
summability of U (a:); and U n (x) — 0, in A. 

Since u n and u n v are L-integrals, we have, since ^ exist almost 
everywhere (see § 455), 


“ J a v Yx dx + j a u " 


dx. 


As n . ~ oo, so that m (A) ^ 0, we have [u n vf a ~ [u 


Also 


•b 


dv 7 [ h dv 7 
u , dx — u n , dx 
dx J tl dx 


f" 

dv 

<e 

•' a 

dx 


dx, 


because, for sufficiently large values of n, we have | u (x) — u n (#) | < e, 
whatever value x may have, in accordance with the theorem of § 453. 
It follows that 

f b dv 


lim 

7J ~ to J a 


[ b dv , [ b 

) a U «d X dx= \„ 


dv 7 
u . dx, 
dx 


and hence, utilizing the above equation, we see that lim 

U— CO 

and is equal to [ uv — j* u dx. 


v ~~ n dx exists 
dx 


Thus the equation 


I V (x) j I U (x) dx | dx = [ U (x) dx f V (x) dx — I U (x) j I V (x) 

J a ( J a j -'a -a J a [J a 

holds good, when one of the two indefinite integrals 

[ U ( x) dx, f V (x) dx, 

exists only as an f/X-integral, the other being an L-integral. 


dx 1 dx 


W. H. Young, Proc. Lond. Math. Soc. ser. 2, vol. ix, p. 432. 
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THE DENJOY INTEGRAL 

464. In the definition of the HL- integral, even if it be modified in the 
manner proposed by Lebesgue (§ 458), it is assumed that the set H, of 
points of non-summability of the function, has content zero. A definition 
has been introduced* by Denjoy, which is applicable when the set H is 
not so restricted, but is capable of being any non-dense closed set. The 
process of passing from a given function / (#), defined in the linear interval 
(a, b), to a function F ( x ), which has a relation to / (x) similar to that of 
the indefinite L- integral of a summable function to the function itself, 
has been termed by Denjoy totalization of / ( x ); it will however be here 
spoken of as integration (D), and the function F (x) will be spoken of as 
the indefinite D-integral off (x), whenever it exists. Thus, in any interval 

ffi 

(a, /?), of (a, b), the D-integral off (x) mil be denoted by / (x) dx, or by 
F (p) — F («), or also by V (a, P). 

The definition of the Denjoy Integra], or D-integral, in the most general 
case, can only be given by indicating the steps of a gradual process by 
which, commencing with the employment of L-integrals, a function is 
obtained which satisfies certain postulations which are of the nature of 
definitions. These may be stated as follows: 

Having given a measurable function / (.r), defined for the interval 
(a, b); 

(1) . In any interval (a, p), contained in (a, 6), in which f (x) is integrable 
(L), V (a, P) is taken to be the L-integral f {x) dx. 

J a 

(2) . For a finite set of intervals (a x , a 2 ), (« 2 , « 3 ), ... (a w _ l5 a n ), each one of 
which abuts on the next , and for which V (a x , a 2 ), V (a 2 , a 3 ), ... V (a n _ lf a n ) 

r -1 

have been defined , V ( a Jf a n ) is defined as D V (a r , a r+1 ). 

r -1 

(3) . If (a, P) be any interval in (a, 6), and V (a\ P') has been defined 
for every interval (a', p') interior to (a, P), then V (a, P) is taken to be the 
limit of V (a', /?'), when a' ~ a, p' ~ p, independently of one another , it being 
assumed that this limit exists . 

(4) . Let P be a perfect set of points in an interval (a, p) contained in 
(a, b) y and assume that f (x) is summable over the set P , and suppose moreover 
that V (a', P') has been defined for every interval (a', p'), of (a, P ), which 
contains no point of P as an interior point. Let (a n , P n ), where n — 1,2,3,..., 

* Comptes Bendas. Paris, vol. cijv (1912), p. 859 and p. 1075. See also Lusin, ibid. vol. xlv, 
p. 474. Denjoy’s detailed investigations are given in four memoirs referred to on p. 715. See 
also a thesis by Nalli, Esposizione e confronto critico delle diverse definizioni di unafunzione limitata 
o no, Palermo, 1914. For an extension of Denjoy’s method of integration to the case of the 
derivatives of functions of two or more variables, see Looman, Fundamenta Math. vol. iv (1923), 
p. 246. 
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denote the intervals of (a, B) that are contiguous to P, and let W (a „, /8„) 

denote the upper limit of | V (a', B') |, for all intervals (a', /S') contained in 

00 

(«„, ft n ). Let it be assumed that the series 2 W (a n , ft n ) is convergent. Then 

n - T 

V (a, ft) is defined by 

V (a, P) =- £ V (a n , p n ) + f / (*) dx. 

n-1 (P) 

It is clear that IT {a n , ft n ) is the fluctuation of V (a n , x) in the interval 

It will be shewn that the D-integral of / (x), in (a, b ), exists, or in 
accordance with the expression employed by Denjoy, that/ (x) is totaliz- 
able in (a, b), provided / (x) satisfies the following conditions: 

I. For every perfect set P, in ( a, b), the set of those points of P which are 
points of non-summability of f [x), with respect to P, is non-dense in P. 

In particular, if P consists of all the points of (a, b), the set H , of 
points of non-summability of f (x), with respect to the interval (a, b), is 
a non-dense set. 

II. If ( a, ft) be any interval in (a, 6), and V (a', ft') has been calculated 
for every interval (a\ ft'), interior to (a, ft), then V {a', ft') tends to a definite 
limit , as a', ft' converge independently to a, ft respectively. 

When this condition is satisfied V (a, ft) has been defined, in (3) above, 
to be the value of this limit; thus 

I f (x) dx - lim f f (x) dx. 

J a a'~a,0'~/3 .’a' 

III. For every perfect set P, if V (a n , ft n ) has been calculated for every 
interval ( a n , ft n ), contiguous to P, the set of points of P which are not points 

CO 

of convergence of 2 W ( a n , ft n ) is non-dense in P. 

n-l 

no 

The series 2 W ( a n , ft n ) is said to be convergent in an interval ( aft') 

n = 1 

if the series consisting of those terms for which (a n , ft n ) is interior to (a', ft') 
is convergent. The series is said to be convergent at a point p if p is interior 
to some interval ( a', ft') in which the series is convergent. 

If every point of a perfect set P is such as to be a point, of convergence 
of 2 IT (a n , ft n ), then the series 2 W (a n , ft n ), taken for all the intervals 

n-l n-1 

contiguous to P, is convergent. 

For, let a system of nets be fitted on to the interval (a, b), in which 
P is contained; then if 2 IT (a n , ft n ) is divergent, there must be at least 

n-l 

one mesh d t , of the netDj, in which 2 IT is divergent. Of the nets of D 2 , 

n-l 

that are contained in d l9 there must be one at least in which 2 IT is 

n-l 
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divergent; let ft be that net, or that of lowest rank if there is more than 
one. Proceeding in this manner, we obtain a sequence of meshes ft, ft, ..., 
defining a point Q, in all of them, such that, in each of these meshes, the 

series 2 W is divergent. This point Q is one of divergence of the series, 
»-1 

and it must clearly belong to the set P, which by hypothesis contains no 
such point. Therefore the series 2 W (a n9 p n ) 9 taken for all the intervals 

n-1 

contiguous to P, is convergent. 

It is clear that those points of a perfect set P which are points of 
divergence of 2 W (a n , ft) form a closed set. For, in any neighbourhood 
of a limiting point of the set, there are points of the set; from which the 
result follows. 

465. It will be shewn that, when the conditions given in § 464 are 
satisfied, the XMntegral of f (x) can be calculated by means of an enumer¬ 
able set of .L-integrals, of passages to the limit, and of summations in a 
certain order. 

Every point of (a, b) which does not belong to //, the set of points of 
non-summability of/ (#) with respect to the interval (a, 6), is interior to 
a contiguous interval of 77. Since/ (x) has an Z-integral in any interval 
( a ', ft'), interior to (a, ft), one of the intervals contiguous to I/, V (a , ft) 
can be determined as lim V (a', ft), in accordance with the condition II, 

a / <—a, /3'~/3 

of § 464. Thus the 71-integral is determined for each interval contiguous 
to H. Now H is the sum of a perfect set P l9 the nucleus of //, and an 
enumerable set M x \ we proceed to calculate V (a, ft) in the intervals (a, ft 
that are contiguous to P x . 

The part of in such an interval ( a , ft) is reducible, and therefore 
has a derivative of order y, some number of the first, or of the second, 
class, which contains no points, whereas the derivatives of lower order 
than y all exist. If an interval («', ft'), in (a, ft), contains no point of M l9 
the ZMntegral in (a, ft') is an L-integral; and by condition II the integral 
can be calculated for (a, ft), when a', f3' are points of M l9 whereas no 
interior points belong to M x . If (a', ft') contains a finite set of points of 
M lf the /^-integral, for (a\ £'), can then be calculated by means of the 
definition (2) of § 464. Next, let (a j9 p x ) be an interval contiguous to the 
first derivative M X9 of ; then in an interval (a/, /?/) interior to (cq, p x ) 
there is only a finite set of points of hence, as before, the D-integral 
in (a l9 ft) can be calculated. The integrals in all the intervals contiguous 
to M x having thus been determined, by proceeding as before, the integrals 
in all the intervals contiguous to M x " may be determined. Generally, 
let y be a finite number, or a transfinite number of the first species. As 
before, if the D-integral has been calculated for every interval contiguous 
to it can be calculated for every interval contiguous to If 
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y be a transfinite number of the second species, let us suppose that the 
D-integral is known for every interval contiguous to for all values 

of y f < y. In any interval (a/, j3 y ') which is interior to an interval (a yi fi y ) 
contiguous to M r {y) , from and after some value of y\ all the sets M x (y,) 
have no points in (a y , fi y )\ hence the 71-integral can be calculated 
in (a y \ fi y ), and consequently, by employing the definition (3), it can be 
determined in ( a y , /3 y ). It has thus been shewn that the /P integral can be 
calculated for every interval (a, ft), contiguous to M v iy \ where y is any 
number of the first, or of the second, class. Since M 1 {y) = 0 , for some 
number 7 , it follows that the /^-integral can be calculated for any interval 
contiguous to P lt by means of an enumerable set of additions, and of 
passages to the limit, in which the //-integrals over intervals in which 
f (x) is summable are employed. 

Let us suppose this process to have been carried out for each interval 
contiguous to P x , the nucleus of H. In case f (x) is summable relative to 
Pi, and if every point of P, is a point at which the series H W (a „, ft,) is 
convergent, we have 

f f C) <1* - f / (*) dx £ f / (x) dx. 

Ja J (r 1 ) n I J a„ 

When P x does not satisfy these conditions we proceed to analyse the set 
Pj. It should be observed that tlu^ /I-integral is determined over any 
interval (a, f3) such that p l9 the part of P x contained within it, satisfies 
the conditions that / (x) is summable over p r , and that, each point of p x 
is a point at which the series H W (a n , fi n ) is convergent. 

In accordance with the conditions 1 and III, assumed to be satisfied, 
the set // 2 , of points of P x which are either points of non-summability of 
/ (x) with respect to P x , or are points of non-convergence of SIF ( a n , j3„), 
is non-dense in P x ; moreover the set H 2 is closed. Let H 2 — P 2 + 71/ 2 , 
where P 2 is a perfect set, the nucleus of H 2 , and M 2 is an enumerable set. 
It will be shewn that the 21-integral can be calculated, first, in every 
interval having no interior points that belong to // 2 , and secondly, for 
every interval contiguous to P 2 . 

Consider an interval 8 , which contains within it, and at its ends, no 
point of H 2 . We need only consider the case when 8 contains, within it, 
points of P 1 . The interval 8 can be decomposed into a portion p l9 of P x , 
and a set of intervals, all contiguous to p x , except possibly two of them 
which are semi-contiguous to p x . The function / (x) is summable in p x , 
and it has therefore an L-inte^gral over p x . The series consisting of those 
terms of Elf (a n ,fi n ) that refer to intervals contained in 8 is convergent. 
The D-integral over 8 is therefore the sum of the P-integrals in the intervals 
contiguous and semi-contiguous to p x , together with the L-integral over 
the set p l9 in accordance with definition (4). By passing to the limit, the 



696 Non-absolutely Convergent Integrals [ch.viii 

D-integral is obtained over any interval which has no point of H % in its 
interior, but of which one extremity, or both extremities, belong to H 2 . 
By means of an enumerable set of passages to the limit, the D-integral is 
thus determined over any interval in which H 2 is reducible, as in the former 
case. Thus the D-integral is determined for any interval contiguous to P 2 . 

From P 2 , we proceed, as before, to define i/ 3 , which is composed of a 
perfect set P 3 , the nucleus of H s , and an enumerable set M z . This set H z 
consists of the points of non-summability of f (x) with respect to P 2) and 
of points of divergence of 2>W. 

In general, we obtain a set H y — P y -f- M y , where y is a number of the 
first, or of the second, class, and where it is assumed that H r and P r are 
known for all numbers y', less than y. In case y is of the first species, we 
suppose the D-integral of / (z) to have been already determined in every 
interval contiguous to P [y - 13 ; the process of determining the D-integral 
over every interval contiguous to P y is the same as in the special case 
y = 2 , which has been considered above. If y is a number of the second 
species, P y is the set of points common to all the sets P Yi where y' has 
every value < y. If none of the sets P r is devoid of points, the set 
H y certainly exists, but P y does not exist in case H y is enumerable. The set 
P y , when it exists, is non-dense in each of the sets P y , . 

It will be shewn that the D-integral can be determined over every 
interval contiguous to P y , it being assumed that the corresponding deter¬ 
mination has been made for all the sets P r , where y < y. 

First, let y be a number of the first species; and let 8 be an interval 
which contains, as interior point, or as an end-point, no point of H y . If 
p [y _i] be the portion of P[ y -i] in 8 , f(x) is summable in P[ y _i], and those of 
the D-integrals which are taken over intervals contiguous to p[ y -\] form 
an absolutely convergent series. We calculate the D-integral of / (x) over 
Pl-y-i], and add to it the sum of the D-integrals over the intervals that are 
contiguous or semi-contiguous to p [y _u; the result is the D-integral over 8. 
This D-integral is now determined over every interval which contains 
within it, or at its ends, no point of H y . By an enumerable set of passages 
to the limit the D-integrals are now determined in every interval in which 
H y is reducible, that is, in every interval of which no interior point belongs 
to P y . 

Next, let y be a number of the second species, and let 8 have the same 
meaning as before. The sets P Y (y' < y) have no points in 8 , and therefore 
the D-integral can be calculated over 8 . By a passage to the limit, the 
D-integral can be determined over any interval which has points of H y 
only at one, or both, of the end-points. Hence, by an enumerable set of 
additions and passages to the limit, the D-integral is calculated over every 
interval contiguous to P y . 
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All the integrals are obtained by an enumerable set of operations of 
the types laid down at the beginning of the discussion. Since P y is non- 
dense in all the sets Py, where y < y, there exists a number 7 , for which 
Py does not exist (see § 82). Hence, by an enumerable set of operations, 
the P-integral of / ( x ) over the interval (a, b) can be calculated, since, in 
the interval (a, 6 ), there are no points of P y . 

It is clear that the D-integral over (a, x) can be determined in the same 
manner, where x is any point in (a, b). 

466. In the construction of the D-integral, given in § 465, the con¬ 
ditions I and III have been employed only in relation to the particular 
perfect sets P 1 ,P 2 , ..., whereas the function / (x)> when integrable ( D) 
in (a, b ), has been required to be such that these conditions are satisfied 
in relation to every perfect set P. It will however be shewn that the 
ZMntegral constructed by the method indicated in § 465 actually satisfies 
the conditions I and III, of § 464, in relation to any perfect set P. 

In the first place it may be observed that, if / ( x ) be summable in a 
sub-interval, and Q be any closed set in that sub-interval, there are no 
points of Q which are points of non-summability with respect to Q, and 
also no points at which the scries, of which the terms are the fluctuations 

of f f (#) dx in the intervals contiguous to Q, is not convergent. If P be 

J a 

any perfect set in (a, b), let K be the set of those points of P such that, 
at any one of them, either f (x) is not summable with respect to P, or is 
not a point of convergence of EJF n , taken for the intervals contiguous 
to P. Unless K is non-dense in P, there is some interval in which K and P 
coincide. Let P (1) be such a portion of P; then P (1) must be contained 
in //. For, otherwise, a part of P <1) would be contained in an interval 
interior to an interval contiguous to H\ and this is not possible, since 
/ (x) is summable in such an interval. Since the perfect set P (1) is con¬ 
tained in H , it must be contained in P 1 . Similarly, if a portion of P (1) 
were contained in an interval contiguous to // 2 , since P (1) is contained 
in P 1? that portion of P (1) would be a part of P x over which / {x) is sum¬ 
mable, and such that every point would be one of convergence of 
over the intervals contiguous to the part; and this is not possible. Hence 
P (1) is contained in i/ 2 , and therefore in P 2 . Proceeding in this manner, 
it can be shewn that P (1> is contained in all the sets P 3 , P 4 , ... P«. But 
P a , for some value of a, contains no points, and therefore the set P (1> 
cannot exist. It has thus been shewn that K is non-dense in P. Therefore 
the conditions I and III are satisfied for any perfect set P. 

In the construction given in § 465, the conditions in (3) and II have 
only been employed when the limiting interval ( a , £) is a contiguous 
interval of one of the sets H , H 2) ... Hy . But, as the condition is to be 
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applicable to any interval (a, /?), it is necessary to shew that the function 
V, constructed by the process given in § 465, actually satisfies this con¬ 
dition for every interval (a, f3), before the existence of the L-integral can 
be regarded as completely established. The condition amounts to a 

postulation that the integral f j'(x)dx, or F (x), shall be continuous in 

J a 

the whole interval (a, b). For the continuity of F (x) follows from the 
definition (3), and the corresponding assumption II. If x l9 x 2 , ... x ni ... 
be a sequence of points interior to (a, x), and converging to x , 
lim V (a, x n ) — V (a, x)\ 

n~oo 

and thus F (a:) is continuous on the left. Similarly, by considering V ( x , b), 
it may be shewn to be continuous on the right, and therefore V (a, :r), 
which, by definition ( 2 ), is V (a, b) — V (x, b), is continuous on the right. 
Thus the /^-integral, V ( a , x), when it exists, must be continuous in the 
interval (a, b). 


To prove that the function constructed in § 465 satisfies this condition, 
we observe, in the first place, that V (a, x) is certainly continuous at any 
point that does not belong to H. For such a point is interior to an interval 
in which / (a;) is summable; hence V (a, x + h) — V (a, x) is the L-integral 

rx+h 

I f ( x ) dx, which converges to zero, as h ~ 0 . Next, let G be a perfect 

■ X 

set of points, contained in an interval (a, fi), and such that / ( x ) is sum¬ 
mable over G , and also such that every point of G is a point of convergence 
of the series STF, when the summation is taken for the intervals, contained 
in (a, fl), which are contiguous to G. We have then. 


V (a, x 4- h) — V («, x) — 


f / (x) dx + 2 I / (x) 
J (a) J a « 


dx 


where ( a n , fi n ) is contiguous to G. As 


(O) 


f (x) dx 


x hh 


converges to 


zero, on account of a known property of the L-integral. In case x and 

~) x+h 

f (x) dx 


x f h are both points of G 


r F n 
• 2 - 
L J a n 


also converges to zero, as h 


does so. But if x -f h is not a point of G, it is within some interval ( a rn , fi rn ) 

f0m 

contiguous to G, and we have to consider a part / (x) dx. We have 

Jx + h 

rx+h 

I f (x) dx -£ W m , and therefore, in virtue of the hypothesis made 
J am 


above, this part makes no difference in the convergence of 



dx 


jM h 
x 


to zero, when convergence takes place on account of the assumption that 
a: is a point of convergence of SF. It has thus been shewn that 

V (a, x -f h) — V ( a , x) 
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converges to zero, as h converges to zero through any sequence of values; 
therefore V (a, x) is continuous in (a, ft). 

If G be identified successively with the perfect sets in the intervals 
contiguous to P l9 P 2 , ..., the perfect sets employed in § 465, in the con¬ 
struction of V (a, x), we see that it is sufficient for the continuity of that 
integral that the condition lim W n — 0 should be satisfied for all the 

n~ oo 

perfect sets contained in the intervals contiguous to P l9 P 29 .... The 

continuity of | f (x) dx in the whole interval (a, b) is thus established. 

J a 

THE FUNDAMENTAL THEOREM OF THE INTEGRAL CALCITLUS FOR THE 
DENJOY INTEGRAL 

467. With a view to proving the fundamental theorem that the in¬ 
definite Denjoy integral has a finite differentia] coefficient, equal to the 
integrand, almost everywhere in the interval of integration, a property of 
certain types of sets of intervals will be investigated. 

It has been shewn, in § 71, that a set A of open intervals all contained 
in the given interval (a, b ), can be replaced by a non-overlapping set of 
open intervals A, the two sets A, A containing the same open set of points. 

The set of open intervals A is said to be a* restricted, set of intervals , if 
no interval of A can be removed without altering the equivalent set A. 
This is the same as the condition that each interval of A contains a point 
that is not in any of the other intervals of A. 

It is easily seen that, if three open intervals 8 X , 8^, 8 3 are such that a 
point P belongs to all of them, one at least of the intervals is such that 
every point of it belongs to one of the other two. Accordingly, that interval 
rnay be removed without affecting the equivalent set of points. 

It follows at once that, if A be a finite set of intervals, and is a restricted 
set, any point P is contained in at most two of the intervals of A. Tor 
such a set of intervals, the sum Sm (8), of the measures of all the intervals, 
cannot exceed 2 m (A), or twice the measure of the equivalent set of open 
intervals. 

A set of open intervals A is said to be complete , if it satisfies the con¬ 
dition that, when (a lf ft), (a 2 , ft), ... (a n , ft), ... is any sequence of in¬ 
tervals all belonging to A, and such that cq, a 2 , ... a n , ... converges to a 
point a u , and ft, ft, ... ft, ... converges to a point ft, then (« w , ft) is an 
interval of A. In case (a w , ft) is not necessarily an interval of A, but is 
contained in such an interval (a, ft), the set is said to be semi-complete. 

If the set A is complete (or semi-complete), a part A x , of A, exists 
which is equivalent to the same non-overlapping set A, as A, and such 

♦ See Denjoy, Journal de Math. (7), vol. I (1915), p. 223, where the theory here given is 
developed. 
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that it is a restricted set. For, considering the closed set C (A), the com¬ 
plement of A, let (a, p) be one of its contiguous intervals. Any point P, 
of the open interval (a, /?), is interior to an interval 8 , of A. 

If there exist intervals (a, a'), ft) belonging to A, the closed interval 
(a', ft') can be covered by a finite set of intervals of A, in accordance with 
the Heine-Borel theorem. This finite set, together with (a, a') and (ft', ft), 
forms a part of A which covers the open interval (a, ft), and this set can 
then be reduced to a restricted set, by suppressing, if necessary, some of 
the intervals. 

If no such intervals as (a, a '), (/?', ft) exist, we choose a set of points 
(... P- w > ••• P_i, P 0 , Pj, ... P„, ...), contained in.(a, ft), such that a is the 
limiting point of the sequence (... P_ n , ... P ±, P 0 ), and that ft is the 
limiting point of the sequence (P 0 , P t , ... P n , ...); any point P mA1 being 
taken to be on the right of P w , whether m be positive or negative.. 

If the points ... P_„, ... P_ r , P 0 are contained respectively in intervals 
... 8_ n , ... 8_j,8 0 , of A, the measure of 8__ n must converge to zero, as 
n ~ co, for otherwise their right-hand end-points would have a limiting 
point a" (> a) and (a, a") would be an interval of A, if the set A is 
restricted, or would be contained in an interval of A, in case the set is 
semi-restricted; and this is contrary to the hypothesis that no interval 
of A has its left-hand end-point at a. A similar statement holds good as 
regards ft. 

Every point of the closed interval P,„P wfl , where m is positive or 
negative, is a point of a finite set A m , of intervals of A. Let 2 denote the 
enumerable set of intervals obtained by adding the sets A m , for all values 
of m. It will be shewn that, if (A, B) is any interval, such that A > a, 
B < p, the interval (A, B) has points in common with only a finite part 
of the set 2. Let 2 (A, B) denote the set of those intervals of 2 which 
contain at least one point of (A, B). Let the intervals of 2 be denoted by 
2 1? 2 2 , ... 2 t , ..., then, if 2 (A, B) is not a finite set, it contains intervals 
2 with indices which increase indefinitely. There must then be in 2 (A, B) 
intervals of A m , for values of m that increase indefinitely, and are all 
positive, or else are all negative. In the former case their right-hand end¬ 
points will converge to p , and in the latter case their left-hand end-points 
will converge to a. But it has been shewn that the measure of such an 
interval, in either case, converges to zero, and thus an infinite set of such 
intervals cannot all contain points of (A, B). It has thus been shewn that 
the set 2 (A, B) is a finite component of the set 2. 

The case in which one of the intervals ( a , a'), (P', p) exists, but not 
the other, can be reduced to the preceding cases. 

The set 2 can be reduced to a restricted set 2, by suppressing some of 
the intervals, without altering the set of interior points. To prove this, 
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let q be the smallest value of i such that 2 tJ contains a point that does 
not belong to 2 tl+fr , for k — 1 , 2, 3, .... If P l is any point within (a, ft), 
it is interior to only a finite number of the intervals 2 t ; let N 1 be the 
greatest index such that P x is interior to 2#,; we see then that q s. N x . 
The interval 2 tJ having thus been determined, we proceed to determine i 2 
as the smallest index (> t x ) such that it contains a point that does not 
belong either to 2 t2+/f , for k — 1 , 2 , 3, ..., or to 2 tl . If we take a point P 2 , 
not in 2 (J , there is a greatest number N 2 such that P 2 is a point of 2v 2 ; 
thus 4 v N 2 . In general if 2 tl , 2 la , ... 2 tp have been determined 2 tpfl is 
determined by the condition that i Pfl is the smallest integer (> i P ) such 
that 2 tpM contains a point that does not belong to 2 tp+1+ *, for k ~ 1 , 2 , 3,, 
and that does not belong to any of the intervals 2 tl , 2< a , ... 2 tp . If P p + X 
is a point that does not belong to any of the intervals 2 t ,, 2 1# , ... 2 tp ; 
and if N p+1 is the greatest of the indices i pa , i p+2 , ... such that 2jv p+1 
contains P P + X , we have then i PKL & N p{L . The set of intervals {2 lp } is a 
restricted set, and it can be shewn to cover the whole of the open interval 
(a, ft). For, let Q be any point of that interval, then there exists a finite 
set 2 Yl , 2 Y2 , ... 2 Ym , of intervals of 2, all of which contain Q. If none of 
the indices y x ,y 2 , ... y m -i belong to the set {ij, it will be shewn that y m 
must belong to that set, and thus that 2 Ym is one of the intervals of the 
restricted set 2. The interval 2 Ym contains a point that does not belong 
to any of the intervals 2 tl , 2 lt , ... 2 tp , nor to 2 tpU , for h == 1 , 2 , 3, ...; 
and therefore, for a properly chosen value of p , it must have the least 
index for which these conditions are satisfied, and thus y m — t P + 1 . 

If we consider all the intervals («, p) contiguous to C (A), we obtain 
a restricted set of intervals that has the same interior points as the given 
set A. Moreover, a restricted set of intervals which covers the open interval 
(a, b), has only a finite component such that each interval of that com¬ 
ponent contains a point of a given interval (A, B ), interior to (a, b). It 
has been shewn that the measure of (A, B) is not less than half the sum 
of the lengths of the intervals of this finite set. By considering a sequence 
of intervals {A, B) converging to (a, b), we see that the sum of the lengths 
of the intervals of a restricted set which covers the open interval (a, b) cannot 
exceed twice the length b — a. 

468. The properties of a restricted set of intervals, covering an open 
segment, will now be applied to prove the following theorem: 

Let P be a perfect set , contained in the linear interval ( a , 6 ), and the 
intervals u x , u 2 , ... u n , ... be contiguous to P , and let (a;) be defined in 
each interval u ni or (a n , b n ), and such that \n ( a n) = 0 . Let w (u n ) or w n 
denote the upper boundary of | Xn ( x ) I * n an & ^ ^ be assumed, that 

X 

the series £ w (u n ) is convergent. Let the function F (x) — (b n ) + Xm (*) 
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consist of those terms for which b n x , and, in case x is interior to one of the 
intervals , (a m , b m ), of the term x m ( x )> f or the portion ( a m , x) of the interval; 
then F (x) is such that it has a differential coefficient equal to zero , at almost 
all points of P. 

Let the intervals p ly p 2 , ••• Pn-\i complementary to the set of N 
intervals u x , u 2 , ... u at. 


Let ft denote any segment of which the ends are points of P, and let 
a (/3) denote the sum of | Xn (&J | > for all those intervals ( a n , b n ) that are in ft. 

Let A f N be two positive numbers, and let 8 (N, A) be an interval, 
such that its end-points belong to P, and that it is contained in one of the 
intervals p x , p 2 , ... and is also such that a {8 (N, A)} is not less than 

A- 1 m{S(N t A)}. 

Let A (N, A) be the set of all such intervals 8 (A T , A); it will be shewn 
that this set of intervals is a complete set. If (a P , p v ) be an interval of a 
sequence, all the intervals of which belong to A (N, A), and such that 
a V) p p converge to a, ft respectively, as p ~ oo; it will be seen that (a, p) 
is an interval of the set A (N, A). In the first place, since a p , p p are points 
of P, for every value of p , it follows that a, p are points of P. Moreover, 
from and after some fixed value of p , all the intervals ( a P , p v ) must be 
in one and the same interval p m \ hence (a, P) is also in that interval. The 
smallest index n , of an interval u n contained in (a, a P ), or in ( p , P P ), must 
increase indefinitely, as p does so, thus u ( a P , a ), cr (p , p p ) converge to zero, 
as p — oo; hence 


a (a, P) — lim 



((z p , P v ) 


1 

A' 


It has thus been shewn that (a, P) satisfies all the necessary conditions 
that it may belong to A (N, A). The set A (N, A) being a complete set 
of intervals, it follows that it can be replaced by a restricted set of intervals 
A (N, A), containing the same interior points. The points interior to at 
least one interval of A (N, A), that is, to at least one interval of A (N, A) y 
form a non-overlapping set c (N, A) of open intervals. Every point of 
one such interval i is interior to at least one, and to not more than two, 
intervals of A (N y A). Thus the end-points of the intervals of A ( N , A ), 
that are interior to i, divide it into segments alternately in one, and in 
two, of the intervals of A (N, A); let A a (l) denote the first, and A 2 (i) the 
second, set of these segments. We have then 

m (0 = m {A x (0} 4- m {A 2 (t)}, 
and also a (i) = a {A 2 (t)} - 4 - a {A 2 (t)}. 


Every interval 8 ( N , A) having points in common with t, and consequently 
contained in is decomposed into one interval of A x (i) and two of A 2 (t); 

hence we have — — _ 

*{A (i)} — cr {Aj (i)} + 2a{A # (i)}. 
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Now ,7 {A (i)} =i A ~ l m {A (t)} a A - 1 m (t), 

find a {A, (<.)} + 2a {A 2 (i)} < 2a (t); 

therefore m (i) < 2Aa (i). This inequality holds for each interval covered 
by A (A T , A), and these intervals i are interior to the segments 


Pi> Pi, ■■■ Pv-u- 

The intervals u n , interior to an interval <. all have indices greater than N , 

00 

hence, denoting by t N the sum £ I (b n ) |, the total measure of all the 

intervals t, covered by the intervals A (N, A), is < 2AtN* This measure 
diminishes indefinitely as N is increased indefinitely. 

Let each interval u n be increased at each end by adding on a segment 
of length Aw n ; and denote the interval so increased by u n ' (A). 

If we exclude from (a, b) all the segments u l9 u 2 ,... un, and the seg¬ 
ments A ( N , A), and also the segments u' N tJ) (A), where p — 1 , 2 , 3, ..., 
the measure of these excluded segments is less than 

Z>u n -f 2 At N 4 £ u n ' (^4), 

1 N-\ 1 

oo oo 

or than £ u n -{■ 4Asjv, where Sn — £ w n . 

1 N t 1 


The measure of P being b — a — £ u n , the measure of the points ex- 

i 

eluded is at most b — a m ( P) + 4.4 . If N is so large that 4^4 6 \y < m (P), 

the interval (a, b) contains points that are not excluded. These points 
form a set E (N, A) of which the measure is at least m (P) — 4As*jy; and 
this set is contained in P, since it does not contain any points of u n , for 
any value of n. Moreover E (N, A) is contained in E (N f 1 , A), as is 
seen by observing that every interval of A (N + 1 , A) is an interval of 
A (N, A). If E (A) denote the outer limiting set of the sequence of sets 
{E (N, A)}, as N is increased indefinitely, we see that E (A) is contained 
in P, and has the same measure as P, since 4^4^ — 0 , as N ~ oo. 

Let £ be any point of E (A); it then belongs to E ( N , yl) for a certain 
value of N, and for all greater values; it is moreover a point of P which 
is a limiting point of P, on both sides. The point f is interior to a segment 
p t belonging to the finite set p l9 p 2i ... p N }1 ; a sequence of values of x 
converging to f may be taken to be interior to p t . 

If all the points x , of the sequence, are points of P, we have 

| F (x) - F (£) | s | | *„ (b n ) | - a d, x), ifx>£, 


and | -P 1 (a;) — JF 1 (^) j & o- (x, £), if x < £. 

Since f is in the set E(N,A), we see that a (£, x), or a (x, f), is 

I r (*)-/*(« | 


< A - l | x — $ |; hence 


1 

< A- 
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If x be a point of p t that does not belong to P, and is thus interior to 
u m , for some value of m , greater than N + 1 , we have | x — £ | > Aw m , 
since £ is not in the closed interval u' m (A). 

If x > £ , we have F (x) - F(£) - X m (x) + F (a m ) - F (() ; hence 
| F (x) - F (£) | < w rn + ^\a m -{\< 2 \x-£\. 


If x < we have 

Fl£)-F (x) = F (f) - F (b m ) + x (bj - Xm (.x ); 

and therefore 

I F (x) ~F(i) 


< 


+ 2w m < -7 | f - x 


It is now seen that, for any point x, in p t , 


F (*> - F (f) 


< hence 
A 


x - £ 

none of the four derivatives of F (x), at can exceed 3/A. If we assign 
to A y the values in an increasing sequence that diverges, we see that each 
of the sets E (A) is contained in the preceding set; for E (N> A') is con¬ 
tained in E (N, A), if A' > A. Each of the sets E (A) has the measure 
m (P), and therefore the set E of points common to all the sets E ( A ) has 
the measure m (P). At a point of E, the four derivatives of F (f) are 
all less than 3/ A, for all the values of A in the sequence. Therefore F (f) 
has a differential coefficient, equal to zero, at every point of E , that is, 
at almost every point of P. The theorem has thus been established. 
Another proof of this result has been given* by Burkill. 


469. Erom the theorem of § 468, the following theorem can be deduced 
at once : 

If P be a perfect set , and {(a n , h n )} be the set of contiguous intervals , then, 
if the function V (a, x) be summable over P, and ifHW (a v , b n ) be convergent , 
the function V (a, x) has a finite differential coefficient, at almost all points of P. 

The term W (a, ft) denotes, as in § 464, the upper limit of | V ( aft') |, 
for all intervals (a', ft') contained in (a, ft). In this case 

Xn (*) = f / (*) dx, X n (K) = f 7 (*) dx. 

• (In J On 

Moreover we have 

w (a n , b„) s W (a„, b n ) s 2 w (a„, b n )\ 

CO CO 

and hence, if 2 W (a n) b n ) is convergent, so also is 2 w ( a n) b n )> and the 

n-1 n-1 

converse also holds good. 

We have V (a, x) = [ /(x) dx 4* £ V (a n , b n ) + V (a m , x), 

J a a 


* Proc. Camb. Phil. Soc. vol. xxi (1923), p. 660. 
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where the summation is taken for all the intervals (a n , b n ) interior to(a, #), and 
the term V (a m , x) exists if x is interior to (a m , b m ), and the function f(x) 
is equal to / (x) at every point of P, but is elsewhere zero. Since / ( x ) is 
summable in (a, 6 ), its indefinite integral has, almost everywhere, a differen¬ 
tial coefficient, equal to / (x ); and by the theorem of § 468, the function 

X 

£ V (a ni b n ) 4 - V (a m ,x) has a differential coefficient equal to zero, at 

a 

almost all points of P. It follows that the function V (a, x) has a differ¬ 
ential coefficient, equal to / (x), at almost all points of P. 

470. The theorem of § 469 may now be employed to establish the 
following property of the D-integral : 

The indefinite D-integral F (x), of a function f ( x ), in the interval (a, b ), 
has almost everywhere in the interval a differential coefficient , equal to f (x). 

This is the extension to the D-integral, of the property of the D- 
integral, given in § 405. 

With the notation of § 465, it is clear that, in any interval contiguous 
to //, F (x) has almost everywhere a differential coefficient, equal to / (x). 
For f (x) has an D-integral in every interval («', /?'), interior to an interval 
(a, ft), contiguous to H ; and therefore (§ 405) F (x) has a differential coeffi¬ 
cient, equal to f (x), almost everywhere in (a', ft'), and therefore almost 
everywhere in ( a , ft). It can now be shewn that F (x) has this property 
in any interval contiguous to P 1 . For it holds in any interval contiguous 
to Mf, and therefore also in any interval contiguous to M". Generally, 
it can be shewn that F ( x) has the property in every interval contiguous 
to any derivative of M 1 ; and thence it is shewn that F (#) has the 

property in any interval contiguous to P 1 . 

It will next be shewn that, in any interval contiguous to P 2 , the 
function F (x) has almost everywhere a differential coefficient equal to 
/ (#). For, consider an interval 8 , which contains no points of // 2 , but 
contains a portion p lt of P 1 . The property holds in each interval con¬ 
tained in 8 , contiguous to , and by the theorem of § 469, it holds for 
the set p Y \ and consequently it holds for 8 . It can now be shewn, as in 
the case of Pj, that the theorem holds for every interval contiguous to P 2 . 
Proceeding in this manner, as in § 465, we see by induction, that, in 
virtue of the theorem of § 469, the theorem holds for any interval con¬ 
tiguous to P y , where y is a number of the first, or of the second, class. 
Since, for some value 7, of y, the set P y is non-existent, it follows that 
the function F (#) has a differential coefficient, equal to / (x)> almost 
everywhere in (a, b). 

471. The importance of the D-integral in the general theory of inte¬ 
gration depends largely upon the property of the integral expressed in the 
following theorem: 
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If F (x) have, at each point of the interval (a, b), a finite differential coeffi¬ 
cient f (x), the function f (x) is integraJble ( D ), in the interval (a, b), and 

jj(x) dx = F (x) - F (a). 

This theorem expresses the fact that the operations of differentiation 
and integration are completely reversible in the case of any function 
which has, everywhere in the interval in which it is defined, a finite differ¬ 
ential coefficient, whether that differential coefficient is bounded or un¬ 
bounded in the interval. The property (B), in the statement in § 343, of 
the fundamental theorem of the differential calculus, is covered, in a wide 
class of cases, by the above theorem. It will be observed that the corre¬ 
sponding theorem for the P-integral is subject to the condition that F (x) 
must be of bounded variation, as otherwise / (x) will not be summable 
(see § 405), whereas, in the above theorem, the function F (x) is subject 
to no condition beyond that of possessing everywhere a finite differential 
coefficient. 


In order to prove the theorem, it is necessary to shew that the function 
f (x) satisfies, in relation to F (x), the conditions I, II, and III, of § 464. 

(I). To shew that the set of points of non-summability of /(x), with 
respect to any perfect set P, contained in (a, 6), is non-dense in P, we 
observe (see § 401) that/ (x) is the limit of a sequence {ifi n (x)} of continuous 
functions. For, if ip n (x) denote {F (x -f h n ) — F (x)}/h nf where {h n } is a 
sequence of numbers which converges to zero, and is independent of x , 
these functions «/r n (x) are all continuous functions of x. 

The function ifj n (x) may be expressed by x ( x > y)> a function of the 
two variables x , y, where y — l/n . The function x ( x > y) is in the first 
instance defined only for values of y, of the form l/n, but it may be ex¬ 
tended to the case in which y has all values in the interval 0 < y £ 1, by 


such a rule as that, when y is in the interval 



x (*> y) 



i 



n n + 1 


1 

n -f 1 



The function x ( x > y)> 80 defined for the plane set of points a s x s b, 
0 < y & I, is continuous with respect to t/, for each value of x. In ac¬ 
cordance with the theorem of § 323, there must be, in every interval on 
the line y = 0, points at which x ( x > y) i fl continuous with respect to (x, y), 

and therefore with respect to x. Therefore f (x), which is lim x (%> y)> is 

v~o 

point-wise discontinuous; and this is also the case if only those values of x 
are considered which belong to a perfect set P. It follows that the points 
of infinite discontinuity of / (x), with respect to P, form a set that is non- 
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dense in P. This set contains the set of points of non-summability of / (#) 
with respect to P. Therefore / {x) satisfies the condition I, of § 464. 

(II) . If (a, p) is an interval in which / (#) is summable, we have, in 
accordance with the theorem of § 406, F (ft) — F (a) = J / {x) dx. Let it 
be now assumed that (a, ft) is an interval such that the relation 

has been shewn to hold, at some stage of the process of construction of 
the D-integral of / (x), for all intervals (a , ft') interior to (a, ft). If a' ~ a, 

P' ~ ft, F (a') converges to F (a), and F (P') converges to F (/?), on account 

re 

of the continuity of the function F (x). It follows that, J / (x) dx , defined 
as lim f / (x) dx, is equal to F (P) — F (a). Therefore the condition II, 

a'^a, • a' 

of § 464, is satisfied. 

(III) . Let it be assumed that, in each interval (a, p) contiguous to a 
perfect set P, the D-integral of / (x) has been shewn to exist, and to be 
equal to F (p) — F (a). The following theorem will be proved: 

If the continuous function F {x) have , in the perfect set P, a finite differ¬ 
ential coefficient at every point, then the set of those points of P which are 
points of divergence of the series L W n , where W n is the fluctuation of F (x) 
in the interval (a n , p n ) contiguous to P, is non-dense in P. 

In case the set of points of divergence of the series £JF n is not non- 
dense in P, there must be an interval containing a portion of P, in which 
the set is everywhere dense in that portion of P. Let it therefore be 
assumed that the set of points of divergence of Y,W n is everywhere dense 
in P. It will then be proved that there exist points of P at which one 
of the derivatives of F (x) is infinite; and this is inconsistent with the 
assumption that F (x) has a finite differential coefficient at every point 
of P. The theorem will then have been established. 

In every interval containing a point of P, the series S W n is divergent; 
it then follows that in this interval there is an infinite number of values 
of n for which W n /(P n — a n ) exceeds a fixed positive number k ; for if 

w n £ k (P„ - a„), 

00 

for all values of n with the exception of a finite number, the sum hW n , 

rn 

for some fixed value of m, would be £ (P n — a n ), and thus the series 
would be convergent. 

If i, Tj be the points of (a n , p n ) at which F (x) has its upper and lower 
limits, we have 

* it)-r (v) = {* (£) - r («»)} + {** («-) - f (,»; 
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and since both the terms on the right-hand side are ^ 0 , one of them is 
greater than \ {F (£) — F ( 77 )}. There exists therefore a point y n , in 


(a n) p n ), such that 


F ( Yn) ~ F (a n ) 




1 F 

is greater than x — 

" Pn 


F( V ) 


It 


thus appears that there exists a set of points c ly c 2 , ... c m , ... all belonging 
to P, and dense in P, such that there is a point y m on the right of each 
\F(y m )~F(c m )\ 


point c m , for which 


> A, where A is an arbitrarily chosen 


Ym ' 

positive number. From the continuity of F (#) at c m , an interval o m con¬ 
taining c m , of length not exceeding e m , and not containing the point y mi 

F (y m ) -F(x)\ 


can be so determined that 
F (y m ) - F (c m ) 


lies between the two numbers 


Ym — x 

± e mi for all points 2; in a m . The number e m is taken 

Ym - C m | 

to belong to a sequence {e m }, of decreasing numbers, which converges to 
zero. Any point £, contained in an infinite number of the intervals a mi 
belongs to P. For the length of cr m , not exceeding e m , which contains the 
point c m , of P, the point £ is a limiting point of P, and therefore belongs 
to the set. Also y m — f does not exceed — a m -f e m , and therefore y m 

F (y ) — F (£) 

converges to f, as m increases indefinitely; it follows that - — -7-- 

Ym b 

as 00, has its lower limit > A; and thus, at the point there is a 
derivative numerically > A. 


It has now been shewn that all the points interior to an infinite 
number of the intervals c m , form a residual set L relative to the perfect 
set P. At every such point f there is a derivative numerically greater 
than k. 


Let A have successively the values in an increasing sequence {A n } which 
diverges, and let {.L n } be the corresponding sequence of sets L , residuals 
with respect to P. A point that does not belong to any of the sets L n 
belongs to the sets, of the first category relative to P, that are comple¬ 
mentary to the sets L n , and all these complementary sets form an enumer¬ 
able sequence of sets, non-dense in P; therefore the points that belong to 
all the sets {P n } form a residual set, relatively to P. At a point of this 
residual set, there is a derivative that is numerically greater than A,,, 
whatever value n may have, and thus there is an infinite derivative. It 
has now been shewn that the assumption made is inconsistent with the 
existence of a finite differential coefficient at each point of P. Thus it 
has been shewn that the condition III, of § 464, is satisfied. 

It has been shewn that the conditions I, II, III, of § 464, are all 
satisfied by f (#), and it only remains to be proved that the condition (4), 
of § 464, is satisfied. We shall prove the following theorem: 
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If F (x) have a finite differential coefficient f (x), at every point of a 

00 

perfect set P, which is summable over P, and the series D W (a n ,b n ) is 

n -1 

convergent, where W (a n , b n ) denotes the fluctuation of F (x) in the interval 
(a n , b n ) contiguous to P , then 

F (b) — F (a) = I* / (x) dx+X {F (b n ) - F (a n )} 9 

J(P) n -1 

where (a, b) is the interval in which P is contained. 

In the theorem of § 468, let 

Xn (X) = F (x) - F (a n ), 

then the function 

9 ( x ) - F ( a ) + S {F (b n ) - F (a n )} + {F (x) - F (a m )} 

a 

has, at almost all points of P, the differential coefficient zero. The sum¬ 
mation in the second term on the right-hand side is taken for all the 
intervals ( a n , interior to (a, x), and x is taken to be in the interval 

(a nl , b m ). 

Let F (x) = g (x) + h (x), then h (x) has a differential coefficient, equal 
to/ (#), at almost all points of P. 

Thus, if / (x) is summable over P, we have 

F {x) =g{x) + l f (x) dx, 

J a 

where / (x) = / (x) at all points of P, and is elsewhere equal to zero. 

It now follows that 

F (b) - F (a) - f f (x) dx + £ {F (b n ) - F («„)}. 

It has thus been shewn that all the conditions necessary for the 
existence of the P-integral in (a, b) are satisfied, and that the integral is 
equal to F (b) — F (a). It is clear that, in the interval (a, x), the P-integral 

) / (x) dx exists, and is equal to F (x) — F (a). 

J a 

PROPERTIES OF THE DENJOY INTEGRAL 

472. A P-integral that is not also an L-integral cannot be an abso¬ 
lutely convergent integral, that is, the integral of the absolute value o 
the function cannot exist. For, if a and /? are two points of non-sum- 
mability of the function / (x), and the L-integral of / (x) exists in every 

interval (a', /3') interior to (a, /5), the limit of J | / (x) | dx, as a' ~ a, 
/?' ~ ft, is infinite. 

If f (1) (x), f {2) (#) be two functions , both integrable (P), in the interval 
(a, 6), then their sum is also integrable (P), in the interval , and 

f {/ (1) (x) +/ (2) (*)} dx - f/< x > (x) dx + [>> (x) dx. 

J a J a J a 
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Let H (1 \ H {2) be the sets of points of non-summability of the two 
functions f ll) (x), f {2) (x ), respectively. The points of non-summability of 
/ (1) (x)-f/ (2) (x) form the whole, or a closed part, of the closed set 
M (// (1) , H {2) ), which may be denoted by U ; and this is the sum of a perfect 
set P x , and an enumerable set M x . It has been shewn in § 466, that the 
set of points of non-summability of either of the functions/ (1) (x),/ (2) (x), 
assumed to be integrable (Z>), with respect to any perfect set whatever, 
contained in (a, 6), is non-dense in that set, and that the same is true as 
regards the points of lion-convergence of the sum of the fluctuations of their 
integrals in the intervals contiguous to the perfect set. We may therefore, 
in the process of construction of each of the integrals 

( f a) ( x ) dx, | f {2) (x) dx, 

employ the set H , instead of P (1) , or H (2) . We shall have 

P, = M (PS” P^h 

and in general P« = M (P« (1) , P« (2) ). Moreover if, at any stage of the 
process, it has been shewn that the theorem holds for every interval 
(a', P') interior to the interval (a, P), we see that, since the integrals of 
f a) (#), f i2) ( x ) i n ( P') converge to their integrals in (a, £), as a' ~ a y 
p' ~ p, the theorem also holds for the interval (a, P ). The same theorem 
therefore holds for the integrals obtained at each stage of the process of 
construction of the D-integral of f a) (x) -f f (2) (x), and therefore for the 
final integrals taken over the interval (a, b). 

473. In the case in which the set H, of points of non-sum inability of 
the function / (x), has measure zero, the function is necessarily summable 

over the nucleus P, of H t because m (P) = 0 , and thus I / (x) dx ~ 0 . 

J(P) 

In this case, the D-integral over (a, 6), assumed to exist, is the sum of 
the P-integrals over the intervals contiguous to P. In order that the 
D-integral may exist, it is sufficient that the sum of the fluctuations of 
the integrals over the intervals contiguous to P should be convergent. 
When this condition is satisfied, so that the D-integral of/(x), over (a, 6), 
exists, it is not necessarily the case that / (x) has an DX-integral over 
(a, b), because the condition for the existence of the latter is, in accordance 
with E. H. Moore’s theorem, given in § 457, that the sum of the fluctua¬ 
tions of the integrals of / (x) taken over the intervals contiguous to the 
set H should be convergent; and this condition is more stringent than the 
condition that the sum of the fluctuations over the intervals contiguous 
to the nucleus P, of H , should be convergent. This explains why the sum 
of two functions, both of which have DX-integrals, does not necessarily 
possess an DP-integral (see § 460), although, as has been shewn above, 
the sum of two functions, each of which is integrable (D), is also inte¬ 
grable (D). 
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474. The following property of a function that is integrable ( D ) will 
now be established: 

If f (x) be integrable (D), in the interval (a, 6), and cf> ( x ) be bounded and 
monotone in the interval , then the product f ( x ) <f> (x) is integrable (D), in 
(a, b ), and satisfies the relation 

fj (*) <f> (*) dx = |V (x) <f> (z)j* - j* F (x) d<f> (x), 
where F (x) denotes the indefinite integral j f (x) dx. 

J a 

It follows from this theorem that the product of / (x) into any function 
of bounded variation in (a, b) is integrable (D). 

To prove the theorem it is necessary to shew that the function/ (x) <j> ( x) 
satisfies the conditions given in § 464, in relation to the function 

[r (x) <f> (*)] - £ F (x) d<f> (*). 

If, in the interval (a, /3), the function / (x) is summable, we have (§ 447) 
[ / (x) cf> (x) dx = | <f> ( x)dF (x), 

and therefore 

ff (*) 4> (*) dx = |V (x) 0 (x)J ? - J F (x) d<f> (a;); 

thus the theorem holds in the interval (a, /?). 

Next, if the theorem holds in every interval (a', fi')> interior to the 
interval (a, £), it will be shewn to hold for (a, /3). 

We have to shew that, as a ~ a, ft' ~ ft, 

|V (*) <f> (*]*- fr (x) d<f> (x) 

converges to ^ (x) <f> (x)J — J F (x) d<f> (x). 

Since cf> (x) is monotone, it is clear that f F ( x ) d<f> (x) is equal to 

J a 

F (a") J dtf>(x), 

where a" is some number in the interval (a, a'), and this is equal to 

*■(«") {*(«') -*(«)}; 

therefore lim f F (x) <bf> (x) = F (a) {(f> (a 4- 0) — <f> (a)}. 

a'~a J a 

Similarly, we have 

lim f F (x) d*(x)~F ()8) {<f> (/8) - <f> (j8 - 0)}; 

Ifi' 
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rP' CP 

and therefore lim F (x) d<f> (x) = F (x) d<f> (x) 

a'~a,8'~f} J a' Ja 


[CH. VIII 


Since 


- F (p) {4> 08) - <f> 03 - 0)} 

- F (a) {<f> (a + 0) - <j> (a)}. 


lim \f (x) $ (x)7 = [i? (x) <f> (x) I* - F (p) {4> 03) - <j> 03 - 0)} 

3-~0L J*' l j» 

- J 1 (a) {<£ (a + 0) - 4> (a)}, 


the result follows. 


Hence, as 


( f (x) <f> (x) dx is defined to be 

J a 

lim / (z) (f> (#) dx, 

a~a t P'^P J a' 


we see that the theorem holds for the interval (a, /?). 

A point in the neighbourhood of which / (x) is summable is also a 
point in the neighbourhood of which / (x) cf> (x) is summable; thus the 
set H is the same for the function / (x) cf> (x), as for / (x), unless <f> (x) = 0, 
in a line of invariability. 

We have now to shew that the condition III, of § 464, is satisfied for 
the function / (x) <f> (x). 

If ( a n , j3 n ) be an interval contiguous to the perfect set P, in which the 
integral of / ( x) <j> (x) has already been constructed, we have 

f /(*) 4> (*) dx = \f (x) <f> (x) - ( F (x) d<f> (x). 

J a n L _ a n • a n 

If rj are the points of the interval at which F (x) attains its upper 
and lower boundaries, we have to estimate W' (a n , p n ), the fluctuation of 


i: 


f (x) </> (a?) dx 


in the interval. The fluctuation of ^F (x) </> (cc)J is not greater than 

{F ({) - F (a n )} <f> (p n ) ~ {F (rj) - F(a n )} cf> (a n ); 
it being assumed that <£ (x) is a non-diminishing function. Since 
F(£)-F (a n ) > 0, F ( V ) - F (a n ) < 0, 
the fluctuation is not greater than {F ($) — F (rj)} </> (p n ), or than 

cf> (b) W (a n , fi n ), 

where W (a n , fi n ) is the fluctuation of F (x). 

Since \y (x) dfi (x) lies between F (f) {<f> (fi n ) — <f> (a n )} and 
F (rj) {<f> (fi n ) - <f> (a n )}, 

the fluctuation of the integral does not exceed {</> (fi n ) — <f> (a n )} W (a n , fi n ), 
or is less than {<f> (b) — <f> (a)} W (a n , fi n ). Thus W' (a n , fi n ) is less than a 

oo 

fixed multiple of W (a n , fi n ). If then £ W (a n , fi n ) is convergent, so also is 

n -1 

S W' (a n , P n ). 

n -1 
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oo 

It now follows that a point of the perfect set P, at which £ W (a n , fi n ) 

71 *1 
oo 

is convergent, is also a point of convergence of £ W' (a n , /3 n ). For the 

n -1 

result obtained above may be applied to the part of P contained in a 
neighbourhood of the point, so chosen that, in it, £ IF (a n , ft n ) is con¬ 
vergent. 

It has thus been shewn that the set of points of P which are not points 
of convergence of £TF' ( a n , is non-dense in P, since the corresponding 

condition for £JF (« n ,/? n ) is satisfied. Therefore the condition III, of 
§ 464, holds for the function / ( x) <f> (x). 

For the complete establishment of the theorem, there only remains 
to be proved that: 

If f (x) be summable in a perfect set P, contained in an interval (a, /?), 
then 


{>(*)*(*)][-£ 


F (x) d<j> {x) 

'1/3. /"ft 

F (x) <t> (x) - I 

J an J a. 


F (a/ df> (x) 


W / 

) J(P) 


f(x) cf> (x) dx; 


the summation being taken for all the intervals (a n , ($ n ) contiguous to P, it 
being assumed that the sum is absolutely convergent. 

This theorem can then be applied to the perfect set contained in any of 
the intervals contiguous to any one of the perfect sets P 1 , P 2 , ..., P fi , ..., 
employed in § 465, the conditions of the theorem being in each case 
satisfied. 


Let f (x) be expressed by the sum of the two functions f y ( x ), f 2 (x); 
where f x (x) = f (x), in P, and / 2 (#) = / (#) in all interior points of the 

intervals (a n , /3 ri ). Let F x (x) = j* f x (x) dx, and P 2 (x) = J f 2 (x) dx; thus 
F (x) = F ± (x) -f F 2 (x). 


The P-integral f f x (x) <f> (x) dx exists, being equal to f f x (a;) cf> (a:) dx ; 
J a J(P) 

[ft 

also / 2 (x) <f> (x) dx exists as a P-integral, being equal to 

2 [ f 2 (*) (*) dx. 

J an 


In each interval (a„, /?„), F, (x) is constant, thus 

(x) <j> (a;)] 3 ”- (x) deb (x) = \F (x) <b (x)Y"- F"f (x) d<b (x). 

L J an •' L J a n J a n 

Now 1/ ( x ) <b (x)J — J F (x) d<b (x) 

=• | [•f’l (*) <f> (*)][ ~f a F i (*) d( t> (*)} + {(*) <f> (*)]] - £ F » 0*0 d 4> 0*0}; 
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the expression in the first bracket on the right-hand side is equal to 

r /i ( x ) $ ( x ) dx > ° t to f a m </> ( a? ) dx - 

Ja J(P) 

In order to prove the theorem, it is then sufficient to shew that 

£p 2 (x) 0 (x)J* - j* F 2 (x) d<f> (x) = | |p 2 (x) 0 (x)J - [ F 2 (x) d<f> (x)J; 

this is the special case of the original theorem which arises when f (x) = 0, 
over the set P. 


Consider the first m of the intervals (a n) and let 

(yi> &i)> (y 2 > ^ 2 )> (ym+i> &m+ i)> where y x — a y 8 m+1 = ft, 
be the intervals complementary to the intervals (a x , &), ... (a mi fi m ). 
We have 


[ ~\P fp »-m(r ”1 Pn fPn 

F 2 (x) 0 (x)J^ - J*F 2 (x) d<f> (X) - ^ 11^2 (x) 0 (x)J a - F t (X) d<j> (x) 

r-m+l f “|fir r-m+l fir 

= 2 I F 2 (x) <f> (x) —* 2 I F 2 (x) d(f> (x) ; 

r-i L Jyr f“i J yr 

and the expression on the right-hand side may be written as 

r-m+l r -m + 1 

£ 4> (3 r ) {F t (S r ) - F t ( Yr )} + £ F 2 ( Yr ){4>(8 r )-4>(y r )} 

r-1 r-1 

r-m+l fir 


which is equivalent to 


r-m+l fir 

- £ F t (x) d<f> (x), 

r-1 J y r 


r-m+l fP 

£ 4> (K) {F 2 (8 r ) - f 2 (y T )} + / (X) dcf> (x), 

r-1 j a 

where 0 m (x) is defined to be F 2 (y r ) — F 2 (x), in the intervals (y r , 8 r ), and 
to be zero in the intervals (a l9 &), (oj, j8 a ), ... ( a m , £ m ). 

__ r -m+1 

The first term is numerically less than 0 2 | F 2 (8 r ) — F 2 (y r ) |, 

r-1 

where 0 is the upper boundary of | 0 (x) J in (a, ft); and 

r-m+l 

£ | F 2 (S r ) - F 2 (y r ) I 

r-1 

does not exceed the sum of the fluctuations of F 2 (x) in the intervals 

fim+ 1 )> ( a m-f 2 > &n+ 2 )> •••» 

contained in the intervals (y ri 8 r ). It follows, from the convergence of 
the series of fluctuations in the intervals contiguous to P, that the first 
term is arbitrarily small, if m be sufficiently great; thus the first term 
is < € m , where lim e m = 0. 

oo 

If x is not a point of P, we have 0 m (x) = 0, for all sufficiently large 
values of m\ and if x is a point of P, it is in one of the intervals (y ri 8 r ), 
where y r converges to x, as m is indefinitely increased, and thus 0 m (x) 
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converges to zero. Since <p m (x) is bounded, for all values of m and x, we 
have, by a theorem established in § 445, 

Jim I <Pm (x) dcf> (x) = f [lim >p m (*)] d<f> (x) = 0. 

m~0O a J a 771 ~ 00 


It has now been shewn that the limit, as m ~ oo, of 

[**. (*) <f> (*)][ -JV. (x) (x) - £{[>. (x) 4> (x)J^-fy t (x) d<f> <*)}, 


is zero. The theorem has therefore been established; and it has been 
proved that 


f / {x) <j> ( x) dx = F ( x) <f> (x) - [ F (x) d<f> (s), 

da L Ja Ja 


which is an extension, to the ^-integral, of the theorem of § 445, for the 
L-integral. 


475. As in § 445, it may be shewn that the second mean value theorem 
is applicable to the integral of / (x) <j> (x), where / (#) is integrable (D), 
and <f> (x) is monotone and bounded. 

For £ F (x) d<f, (x) - F ( M ) {<f> (6) - <f, (a)}, where /x is some number in 
the interval (a, b) ; therefore 

I /(*)</> (z) dx ~ F (b) (f> (b) — F ( M ) {<f> (b) - <f> (a)}, 

J a 

if we assume that F (a) = 0. This is equivalent to 

I f (x) <j> (x) dx = <f> (a) (x) dx + <f> ( b) j f (x) dx. 

The more general form of the theorem, including Bonnet's form, may 
be deduced, as in §422. For the application to the representation of a 
function that is integrable (D) by trigonometrical or other series, the 
result here obtained is of decided importance. 


EXTENSIONS OF THE DEFINITION OF THE DENJOY INTEGRAL 

476. A generalization of the definition, given in § 464, of the Denjoy 
integral has been made by Denjoy* himself, by W. H. Youngf, and by 
Khintchinef, independently of each other. Instead of the series 

s W (o„, P„), 

n -1 

in which the terms are the fluctuations of V (a n , x) in the intervals (a n , )S n ), 

* Annales ec. de Viccle normale (3), vol. xxxm (1916), p. 127, and (3), vol. xxxiv (1917), 
p. 181. These memoirs are a continuation of researches in the Journal de Math. (7), vol. i (1915), 
p. 105, and Bull, de la eoc. Math, de France, vol. xun (1915), p. 161. 
t Proc. Bond, math . 8oc. (2), vol. xvi (1916), p. 175. 

X Comptes JRcndus, Paris, vol. CLxn (1916), p. 287. 
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oo 

the series S | F (a n , fi n ) | is employed. Thus, for the definition (4), in 

n -1 

§ 464, the following is substituted : 

(4)'. Let P be a perfect set of points in an interval ( a , /?), contained in 
(a, b), and let it be assumed that f ( x ) is summable over the set P, and suppose 
that V (a', ft') has been defined for every interval (a', /?'), of (a, ft), which con¬ 
tains no points of H as interior points. Let (a n , fi n ) t where n ~ 1, 2, 3,’, 

denote the intervals of (a, ft) that are contiguous to P, and let it be assumed 
00 

that the series 2 | V (a n , fi n ) | is convergent. Then V (a, ft) is defined by 

n -1 

F(«,J8)= S V (a n , fi n ) + f f(x)dx. 
n ~1 J(P) 

The definitions (1), (2), and (3), of § 464, are unaltered. 

Instead of the condition III, of § 464, the following is substituted: 
III'. For every perfect set P, if V (a n , fi n ) have been calculated for every 
interval ( a n , fi n ) contiguous to P, the set of points of P which are not points 

GO 

of convergence of 2 \ V (a n , fi n ) | is non-dense in P. 

n -1 

OD 

As before, a point of convergence of 2 | V (a n , fi n ) | is one which has 

n ~ 1 

a neighbourhood such that the series of those terms for which (a w , fi n ) is 
interior to the neighbourhood is convergent. Thus the neighbourhood 
contains a part of P, such that the series is convergent, when taken for 
the interval contiguous to that part of P. The conditions I, II, of § 464, 
remain unaltered. 

oo 

It will be observed that, if the series 2 W (a n , fi n ) is convergent, so 

n -1 
oo 

also is the series 2 | V (a n , /? n ) |; but that the converse does not hold. 

n -1 

It thus appears that the D-integral is a special class of the functions 
V (a, x) which satisfy the wider set of conditions obtained by substituting 
(4)' and III' for (4) and III respectively. 

When the function / (x) is such that all the definitions (1), (2), (3), (4)' 
and I, II, III' are realized, the actual construction of V (a, 6) is carried 
out precisely in the same manner as in the case of the D-integral, no 
essential change being necessary in the proof of existence of the integral. 

477. As has already been pointed out in § 466, the conditions (3) and II 
contain a postulation that V (a, x) shall be a continuous function of x , 
and it was shewn that the D-integral constructed in § 464 actually satisfies 
the requirement of this postulation. It will however be shewn that when 
the new conditions (4)' and III' are substituted for (4) and III, the postu¬ 
lation of continuity of V (a, x), or of F (x), is not satisfied unless the 
function f (x) satisfies a certain condition. 
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The set H denotes the aggregate of the points at which / (a;) is not 
summable, and of the points in the neighbourhood of which 21 | V (a n , fi n ) | 
is not convergent. 


The function V (a, x) is clearly continuous at any point which does 
not belong to H , since V (a, x -f h) — V (a, x) is an L-integral, for suffi¬ 
ciently small values of | h |. Next, let G be a perfect set, contained in an 
interval (a, /?), and assume that / (x) is summable over G , and that every 
point of G is a point of convergence of the series 2 | V (a n , fi n ) |, when the 
summation is taken for those intervals contained in (a, /?) which are con¬ 
tiguous to G. 


We have 

V (a, x -j- h) 



f(x)dx + 2 [ / (x) 

- On 


dx 


x+h 

3 

X 


where (a n , b n ) is contiguous to G. As h ~ 0, we see that 

f (x) dx\ ~ 0, 

L J (O) J x 


on account of the continuity of an A-integral. In case x and x -f- h are 
r rb n lx+/i 

both points of G, 2 / (#) dx also converges to zero, as h ~ 0, in 

L J On J X 


such a manner that x + h is always a point of G. But if x + h is not 
restricted to be a point of G } it is in the interior of some interval (a m , b m ), 

[ rb n ix+ft 

2 I f (x) dx\ contains the term 


J 'X + A 

/ (*) dx, 

dm 


Cbm 

if h be positive, or / (x) dx, in case h be negative. It may happen 

J x+h 

rx+h rb m 

that, as h converges in any manner to zero, / (x) dx, or / {x) dx, 

J am - x+h 

does not converge to zero, and in that case V (a, x) would not be con¬ 


tinuous at the point x, of G. We have 


fx-f-A 

/(*) 

J rw 


dx 


^ W m , where W m is 


the upper boundary of 


rb' 

f (x) dx for all intervals (a', b') contained in 

J a ' 


(i a m , b m ). Let it be supposed that, for an infinite sequence of values of 
m, W m is greater than some positive number k. However small h may 
be, there will be one of these intervals ( a r , 6 r ), for which W r > k, in the 
interval (x, x + h). Hence, in ( a r , b r ), there will be two points a/ = x + 
and b r ' = x + h 2 , for which the terms considered differ by at least k. If 
we take a sequence of values of h, such that x + h approaches x along 
the sequence {«/}, we obtain a limit which differs by at least k from the 
limit obtained when x + h has a sequence of values {6/}. It thus appears 
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that V (a, x -f h) — V (a, x ) could not, for both sequences of values of h> 
have the limit zero. Hence, with the hypothesis made, V (a, x) would be 
discontinuous at the point x. In order that V (a, x) may be continuous, 
it is therefore necessary and sufficient that lim W m = 0. This condition 

m~oo 

must be satisfied for all the perfect sets contained in all the intervals 
contiguous to each of the sets P x , P 2 > • • • Py , employed in the construction 
given in § 464. 

It thus appears that the postulation contained in (3) and II, which is 
equivalent to the postulation that the integral V ( a , x) constructed, as in 
§ 464, but with (4)' and III' substituted for (4) and III, should be con¬ 
tinuous, is satisfied only if the upper boundary of V (a ni x), in the interval 
(i a n , /? n ), converges to zero, as n ~ oo, for each of the perfect sets con¬ 
tained in any interval contiguous to P y , where P y is any one of the perfect 
sets employed in the construction of V (a, b). 


478. An integral F (x) = / / (x) dx, constructed in accordance with 

J a 

the system of definitions and postulations (1), (2), (3), (4)', and I, II, HI', 
which include the condition that F (x) is a continuous function, will be 
termed a DK Y -integral, or Denjoy-Khintchine-Young integral. It is clear 
that a D- integral is also a DK Y -integral, but that the converse is not true. 

It is shewn, as in § 472, that the sum theorem holds* also for the 
DK 7-integral: 

If / (1) ( x )>f l2) ( x ) hxive DKY-integrals in (a, b), then f {l) (x) 4 - / (2) (x) has 
also a DK Y-integral in the interval , and 

f {f a) ( x ) +/ (2) ( x )) = f/ (1> (x) dx + f / <2) (x) dx. 

J a -'a .a 

The proof given in § 466 is applicable to shew that the set of those 
points of any given perfect set P, each of which is either a point of non- 
summability with respect to P, of the function / (x), that is integrable 
(DKY), or else a point of non-convergence of S | F (fi n ) — F ( a n ) | taken 
for the intervals (a n , /? n ), contiguous to P, is non-dense in P. 

The theorem that the Z>-integral F (x) of a function / (x), that is 
integrable (Z>), has almost everywhere a differential coefficient, equal to 
f (x) } does not hold in general for the PA 7-integral. The corresponding 
theorem for the latter integral has been formulated by Denjoyf, who has 
introduced for the purpose the conception of an approximate differential 
coefficient; thus: 

A continuous function F (x) possesses , at a point x 0 , an approximate dif¬ 
ferential coefficient , finite, and equal to F' (x 0 ), if the set of points x for which 

I F(a:) ~ F{Xo) - F' (««,) I < «, 

I X - Xq 

* This is contrary to a statement of W. H. Young, foe. cit. pp. 208 and 211. 
f Annates sc. de VdcoU normals (3), vol. xxxm (1916), p. 170. 
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has the metric density unity at the point x 0 , whatever value the positive number 
€ may have . 

The property of the indefinite DKY- integral is then formulated as 
follows*: 

If F (x) be the integral j f (a) dx , of a function f (x) which is integrable 
J a 

(DKY) in the interval (a, b), F (&) has at almost every point of (a, b) } an 
approximate differential coefficient, or else an ordinary differential coefficient , 
equal to f (#). 

The necessary and sufficient conditions have been formulated by 
Khintchine (loc. cit.), that the indefinite DK 7-integral F (x) should have 
almost everywhere a differential coefficient, equal to f (x). 

THE YOUNG INTEGRAL 

479. The most general definition of a non-absolutely convergent inte¬ 
gral has been givenf by W. H. Young. His definition is obtained by 
relaxing the conditions from which it is inferred that the DK 7-integral 
is continuous. Accordingly, the Young integral, or 7-integral, is not 
necessarily a continuous function of its upper limit. When it is in fact 
continuous, it is then a DK 7-integral. When a is a limit of points of H 
on its right, and p is a limit of points of H on its left, the definition (3), 
and the assumption II, of § 464, are no longer taken to hold for all sets 
of intervals (a n , fi n ) such that a n ~ a, p n ~ ft, but only when a n and p n 
are restricted, for all values of n , to be points of H , the set of points of 
non-summability of the function / (a;). 

Thus the postulation II is replaced by the condition 

[ / (z) dx = riiin] [ / (x) dx, 

J a a n ~a,p n ~p J an 

where the square bracket is taken to indicate that the limit does not 
necessarily exist unless all the points a n , belong to the set H. When 
a is not a limiting point of a part of H on its right, the points a n will be 
unrestricted, as in the definition of the DK 7-integral, and a similar remark 
holds as regards p. When neither a nor p is such a limiting point, the 
bracket may be removed from the limit, as it is then unrestricted. 

An effect of the introduction of this restriction, as regards the mode 
of approach of the end-points of intervals to their limits, is that 

F (x) = j X f (x) dx 

is not necessarily continuous at a point x which belongs to the set H. 
However, at such a point, F (a;) always belongs to the aggregate of limits 
oiF(x + h), or of F (x — h), as h ~ 0. 

* Annates sc. de Vdcolc normale (3), vol. xxxiv (1917), p. 184. 
t Proc. Lond. Math. Soc. (2), vol. xvi (1916), p. 176. 
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In order that F (a:) may be continuous in the whole interval (a, b), it 
is clearly necessary that the restricted limits and the unrestricted limits 
should in every case give the same result. Thus the Y-integral, when it 
is continuous, is necessarily a DK Y-integral, but of course not necessarily 
a D-integral. 

480. It will be shewn that the function F (x) constructed, as in the 
case of the DK Y-integral, or the D-integral, with the continuity condition 
modified in the manner explained above, has the following property: 

In any closed interval (a, ft), contained in (a, b), F (x) assumes all values 
between its upjper and lower boundaries in the interval (a, ft). 

Let (y, 8) be an interval contained in (a, ft), and such as to contain a 
sequence of intervals (y w , 8 n ), for which y n ~ y, and 8 n ~ 8, and such that 
in each of the closed intervals (y n , 8 n ) the condition is satisfied that F (x) 
takes any value between its upper and lower boundaries in that interval. 

Also let it be assumed that F (y), F (8) are respectively the limits of 
F (y n ), F (8 W ). Let Jc be any number between the upper and lower 
boundaries of F (x) in the closed interval (y, 8). If n be sufficiently great, 
k is between the upper and lower boundaries of F (x) in the interval 
(y„, K), and is therefore the value of F (x) at some point in that interval, 
and therefore in (y, 8). Thus the theorem holds for the interval (y, 8). 

Commencing with the fact that the theorem holds for any interval 
(y, 8) in which / (x) is integrable (L), and following out the various stages 
of construction of F (x) for the interval (a, ft), as in § 465, we see, by con¬ 
tinually applying the method of passing to the limit, that the theorem 
can be established for the interval (a, ft). 


481. The following extension of the theorem of § 474 will now be 
established: 


Iff (x) have a Y-integral F (x), in (a, b), which is bounded in the interval , 
and <f> (x) be a bounded monotone function defined in the same interval , then 
f (x) <f> (#) is also integrable (Y) in (a, 6), and 


[*/(*)*(*) 

J a 


dx 


F (x) <f> (x) 


- [ F (x) d<f> (x). 

J a 


It will, in the first place, be shewn that, if F (x) be bounded in ( a, ft ), 

[Pn 

and such that F (x) d<j> ( x ) exists, in accordance with the definition of 

' a n 

§ 445, for each of the intervals ( a n , ft n ), of a sequence of intervals contained 
in (a, ft), such that a n ~ a, ft n ~ ft, then j* F (x) d<f> {x) also exists. 


Let £ = <f> (x), and suppose x (£) to be the function defined as in § 445, 
for the interval (<f> (a), <f> (ft)) of £. By hypothesis, x (€) is summable in 
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each interval (</> (a w ), cf> (ft n )), of f. Let E (A, n) denote the set of points 
of this interval for which y (£) > where' A is an arbitrarily chosen 
number; then the set E (A, n) is measurable. It then follows that the 
set E (A ), of all points £, of the interval (</> (a -|- 0), <j> (ft — 0)), at which 
X (f) > A, is measurable, since it is the outer limiting set of the sequence 
of sets E (A , n), as n is increased indefinitely. It follows that y (f) is 
measurable in the interval (</> (a + 0), <f> (ft — 0)); and since y (£) is bounded, 
it is summable in the interval. Moreover, by a known property of the 
L-integral, we have 

/*«/> (/3 - 0) f<t>(Pn) 

X (f) - lirn X (£) 

J4 >(a + 0) rj-QO Jt(a„) 

Since x (£) has the value F (a) in the interval (<f> (a), cf> (« -f- 0)), and 
the value F (ft) in the interval (</> (ft — 0), </> (/?)), we have 
r<t>(P) 

X(0 = F («) i<f> («-1- 0) - <f> («)} + F 08) {4> 08) -Hi b- o)} 

J * (a) 

pHPn) 

+ ] jm x(£) d t, 

n~<x> J <t> (a„) 
fP 

therefore F (x) dxf) (x) exists, and has the value 

a 

F (a) {</> (a + 0) - </> (a)} + F (?) {<f, (?) - <f> (? - 0)} + lim f" F (x) d<j> (x). 

7l~oo J a„ 


Next, it will be shewn that, if the theorem 
i '". . 


f / ( x ) <t> ( x ) dx = \f (x) <t> (x) * - f J 1 (x) (x) 

J o« L J«n •'«« 

holds for each of the intervals (« n , /?„), it also holds for the interval (a, ft), 

provided F {a n ) ~ jF (a), (£„) — (/?). 

We have 


lim 

n~oo 


(*) W 

it follows that 


0n 


F(x)<f,(x) 


-F(?) {</> (?)- + (?-0)} 

- ^ («) {<£ (a + 0) - <£ (a)}; 


rj’(x)^(x)] P - f ^(x)d<£ (x)-lim j IV (x) <£(*)] -f F (x)d<f> (x)j. 
L Ja Ja n~ao (. L J n n J <x n ) 

'Pn 


rp [Pn 

Since / (x) <f> (x) dx is defined to be lim I / (x) <f> (x) dx, it being 
J a n~co J a n 

assumed that/(x) <f> (x) is integrable (Y) in each of the intervals (a n) jS n ), 
f* 


it follows that, if / (x) <f> (x) dx exists, and is equal to 

•' an 

[■F (x)<f>(x)J[~ £>(x) d<j> (x) 

for every value of n , then f / (x) (f> (x) dx exists, and is equal to 

J a 

|V (x) <f> (x)J a ~ [*F (x) d<f> (x). 
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It must now be shewn that the condition III' is satisfied by the integrals 
of / (x) <f> (x), when it is assumed to be satisfied for the integrals of / (x). 

If (a n , fi n ) be an interval, contiguous to the perfect set P, in which 
the integral / ( x ) <f> (x) has already been constructed, we have 

'*0n r “| p n rp n 

I f(x)<f> ( x) dx= F(x)<f>{x)\ - I F (x) d<j> (*) 

= {F (p n ) - F («„)} </> (j8.) 4 F («„) {4 (p n ) - <f> («„)> - f> (*) d<f> (x). 

• ail 

If F denote the upper boundary of F (x) in all the intervals (a w , fi n ) 
contiguous to P, and is the upper boundary of <f> (x) in (a, 6), we have 

r / (X) cf>(x)dx <4>\F (P n ) - F (a n ) | + 2 F (&) - <f> (a n )}, 

J a n 

it being assumed that <f> (x) is non-decreasing. 

on 

It follows that, if 2 | F (f3 n ) — F (a n ) | is convergent, so also is 

n “1 

£ I f / (x) <j> (x) dx . 

n - 1 I a„ 

It is thence seen that, if the set of points of a perfect set which are 
points of divergence of the sum of the absolute values of the integrals of 
/ (x) taken over the intervals contiguous to the perfect set, is non-dense 
in the perfect set, the same condition holds as regards the absolute values 
of the integrals of f (x) <j> (x). For, if P be the part of the perfect set 
contained in any interval (a, f3), such that the sum of the absolute values 
of the integrals of / (x) over the intervals contained in (a, /?), contiguous 
to P, is convergent, it has been shewn that the sum of the integrals of 
/ (x) <f> (x) is absolutely convergent. Thus the condition III' is satisfied 
for the function / (x) <f> (x). 

Lastly, it must be shewn that, if / (x) be summable in a perfect set P 
contained in an interval (a, /?), then 

F (x) 4> (x)j 3 - F (x) d<f> (x) = JS | [jP (x) 4>(x)J - J 0 " F (x) d<j> (x)| 

+ f f(x)4>(x)dx; 
hr) 

the summation being taken for all the intervals (a n , p n ) contiguous to P, 
it being assumed that the sum on the right-hand side is absolutely con¬ 
vergent. The proof of the corresponding theorem for the Z>-integral, given 
in § 474, is applicable to the present case. 

We are now in a position to carry out all the stages of the construction 
of the F-integral of / (x) <f> (x), corresponding to the various stages in the 
construction of the 7-integral of /(x). It has been shewn that the con- 
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struction in the former case can always be carried out when it can be 
carried out in the latter case, and that at each stage the relation 

J* f (X) 4> (x) dx = ^ (x) <f> (z)J -J F (x) d<f> (x) 

holds good. The theorem is therefore completely established. 

482. It has been shewn that F ( x ) assumes, in any interval, all the 
values between its upper and lower boundaries in that interval. Thus if 
U, L be the upper and lower boundaries of F (x) in (a, b), we have 

L {<£ (b) - cf> (a)} * [ F (x) d<f> (x) £ U {<f> (b) - </> (a)}; 

. • 

it follows that 

f F (x) d+ (x) = F (x) {<f> (b) - </> (a)}, 

a 

where x is some number such that a & x & b. 

It now follows that 

[ / (x) <f> (x) dx — cf> (a) f f (x) dx + <f> (6) f f (x) dx , 

J a J a J x 

which is the second mean value theorem for the function / (x), which is 
integrable (Y). The more general form of the theorem may be deduced, 
as in § 422. 

It has been assumed throughout that F (x) is bounded in the interval 
(a, 6), and it is therefore subject to this condition that the existence of 

[ f (x) cf> (x) dx, 

J a 

as given by the formula for integration by parts, has been established. 

It may however happen that, in the final stage of the construction of 
the Y -integral of a function / (x), infinite discontinuities of the function 
F (x) appear. Thus there exist Y -integrals which possess points of infinite 
discontinuity, and are thus unbounded. 

It can be seen that the thoorem 

( 6 {fi (z) + h (*)} dx= [ f 1 (x)dx+ ( f 2 {x) dx 

J a J a J a 

does not necessarily hold for F-integrals, unless both the integrals on the 
right-hand side are DK T-integrals. If, for example, the set H, for f x (x), 
consists of the sequence c lf Cg,... c n , ... converging to zero, and the corre¬ 
sponding set for/ 2 (x) consists of a different sequence c/, c 2 ', ... c n \ ... also 

converging to zero, [ f x (x) dx is defined as lim [ f x (x) dx , and [ f 2 (x) dx 

is defined as lim / f t (x) dx. But / {h (x) + f 2 (x)} dx would be defined 
n~ooJ c% J 0 
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as the limit of f {f x (x) + f 2 (x)} dx, where c has values belonging to both 
the sequences {c ri }, {c n '}. Since it is not necessarily the case that I / 2 ( x ) dx 

J Cn 

converges to ( x ) dx, or that (x) dx converges to / / 2 (a;) dx, it 

JO Jc n JO 

follows that the sum theorem does not necessarily hold for Y-integrals 

which do not belong to the class of DK Y-integrals. 


EXAMPLES 

12 1 

1*. Let / (x) = 2x sin — - cos , for 0 < x s a, and/ (0) — 0. We have 

for 0 £=: x ^ a. j \ f (x) \ dx increases indefinitely as e <— 0, and therefore / (a;) is not sum- 
mable in the interval (0, a). But j f (x) dx ---■ jjc* sin ~ 2 J ; and thus J f (x)c 


x)dx exists as 


a D-integral, which is also an ////-integral. 
2f. Let 


( ) dx [_ '* (/S - a) 2 (X - a) m "(/3 


- *) m J’ 


in the interval (a, p), where m^2. The function <f> a p (*) is not summable in the neigh¬ 
bourhoods of the points a, p; but j* <f > a 0 (a:) dx exists as a D-integral, which is also an 

HL- integral, and it has the value zero. Let II be any non-dense closed set of points in the 
interval (a, 6); and let / (x) have the value zero at each point of H , and in each interval 

70 

( a, p) contiguous to H , let it have the value 0 a /5 (z). The integral J <f>(x)dx exists, and has 
the value zero, in each of the intervals (a, p); also the fluctuation of j f{x)dx in (a, p) is 

< J (P — a)*. It follows that the sum of the fluctuations of j f {x) dx in all the intervals 

is absolutely convergent. It is now easily seen that, if Q be the nucleus of II, the integral 

/ ‘ft* 

f(x) dx exists, and has the value zero, in every interval (a n , j S n ) contiguous to O; and 

<*i« 

moreover the sum of the fluctuations of the integrals / f (x)dx, in the intervals (a n , p n ), 

J an 

is absolutely convergent. Therefore / (x) has a D-integral, which has the value zero in any 
interval of which the end-points belong to H. 

3J. Let Q be a perfect set, of measure zero, in the interval (a, b). In any interval (a*, p n ), 
contiguous to O, let f (x) = 1 /(P n - a w ), for a n < x < £ (a n + p n ), and/(: x ) = - 1 /(ft, - «„), 
for i{*n+ Pn) ^ z < fin* an< ^ / ( x ) ^ ave set °* va l uefi over 

[Pn fi(.<*n+Pn) 

We have / / (x) dx = 0 , for all values of n; also / / (x) dx — £. It thus appears 

J an J <*n 

that / (x) is not integrable (D), because every point of G is a point of non-convergence of 
the sum of the fluctuations of the integral in the contiguous intervals. At the middle point 


* See Denjoy, Annales sc. de T&ole normals (3), vol. xxxiv (1917), p. 181. f Ibid. p. 208. 
X W. H. Young, Proc. Lond. Math. Soc. (2), vol. xvi (1916), p. 199. 
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of any interval (o n , /? n ), we have F (x) = \; hence, at any point of 0 which is a limiting 
point on both sides, we have 

F (; x) - 0, F (x + 0) = £, F {x + 0) = 0, F (x - 0) = £, F (x - 0) - 0. 

At a point of 0 which is an end-point of a contiguous interval, F ( x) is discontinuous ou 
the side away from the contiguous interval. The integral F (x) is accordingly a T-integral, 
which is not a DK Y -integral. 


4*. Let C denote Cantor’s perfect set, of measure zero, defined in the interval (0, 1) 


(see § 83, Ex. 1). In an interval (a n , 0 n ), contiguous to C , let f {x) = —--, for 

rn \Pn “ a n) 

a n <x< l (a„ + fa, and / (x) = - -----3 -for i (a„ + fa ifi x < fa where m is such 

m \Pn a n> 

that (J) m = p n — a n . Over the set 0, / {x) may have arbitrarily assigned values. 

/ Pn 

f (x)dx = 0, we have F (x) = 0, at all points of C; and in (a n , £ n ), the maxi- 

rx 1 

mum value of / f (x) dx is . 

J a n 

The values of F ( x + 0), F (x — 0), at a point x , of C, are the limits of the numbers , 


for the contiguous intervals in the neighbourhoods on the right and left of the point; and 
these limits are both zero, since m increases indefinitely with n; therefore F (x) is a con¬ 
tinuous function. The integral is not a D-integral; for, if x be any point of C, there are 
contained in any neighbourhood of x> contiguous intervals for which m has all values which 
exceed some fixed value; and thus 2 W n diverges at every such point x . To see this, we 
observe that an integer r can be so determined that the distance of x from the nearer end¬ 
point of a given neighbourhood d, of x, is > 3 -r ; and thus d will contain two consecutive 
divisions, when the whole interval (0, 1) is divided into 3 r+1 equal parts. We may suppose 
the intervals (« n , 0 n ) to be arranged in descending order of their lengths, and such that 
those which are of equal length are arranged in their order from left to right. The sub-set 
of C in any of the 3 r+1 equal parts is similar to C , so that the values of m, for those con¬ 
tiguous intervals that are interior to d , will contain all integers from and after some fixed one. 


Every point of Q is a point of convergence of the sum of the absolute values of the 
integrals of / (x) taken over all the contiguous intervals (a n , 0 n ), since all these integrals 
have the value zero. It follows that F (x) is a DK y-integral. 


5f. Let / (x) be the function defined in Ex. 4, and, denoting i (a n + p n ) by y n , let <f> (x) 
be defined in each interval (y n , 0„) by the same rule as that by which / (x) was defined in 
the interval (0, 1); and let <f> (x) = 0, at all points not in an interval (y„, p n ). The points 
of non-summability of <f> (x) form the set C, which consists of (7, together with a set similar 
to C in each of the intervals (y n , /9 n ). It is clear that the set C is non-dense in the set O. 
The points of non-summability of the function / (#) + <f> {x) are the same as those of <j> (x ); 
that is they form the set C. 


We have j y " {/ ( x) -f <t> (a:)} dx = f{x) dx = and if (a m , P n8 ) denote those intervals 


contiguous to C that are in (y w , j3 n ), we have 


f Pnt 

J a n » 


{/(*) + <Hx)}dx 


f$n. 

J ant 


f ( x ) dx ■■ 


s) 


(Pn s __ 

m(p n -a n ) 2 m.V* 


where r depends on <9, as m does on n. 


* See W. H. Young, loc. cit. p. 199. 

•f See W. H. Young, loc . cit. p. 209. The result there given is not in accordance with that 
in the text; it is there stated that the series of integrals diverges in the neighbourhood of every 
point of ?7, and thus that / (x) +<f> (x) has no DK 7-integral. 
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Those points of 0 that belong to C are points of divergence of the sum of the absolute 
values of the integrals of / {x) 4- <f> ( x ), taken over the contiguous intervals (a*, y w ), (a*,, 0*,), 
because, in any interval 8 which encloses such a point, there is an indefinitely great number 
of the intervals (a n , y n ), and the sum of the series of terms 1/2 m, taken for these intervals, 
diverges. This is however not the case for those points of C which do not belong to C, 
because such a point is interior to an interval (y n , /?„), or is at the end y n of such an interval, 

and the series 2 I [*"* {f(x) -f <f> (#)} dx I is equal to - - — 2 (j5 na — a,^), which does 

I J am I w ( Pn~ a n> 

not exceed „ . The set C being non-dense in (7, it now follows in accordance with the 
2m 

theorem of § 478, that the integral / {} {x) + <f> (a:)} dx exists as a DK 7-integral, and is 

/o' 

equal to / f(z)dx + <f> (z) dx , which has the value zero. 

Jo J o 

6 *. Let O be a perfect set of content zero, in the interval (a, b), and let 
f(x) = »/(&„ - a „), for a n Z x < J (a„ + b n ), 

/(*)-- n l(K - a n )’ for 4 (°n + b n , 

where the intervals ( a n , b n ) are all contiguous to O. The function / ( z) may be defined arbi¬ 
trarily in those points of O which are limiting points on both sides. The function F (z) is 
zero at all the points of G , and it is continuous in each of the closed intervals (a n , b n ). The 
value of F ( x) at the point £ ( a n + b n ) is Jn. Thus at each point of G , F (z) has an infinite 
discontinuity; at a point z which is a limiting point of G on both sides, we have 
F (z -f- 0) «* + °o, F (z + 0) = 0, F (x — 0) = 0, F {x — 0) — — «>. 

Thus F (,x ) is a 7-integral, with a set of points of infinite discontinuity. 


See W. H. Young, loc. cit. p. 205. 




CORRECTIONS AND ADDITIONS TO VOLUME II (1926) 


Page 8. Ex. (7). This example should be stated as follows: If {/J n } denote a monotone 
sequence of decreasing numbers which converges to 0, and {a n } be a sequence 
which converges to 0, prove, by a method similar to that employed in the 


proof of the general theorem, that, if lim —^ n+1 exists, then lim exists, 

n~oo Pn ~ Pn4l n~ooPn 

and has the same value. 

Page 249. A proof of the following more general theorem has been given by the author 
(Journal of Lond. Math. Soc. vol. I (1926), p. 211): 

Let E be a measurable set of points (x), in any number of dimensions , and of 
either finite or infinite measure , and let {f n (x)} be a sequence of functions defined 
in E, and such that, for every value of n, | f n (a;) | fe is summable over E, where k is 
some number greater than zero. If the condition 

lim [ I fn (*) - /«' (*) I* dx = 0 

n~oo, n'~<x>J (E) 

be satisfied , then there exists a function f (x), defined at almost all points of E y 
such that | f (x) | fe is summable over E y and satisfies the conditions 


lim f | / (*) - /„ (*) I* dx =■ 0, 
n~°oJ (E) 

f |/(z)|*<£r = lim f I /„ (x) |* tit. 

J (/i) n~xj (E) 

Page 290. It has been tacitly assumed here that the function s (x) and the sets of points 
e n are measurable. In order to justify these assumptions, we observe that, in 
accordance with I, § 400, s(x) is a measurable function, and consequently (see 
I, §384) the functions s(x) —s n+m (x) are also measurable; hence the set of 
points at which | s (x) — 8 n + m (x) | < e is measurable, for each value of n + m. 
The set e w is the set of points common to all the sets for which 

|*(*)-*n(*)|S<, | 8 ( x ) ~ 5 n41 (■*■) I — r > ••• ... 

are measurable. In eiccordance with the second^theorem in I, § 131, the set e n is 
measurable. 

Page 323. It is here assumed that the sets e h , e A are measurable. It has however been 
pointed out to the author by R. L. Jeffery, of Acadia University, Nova Scotia, 
that / ( x y y) can be so defined that this is not the case. For example, let E 
denote the set of all points of the interval (0, 1), and let L denote a non- 
measurable linear set of points, to which the point 0 may be taken to belong, 
which is non-measurable in each interval (0, h). Such a set has been constructed 
by Van Vleck (Trans. Amer. Math. Soc. y vol. ix (1908), p. 237). Let/(x, y) be 
defined in the rectangle (0, 0; 1, 1) to have the value 0 at every point in the 
rectangle, except those points on the diagonal (x y x) which have as their pro¬ 
jections on the side (0, 1) points of C (L) y at which / (x y x) = 1. For all points 
x on (0, 1), we have lim / (x y y) = 0; moreover, / (x y y) is measurable for each 

v~0 

value of y. The set e h consists of the interval (h -f 0, 1) together with the points 
of A in the interval (0, h ); thus e h is not measurable in the interval (0, 1). 

Whenever this possibility is realized it is necessary in § 225 to restrict h to 
have the values in an enumerable sequence {h n } which converges to zero, tie 
values of y being also restricted to have the enumerable Bet of values of y 0 -f h n . 
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Corrections and Additions to Volume II 


Page 444. 


lim/*'2 
71~coJ 0 
Page 626. 


The sets e hl , e hz , ... are then measurable, as has been shewn in the last note. 
Since then the theorem holds for all such sequences { h n }, it holds in accordance 
with Heine’s definition of the limit, and consequently in accordance with the 
definition of Cauchy, provided the equivalence of the two definitions be 
assumed (see I, § 211 ); this assumption requires the employment of the 
multiplicative axiom. The case of the sets e A can be treated similarly. The 
argument of the preceding note is not applicable to shew that the set of points 
common to an unenumerable family of measurable sots is measurable. 

Line 1. For y-fh read y Q + h. 

Line 7. For lim m {K ~ e h ) read lirri m {K — e h ). 

o 

Line 11. For \ f (£, ?/ 0 -f- h n ) - f (x, ?/ 0 -f 0) | read | / (£, y 0 4 h n ) - f (€, y 0 f 0) |. 
Line 12. For f (x, y (} -f 0) read f (£, y 0 -f 0). 

Insert, after line 4, the following: 

Further , j M F (x\ x, n) dx converges to zero , as n ~ uniformly for all 

fp 

those values of x (in Cl) that are :* > a 4- /x, and / F (x\ x, n) dx' converges to 

J Jr + M 

zero , uniformly for all values of x (in G) < fi ~ y. 

The following theorem, which is an immediate corollary of Theorem I, is 
usually sufficient for application: 

If (x', x, n) has the value F (x\ x, n) for all values of x such that | (x' -- x) | £ 
and is zero whenever | x' - x | < y, and if <1 >(x\x,n) so defined , satisfies the 
conditions ( 1 ) and (2), of Theorem /, then 

fx-ii fb 

/ F (x\ x, n) dx' 4 - / F (x\ x, w) dx' 

Ja J x- f/u. 

converges to zero , uniformly for all values of x (in G). 

Line 2. For “the integral is” read “the integral differs from 

4 f*f\(I, «) dl by.” 
a n J “n 

Line 6 . For N (a„) read A' (a„) 

Line 7 from the foot. For o n V ^ f X y-[ read a V* + Xi(“n)' 

a n ( t \ a ” l ' j 

Page 558. Lino 13 from the foot. For 

fin n -1 ^ _ P \ 

! [i t cos 2 1 4 cos 4 1 -f- ... 4 - cos 2 «/] dl read lim / 2 ( - 4 - 2 - cos 2 rt) dt . 

n~ccJo \2 r ,„i n ) 


Page 453. 


x j'J} read , 


The theorem in § 409 has been extended by Kolmogoroff and Seliverstoff to 
the case e = 0 (Atti dei Lined , Rendiconti (6), vol. II (1926), p. 307). Thus, if 

2 (a n 2 } h n 2 ) log n is convergent, then 2 (a n cos nx + b n sin nx) converges almost 
n ~ 2 n -1 

everywhere. An independent proof of this result has been given by Plessner 
(Crelle's Journal , vol. clv (1926), p. 15). 

Page 700. Delete lines 1-11. It is not correct that 7r / - r —r,r r—y„\ is summable 

° < (1) t {2) sin f (1) sm < (2) 

over the cell (0, 0; Jw). The two following corrections are requisite in conse¬ 
quence of this. 

§ 464, line 5. Far read 

(\ tUHW F(l w <(2 ,1 sinainm^ 

J Un sin f< 2 > < ’ *<*> { h 


is summable 


Page 705. § 464, line 5. For 


^F ((">, F* 1 ) * 


-d (J (1) , f (2) ) read 
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Page 709. Delete the last two lines, and substitute the following: 

The functions F x , F 2 can be so chosen as to be not only non-increasing, but 

/d>/(2> 


also non-negative. The function 


— can, by proper choice of A , be 
smi ( 1 ) sin« (2 > J r r 

t (\) t (2) 


, both of which are non- 


expressed as the difference of A and A - — 7 T t—-, 75 , , 

y sm/ (1) sin/ (2) 

negative and non-increasing. The function yjr (t (1) , t< 2) ) can be made identical 

successively with each of the four functions 

f ( 1 ) f(2) 


AF t / (2) ), 




AF 




- 00 , my 


< 00 , are 


all of which are monotone non-increasing. 

The limits of the four corresponding integrals, as m (u 

\tAAF 1 ( 4 0 , 4 - 0),}7T a (A -1) F x (\ 0, \ 0 ),} tt 2 A F 2 ( \ 0 , +0), Jtt 2 (A - 1 ) F 2 (4 0, + 0) 

respectively. 

Thus the integral 

sin m n) / (1) sin / (2) 


-[[<*’■ ,W) f 

» J («. 0 ) 


sin / (1) 


sin /< 2 > 




converges to Jf’ (4- 0, 4- 0), as w (1) ~ », m (2) ~ 00 . 

Page 768. The first theorem in § 495 holds good, as asserted by Menclioff, when 2 > A > 0. 

It is correct that the condition 2 > A > $ is necessary in order that the first 
theorem in § 495 may be deductible from the second. But, for the second theorem, 
the following more general one may be stated; it is proved on pp. 769, 770. 
From this more general theorem the more general form of the first can be 
deduced: 


00 1 

If w (u) is a positive increasing function , such that the series 2 is con - 

vergenty and if a n m o (1), the convergence of the series 

„?,[•(lo g, o g i «„ l )J(lo f; faj) 2ff » 2 ’. (A) 

entails the convergence , almost everywhere in the interval (0, 1), of the series 

2 *»*„(*)• 
n ~1 


For, let o> ( u) ---= w 2 then 2 —is convergent; also from the convergence 
P ~1*(P) 


of | a n 12 - a, we have a n = o (1). 
We have also 

[to (log log w)] 2 (log u)' 


lim 
u~-a O 


U K 


■ZL « lim 

U~QO 


(log log u)^ (log u) 2 


= 0, 


for A > 0. Hence, for all sufficiently large values of m, 

[“ ( log log ra )T ( log \ki )'" c t£t« 

where C is a positive constant; thus, when A > 0 

[" ( log log vb)l ( log i i J v<0|aJ2 " 

Therefore the convergence of the series 2 | a n [ 2 — a entails the convergence of 
the series (A), and consequently the convergence almost everywhere, of the 
series 2 a n <f> n (x). 
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objectives. In addition, direct application of such topics as Riemann, 
Lebesgue and Fourier integrals, trigonometric and power series, and 
functions as limits of integrals, can be made to many fields including 
hydrodynamics, aerodynamics, atomic physics, stellar mechanics and 
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proofs, it has become a standard reference work for mathematicians 
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